Understanding Unstructured Data with Language Models

ODSC Europe 2018 · (80 minutes)

It's estimated that up to 90% of the data which organizations hold is unstructured, living in the form of documents, emails, reviews, reports, chat logs and so on. This talk hopes to cast a new light on this often overlooked source of insight with a deep dive into language models: one of the most powerful tools at our disposal for understanding unstructured data. The talk spans the history of language models, from the techniques used break Nazi codes at Bletchley Park to the emerging universal language models of 2018.

Slides (PDF)

Ruby-us Hagrid: Writing Harry Potter with Ruby

Brighton Ruby 2018 · (25 minutes)

This is a talk based around a big idea: using Ruby to write a brand new Harry Potter story, completely automatically. Whether we’re embarking on this journey for self-enlightenment, or to grab a slice of J. K.’s billions, along the way we’ll see first-hand the power and elegance of Ruby for tackling problems in the field of natural language processing.

Read more Slides (PDF)

The Magic of Dimensionality Reduction

ODSC Europe 2017 · (80 minutes)

Dimensionality reduction is one of the most crucial tools in a data scientists’ toolbox, and modern tools can yield truly magical results. This talks looks at how we can take complex, messy, real-world datasets and simplify them, in order to create beautiful visualizations, to better understand the relationships between our data points, and to discover some surprising insights: including how dimensionality reduction could help you find love...

📹 Watch video Slides (PDF)

More talks

These ones just have slides for now. Email me if you'd be interested in a recording of any of these talks.

This site is open-source (more info) - here's the source code for this page.