How to look at data

Each data type can be seen as either a snapshot in time or a series of events. With that in mind, I tried grouping them in pairs so you’ll be able to see that any type of data can be broken down like that.

Often systems that don’t care about temporal information are easier to develop and maintain. However, sometimes you just can’t make do without encoding temporal information in some form.

Usually when building an AI you’d try to start with a version that doesn’t care about time. Then use that as a building block to develop a more complex system.

1. Images and Videos

Videos are just a sequence of images that play in quick succession which give the illusion of movement (similar to a flip book). Thus we can say that an image is the state of the world frozen in time.

There are a bunch of applications where we can make do only using a single image (face recognition, object detection). However, without some trickery or fancy math, you won’t be able to figure out how an object moves (left to right, coming closer or further from the camera). To infer movement or distance you’ll need two or more frames in succession.

Tip

You can first train an Image Encoder - that would extract features from images. Then we can use the image embedding with a recurrent network to support videos.

If this example doesn’t make full sense to you, that’s ok, we’ll go into more details in further blog posts

Here are a few applications for image and video AIs

Applications - Image AI:

Use-caseExample
security camerassome casinos have face detecting software for bad clients. Also in China there surveillance systems that can identify you even in crowded areas
sorting facilitiesaround Europe you have machines that collect bottles and cans for recycling. These objects have a QR code or symbol that tells the system if the material is recyclable or not
medical imagingthere are pre-screening software that can interpret X-Ray scans with higher accuracy and help speed up the diagnosis process
optical character recognition (OCR)extracting text from images. It’s useful for digitising books or documents
automotivelane detection, traffic sign recognition, weather conditions, driver drowsiness detection

Applications - Video AI:

Use-caseExample
advanced securityidentifying shoplifting by analysing human behaviour
sportsthey sometimes do object tracking (for balls or players) in order to pan / zoom cameras during live events
motion trackingin the fitness industry they sometimes use AI for repetition counting

2. Text and Text generation

From checking whether a tweet you wrote is racist to talking to ChatGPT, these are the kinds of things NLP (Natural Language Processing) focuses on.

Text AIs usually take some chunk of text as input and output a label or a score. Like: “is this spam?”, “what’s the sentiment?”, “what topic is this?”, “is this toxic?”

Text-generation AIs do the opposite. They take some prompt / context and output new text. Like: “finish this sentence”, “write me an email”, “summarise this”, “translate this”, “answer this question”

Here are a few applications for text and text-generation AIs

Applications - Text AI:

Use-caseExample
spam filteringdeciding whether an email is spam or not
topic classificationauto-tagging news articles or support tickets so they get routed to the right place
toxicity / moderationflagging hateful content or harassment in comments
information extractionpulling out names, dates, prices, or addresses from messy text

Applications - Text-generation AI:

Use-caseExample
chatbots / assistantsanswering questions using a knowledge base or company docs
summarisationturning a long article / meeting transcript into a short summary
translationconverting text between languages while preserving meaning
code generationhelping write / refactor code from a prompt

3. Tabular data and Time series

Tabular data is what you get in spreadsheets: each row is an example, each column is a feature. Usually the rows don’t have an order, you can shuffle them around and nothing changes.

Time series is tabular data where the order does matter. Each row is a snapshot at a specific time (or a time window), so if you shuffle the rows you lose information.

Tabular AIs usually output a number or a category. Like: “will this customer churn?”, “is this transaction fraud?”, “what’s the price of this house?”

Time-series AIs usually output a forecast, a warning, or an anomaly score. Like: “what will sales look like next week?”, “is this sensor acting weird?”, “are we about to run out of stock?”

Here are a few applications for tabular and time-series AIs

Applications - Tabular AI:

Use-caseExample
credit scoringpredicting whether someone is likely to repay a loan
fraud detectioncatching suspicious transactions based on patterns in the metadata
churn predictionestimating whether a user will cancel their subscription
pricingpredicting house prices, insurance premiums, or ad bids
recommendation rankingdeciding what items to show higher in a feed (based on user/item features)

Applications - Time-series AI:

Use-caseExample
forecastingpredicting demand, revenue, website traffic, or energy usage
anomaly detectionspotting weird behaviour in servers, sensors, or payments
predictive maintenanceguessing when a machine is about to fail based on vibration / temperature logs
algorithmic tradingusing market history to trigger buys/sells (with lots of caveats)
inventory / supply chainpredicting stockouts and lead times so you can plan ahead

If you’re still deciding how to attack the problem once you know the data shape, Picking the right AI is the checklist I use.