From grids to areas

Imago is the Imagery Data Service for sustainability, prosperity and wellbeing. At Imago, our mission is to make (satellite) imagery more useful, useable, and used.

Pixel by pixel, clouds by cloud

Satellite data looks like a map—but it definitely isn’t one yet.

Before it becomes ready for analysis, it passes through a long chain of processing steps. What satellites actually beam down are raw, multi-band images: massive files full of noise, distortions, clouds, and values that don’t mean little until we transform them.

Types of Satellite Data: A Quick Overview

Satellite data is collected:

by sensors on the satellites that are sensitive to specific ranges of wavelengths of light (the fancy term for this is spectral bands)
at a specific time and from a specific geographic area (the fancy term for this is spatiotemporal)
at a particular resolution, meaning that one pixel corresponds to some geographic area with units like “meters per pixel” (images have a total size that is often referred to as a scene size or frame)
with a particular frequency known as the refresh rate, revisit time, or repeat cycle (the rate at which all geographic areas the satellite observes are re-visited)

Satellite data comes in several forms, each capturing the Earth differently and serving different analytical purposes. The two most commonly discussed types are:

Spectral (Optical) Data

Optical sensors record reflected sunlight across multiple wavelengths—visible, near-infrared, shortwave infrared, and beyond. When these sensors collect more than three wavelengths simultaneously, the data is called multispectral; when they capture dozens to hundreds of wavelengths, it becomes hyperspectral. These richer spectral profiles allow us to distinguish vegetation types, water quality, soil characteristics, burn severity, and more.

Optical data is especially useful for:

Land-cover classification
Vegetation indices
Water detection and turbidity
General “photo-like” interpretation

However, optical data is limited by daylight and cloud cover, which can block the sensor’s view.

SAR (Synthetic Aperture Radar)

SAR sensors use microwave energy rather than sunlight, meaning they can operate through clouds, through smoke, and at night. Instead of capturing reflected light, SAR measures the interaction between microwave pulses and surface structure, generating detailed information on texture, moisture, and change.

SAR is particularly valuable for: • Flood detection and inundation mapping • Soil moisture estimation • Ice and glacier movement • Forest structure and deforestation monitoring • Urban infrastructure mapping

SAR data is extremely powerful but also more complex to process and interpret due to noise, speckle, and geometric distortions.

From Orbit to Neighborhood-Level Insights: How Satellite Data Becomes Useful

Turning satellite imagery into meaningful metrics is a multi-step journey. What starts as raw data captured hundreds of kilometers above Earth eventually becomes the neighborhood-scale information we use in research, planning, and policy. The transformation looks roughly like this. Each step removes errors, aligns the data, and extracts meaningful information:

1. Capturing Raw Imagery

Missions like Sentinel and Landsat continuously collect raw optical, multispectral, or radar imagery. At this stage, the data is massive, unrefined, and full of distortions—far from the clean maps we’re used to seeing.

2. Atmospheric Correction & Cloud Masking

Before the data can be analyzed, it must be “cleaned.” Atmospheric correction removes the effects of haze, aerosols, and varying illumination. Cloud masking identifies and removes cloud-covered pixels (and often cloud shadows). This step is crucial—otherwise, half the world would be obscured by clouds.

3. Temporal Compositing to Reduce Noise

Because a single satellite pass might still contain noise, missing data, or residual clouds, multiple images across a time window are combined. Temporal composites—like monthly or seasonal stacks—select the clearest or most representative pixel for each location. This improves quality and consistency.

4. Applying Models to Extract Meaning

Once the cleaned, composited imagery is “ready”, it doesn’t automatically become a usable indicator. Most neighbourhood-level metrics come from models applied to the processed pixels—e.g., vegetation indices, land-cover classifiers, built-up area models, water detection algorithms, or machine-learning products trained on labeled examples.

Through these steps, solar reflections, microwave signals, and raw pixel values evolve into structured, interpretable data—metrics that can be mapped, summarized, and ultimately used to understand conditions on the ground.

Important

And once you finally have usable pixels, you’re only halfway there. To make satellite-derived information useful, usable, and actually used, by policy makers or social scientists, you need to convert pixel grids into familiar formats—vector data (.gpkg), tables, CSVs. Otherwise, your insights stay locked inside a raster.

5. Aggregate up Even the best pixel-level data isn’t immediately useful for most social, economic, or policy questions. The final step is to translate pixel grids into meaningful geographic units—neighbourhoods, districts, buffers, or any custom boundary. This requires carefully aggregating thousands of pixels that often don’t align neatly with administrative shapes. Differences in spatial extent, resolution, and boundary overlap must be resolved so that each unit receives a consistent, interpretable value.

This is the moment where satellite data becomes something that can actually be used: a table, a geopackage, or a metric attached to a place—ready for statistical analysis, comparison, or integration with survey and administrative data.

Challenges in Raw Workflow

Working directly with raw satellite data is difficult for most users:

Files are huge (literally),
Require significant compute power,
Rely on complex remote-sensing algorithms.

Even after processing, linking pixel-level information to meaningful social or administrative units is technically demanding. Pre-processed and validated products save time, reduce errors, and ensure consistent, reliable results.

Why Products Like Imago Simplify the Process

Pre-computed, ready-to-use MOSA/LSOA-level statistics.
No need to download or handle large imagery or run complex algorithms.
Preserved local detail. Clear benefit of small-area data.

Icebreaker questions:

What’s the largest dataset or file you’ve ever had to manage—and how did you handle it?
Looking at the satellite data workflow, which steps do you think are most critical for ensuring accurate results?
Where do you imagine errors or uncertainty are most likely to sneak into the data—and why?
If you could automate one part of the satellite-to-metrics pipeline, which would it be and why?