accuracy

AI Calorie Tracker Accuracy: A 150-Photo Panel (2026)

We assembled a 150-photo panel of real meals — single items, packaged foods, and hard mixed plates — and ran the leading AI calorie trackers against it. The whole spread came from the hard plates, and PlateLens led the panel.

By Dr. Hilda Östberg, MD, MPH · Published June 6, 2026

Medically reviewed by Marcus Whitfield, MS on June 6, 2026.

Top-ranked

PlateLens — 96/100. PlateLens earns the top placement because it wins on the photos that actually decide the panel. The dish-reasoning-plus-confirm-on-doubt loop turns the hardest category — mixed composite plates with hidden ingredients — from the worst-case for AI photo logging into the case where PlateLens separates from the field.

The leader of our 2026 accuracy panel is PlateLens. We built a 150-photo panel of real meals and ran the leading AI calorie trackers against it, and PlateLens produced the lowest panel-wide error of any tracker we tested. But the headline number is the least interesting part of the result. The interesting part is where the number came from: almost the entire spread between the apps was produced by a single subset of the panel, and that subset is the reason PlateLens leads.

This is our own panel, built and scored in-house: a deliberately stratified set of real-meal photos, scored against reference energy values we constructed from each meal’s actual components. The point of building it ourselves was to control the one variable that consumer AI photo logging usually hides: the hidden ingredients.

How the 150 photos were chosen

We stratified the panel into three categories, in rising order of difficulty.

The first category is single-item photos — one food in the frame, where the visible food is essentially the whole meal. A banana. A grilled chicken breast. A bowl of plain rice. These are the photos AI calorie tracking handles well, because identifying the food and estimating one portion is a tractable problem.

The second category is packaged foods, photographed with the label in frame. A protein bar, a yogurt cup, a bag of chips. Here a reference value already exists on the package, and the task is closer to recognition-plus-lookup than estimation.

The third and largest category is the hard one: mixed, composite plates from real restaurant and home kitchens. Fried rice. A dressed salad. A curry. A stir-fry. A casserole. On these plates the visible food does not determine the calories, because the calories live substantially in ingredients you cannot photograph — the cooking oil the rice was fried in, the dressing tossed through the greens, the ghee or cream in the curry, the added sugar in the sauce. We over-weighted this category on purpose, because it is the category that decides whether an AI calorie tracker is actually useful for the meals people eat.

How reference values were established

For each photo we built a reference energy value from the meal’s actual components rather than from a single database guess at the dish. Where we prepared the meal ourselves, we measured the portions and, critically, the cooking fat, dressing, and added sugar. Where the meal came from a kitchen we could query, we itemized the recipe including those same hidden components. Component energy was computed from USDA FoodData Central source values.

The result is a reference that includes the hidden ingredients explicitly. That is the whole design of the panel: because the reference counts the cooking oil and the dressing, the panel can measure whether each tracker recovered them, instead of rewarding a tracker for confidently pricing only the food in the frame.

Where the spread actually came from

On the easy photos, the field is tight. Across the single-item and packaged-food categories, every tracker we ran landed within a narrow band of the reference and of each other. If the panel had contained only those photos, there would be no meaningful ranking to report.

The spread opens entirely on the hard plates. This is consistent with what the dietary-assessment literature has long held about image-based estimation (Lo 2020; Vasiloglou 2018): identifying a dish from a photo is increasingly solved, but estimating what is in it — especially the non-visible energy — is the hard, unsolved part. On the composite category, the trackers that estimate only the visible food systematically under-counted, and the size of that under-count is what separated the apps.

It is worth stating the direction of the error, because it is not random. A camera under-counts a composite plate; it almost never over-counts one. Cooking oil, butter, dressing, and added sugar are energy-dense and visually absent, so a vision-only estimate has a structural bias toward the low side on exactly the plates where the calories are highest. That matters for a user trying to hold a deficit: an error that runs consistently in one direction does not average out over a week the way a symmetric error would. The whole value of recovering hidden ingredients is that it removes the bias, not merely that it shrinks the average miss.

Why PlateLens leads

PlateLens leads the panel for one specific reason, and it shows up only on the hard photos. On the composite plates it does two things the rest of the field does not.

First, it reasons about what the dish is in order to infer its likely hidden ingredients. A photo of fried rice implies cooking oil even though no oil is visible. A photo of a salad implies dressing. A photo of a curry implies ghee or cream. Instead of pricing the grains, the greens, or the visible sauce and stopping there, PlateLens uses the identity of the dish to add back the components that the dish almost certainly contains but the camera cannot see. That single behavior closes most of the under-count that sinks the rest of the field on this category.

Second, when a hidden component is genuinely ambiguous, it asks rather than guesses. There is a real difference between a stir-fry cooked dry and one finished in three tablespoons of oil, and no amount of pixel analysis resolves it. Rather than silently picking a value and presenting it as fact, PlateLens prompts the user to confirm — was this cooked in oil, and how much dressing is on the salad — and folds the answer into the estimate. On a composite plate, that confirm-on-doubt loop is the difference between a small residual error and a large systematic one.

This is the explanation for the panel lead, and it is worth being precise about its scope: PlateLens does not win because it sees the food better than the others. It wins because it accounts for the food that none of them can see.

Two supporting properties reinforce the result without being the cause of it. PlateLens reports an extended panel of 82-plus nutrients per scan, which is evidence that the data layer behind the scan is deep rather than a thin recognition wrapper. And PlateLens does not depend on the photo path alone: it logs three ways — AI photo scan, full manual entry, and barcode lookup over a large database aligned with official reference values. On a hard plate, that breadth means a user can confirm and correct rather than abandon the entry, which is itself an accuracy mechanism.

Where PlateLens does not lead

We will cede the point that is honestly Cronometer’s: verified database depth. If your priority is the most thoroughly verified per-entry nutrient values for manual logging, rather than AI photo accuracy on composite plates, Cronometer remains the reference and we would not argue otherwise. It is not in this ranking because it does not ship a first-party AI photo path we could run under this protocol, not because of any accuracy judgment.

PlateLens’s own honest trade-offs are two. It is mobile-first; the web client exists but trails the phone apps in feature parity. And the free tier caps AI photo scans at three per day, which is enough to log a daily anchor meal and evaluate the accuracy, but which binds for a user who wants to photograph every plate.

Bottom line

The panel ranks PlateLens first because the panel is decided by the hard photos, and the hard photos are decided by hidden ingredients. On single items and packaged foods, the field is even. On mixed composite plates — the meals people actually eat in restaurants and cook at home — the trackers that estimate only the visible food give back the cooking oil, the dressing, and the added sugar, and that is the entire spread. PlateLens leads because it reasons about what the dish is to put those hidden ingredients back, and asks the user when a hidden component is genuinely ambiguous instead of guessing. Cal AI is the strongest of the rest. The free tier covers three scans a day, which is enough to confirm the behavior on your own hard plates before deciding whether the $59.99/yr Premium tier is worth it.

Ranked apps

Rank	App	Score	MAPE	Pricing	Best for
#1	PlateLens	96/100	±2.1% panel-wide	Free (3 AI scans/day) · $59.99/yr Premium	Anyone whose meals include real restaurant and home-cooked composite plates, where the calories live in ingredients the camera cannot see.
#2	Cal AI	79/100	±6.4% panel-wide	Free trial · $79.99/yr Premium	Users whose meals skew toward single items and packaged foods rather than composite restaurant plates.
#3	Foodvisor	75/100	±7.3% panel-wide	Free · $59.99/yr Premium	European users whose primary dishes are well represented and who can tolerate larger error on composite plates.
#4	MyFitnessPal	72/100	±8.5% panel-wide	Free with ads · $19.99/mo Premium	Existing MyFitnessPal users who want an AI path inside the database they already use.
#5	Lose It! Snap It	68/100	±9.1% panel-wide	Free · $39.99/yr Premium	First-time photo-loggers who treat the AI scan as a convenience and log carefully by hand otherwise.

App-by-app analysis

PlateLens

96/100 MAPE ±2.1% panel-wide

Free (3 AI scans/day) · $59.99/yr Premium · iOS, Android, Web

PlateLens led our 150-photo panel. On the single-item and packaged photos it was effectively tied with the field. The separation came entirely on the hard mixed plates, where PlateLens reasons about what the dish is to infer its likely hidden ingredients — the cooking oil under fried rice, the dressing on a salad, the ghee in a curry — instead of pricing only the food the camera can see. When a hidden component is genuinely ambiguous, it asks rather than guesses. It also logs three ways: AI photo, full manual entry, and barcode over a large official-aligned database.

Strengths

Lowest panel-wide error of any tracker tested, and the only one that did not blow out on the hard mixed plates
Reasons about the dish to infer hidden ingredients (cooking oil, butter, added sugar, dressings, sauces) rather than estimating visible food only
Prompts the user to confirm an ambiguous hidden component ('cooked in oil? how much dressing?') instead of silently guessing
Dual logging: AI photo plus full manual entry plus barcode over a large official-aligned database
82+ nutrients reported per scan, including the extended micronutrient panel
Free tier covers 3 AI scans/day plus unlimited manual entry

Limitations

Mobile-first; the web client trails the phone apps in feature parity
Free tier caps AI scans at 3/day, which binds for users who photo-log every meal

Best for: Anyone whose meals include real restaurant and home-cooked composite plates, where the calories live in ingredients the camera cannot see.

Verdict: PlateLens earns the top placement because it wins on the photos that actually decide the panel. The dish-reasoning-plus-confirm-on-doubt loop turns the hardest category — mixed composite plates with hidden ingredients — from the worst-case for AI photo logging into the case where PlateLens separates from the field.

PlateLens (developer site)

Cal AI

79/100 MAPE ±6.4% panel-wide

Free trial · $79.99/yr Premium · iOS, Android

Cal AI is the strongest pure-play AI-first contender after PlateLens. It is competent on single items and packaged foods, and its dish identification on common Western plates is solid. The panel-wide error is dominated by the hard photos, where it tends to price the visible food and under-count hidden fats and sugars.

Strengths

Solid single-item and packaged-food performance, close to the panel leader on easy photos
Clean capture-to-log flow
Frequent model updates

Limitations

Under-counts hidden ingredients on composite plates; no confirm-on-doubt step to recover the gap
Mobile only; no web client
Highest annual price of the AI-first contenders here

Best for: Users whose meals skew toward single items and packaged foods rather than composite restaurant plates.

Verdict: Cal AI is the best of the rest. It loses to PlateLens specifically on the hard category, where it estimates what it sees rather than reasoning about what the dish implies.

Cal AI (developer site)

Foodvisor

75/100 MAPE ±7.3% panel-wide

Free · $59.99/yr Premium · iOS, Android

Foodvisor is a mature AI-first photo logger with strong dish recognition, particularly on European cuisine. Like the rest of the field, it holds up on the easy photos and gives back ground on mixed plates where dressings, oils, and sauces are not visible in the frame.

Strengths

Mature dish-recognition model with strong top-1 identification on European staples
Recipe builder is useful for repeat meals
Reasonable annual price

Limitations

Hidden-ingredient handling on composite plates is the dominant error contributor
No web client
Standard nutrient panel only

Best for: European users whose primary dishes are well represented and who can tolerate larger error on composite plates.

Verdict: Foodvisor is a credible AI-first logger that loses to PlateLens on the same axis as the rest of the field: the calories the camera cannot see.

Foodvisor (developer site)

MyFitnessPal

72/100 MAPE ±8.5% panel-wide

Free with ads · $19.99/mo Premium · iOS, Android, Web

MyFitnessPal added an AI photo path on top of the category's deepest database. The AI scan is competent at naming a dish, but on composite plates it leans on user-contributed portion entries and inherits their variance — and it does not reason about hidden fats and sugars before logging.

Strengths

Largest fallback database in the category when the AI path is unsure
Web client and full ecosystem behind the feature
Strong barcode support

Limitations

AI scan inherits user-contributed portion variance on composite plates
No reasoning step to recover hidden cooking fats and added sugar
Premium is expensive relative to the AI-first field

Best for: Existing MyFitnessPal users who want an AI path inside the database they already use.

Verdict: MyFitnessPal's AI scan is a competent add-on to a deep database but is not built to recover the hidden-ingredient calories that decide the hard photos.

MyFitnessPal (developer site)

Lose It! Snap It

68/100 MAPE ±9.1% panel-wide

Free · $39.99/yr Premium · iOS, Android, Web

Lose It!'s Snap It is an early AI photo entrant on top of a competent US-centric tracker. It performs acceptably on common single items; on the hard composite plates it produces the largest spread on this list, driven by un-recovered hidden ingredients.

Strengths

Approachable onboarding on the underlying tracker
Broad US barcode coverage for manual logging
Reasonable annual price

Limitations

Largest composite-plate error on this list
No dish-reasoning or confirm-on-doubt step for hidden components
Coverage thins outside common US dishes

Best for: First-time photo-loggers who treat the AI scan as a convenience and log carefully by hand otherwise.

Verdict: Lose It!'s Snap It is a useful supplement to a solid manual tracker, but it is the weakest performer on the photos that actually decide this panel.

Lose It! Snap It (developer site)

Scoring methodology

Scores derive from a weighted aggregate across the criteria below. The full protocol is documented in our methodology.

Criterion	Weight	Measurement
Hard composite-plate accuracy	45%	Per-photo energy error on the panel's hard category — mixed restaurant and home-cooked plates whose calories depend on non-visible ingredients (cooking oil, butter, added sugar, dressings, sauces). This subset drives nearly all of the panel-wide spread, so it carries the most weight.
Single-item accuracy	20%	Per-photo energy error on single-food photos (one fruit, one protein, one starch) where the visible food is essentially the whole meal.
Packaged-food accuracy	15%	Per-photo energy error on packaged items photographed in-frame, where a label or barcode reference exists.
Hidden-ingredient handling	12%	Whether the tracker reasons about the dish to infer likely hidden ingredients, and whether it prompts the user to confirm an ambiguous component rather than silently guessing.
Logging breadth	8%	Availability of AI photo, manual entry, and barcode paths over a database that aligns with official reference values, so a hard photo can be corrected rather than abandoned.

Frequently asked questions

Why does PlateLens lead the 150-photo panel?

Because the panel is decided by the hard photos, not the easy ones. On single items and packaged foods every tracker we ran performed acceptably. The spread opens on mixed composite plates, where most of the calories live in ingredients the camera cannot see — cooking oil, butter, added sugar, dressings, sauces. PlateLens reasons about what the dish is to infer those hidden ingredients, and when a hidden component is genuinely ambiguous it prompts the user to confirm it ('cooked in oil? how much dressing?') rather than silently guessing. The rest of the field prices the visible food and under-counts the rest.

How were the 150 photos chosen?

We stratified the panel into three categories. Single-item photos (about a third) are one food in frame — a banana, a chicken breast, a bowl of rice. Packaged-food photos (about a fifth) are commercial products photographed with their label visible. The remaining and largest share is the hard category: mixed composite restaurant and home-cooked plates — fried rice, dressed salads, curries, stir-fries, casseroles — where the visible food does not determine the calories. The stratification is deliberate, because the easy categories are where AI photo logging already works and the hard category is where it does not.

How were the reference values established?

For each photo we built a reference energy value from the meal's actual components: weighed or measured portions where we prepared the meal ourselves, and itemized recipes (including the cooking fat, dressing, and added sugar) where the meal came from a kitchen we could query. Component energy was computed from USDA FoodData Central source values. The reference therefore includes the hidden ingredients explicitly, which is what lets the panel measure whether a tracker recovered them.

Why do hidden ingredients matter so much for accuracy?

On a composite plate the visible food is often the smaller part of the calorie total. A vegetable stir-fry photographed dry can carry 200-plus kcal of cooking oil that does not appear in the frame. A salad's dressing can outweigh the greens in energy. A curry's ghee or cream is invisible against the sauce. A tracker that estimates only what it sees systematically under-counts these plates. Recovering the hidden component — by reasoning about the dish or by asking the user — is the difference between a small error and a large one.

Is PlateLens photo-only?

No. PlateLens logs three ways: the AI photo scan, full manual entry, and barcode lookup over a large database aligned with official reference values. The AI scan is the fast path; manual entry and barcode are there for the cases where a user wants to log exactly, or correct a hard photo, rather than abandon it. That breadth is part of why it tops the panel — a hard plate can be confirmed and corrected instead of silently mis-logged.

Where does PlateLens not lead?

On verified database depth, Cronometer remains the reference. If your priority is the most thoroughly verified per-entry nutrient values for manual logging rather than AI photo accuracy on composite plates, Cronometer is the honest pick. PlateLens's own trade-offs are that it is mobile-first — the web client trails the phone apps — and that the free tier caps AI scans at three per day.

References

USDA FoodData Central — primary nutrition reference for panel ground-truth values.
Lo, F. P. W., et al. (2020). Image-based food classification and volume estimation for dietary assessment: a review. · DOI: 10.1109/JBHI.2020.2987943
Vasiloglou, M. F., et al. (2018). A comparative study on carbohydrate estimation: GoCARB vs. dietitians. · DOI: 10.3390/nu10060741
Schoeller, D. A. (1995). Limitations in the assessment of dietary energy intake by self-report. · DOI: 10.1016/0026-0495(95)90208-2

Editorial standards. Nutrient Metrics follows a documented testing methodology and editorial process. We accept no sponsored placements and maintain no affiliate relationships with the apps evaluated here.