The pitch is irresistible: point your camera at a plate, and a number appears. No searching a database, no scanning a barcode — just a photo and a calorie count. Cal AI built one of the largest followings in health apps on exactly that promise, with hundreds of thousands of reviews and a high average rating. So it's a fair question to ask before you hand it your eating habits for a year: is the number it gives you actually right?
The honest answer is yes-and-no, and the "no" part is specific enough to matter. Here's what the peer-reviewed research says about photo calorie counting — and where it quietly falls down.
The short answer
AI has gotten genuinely good at the first job: looking at a plate and naming what's on it. Across recent studies, food identification lands somewhere around 68 to 86 percent on common foods, and higher still for simple, well-lit, single items. A plain apple, a chicken breast, a bowl of rice — modern vision models recognise these reliably.
The problem is the second job, and it's the one that actually decides your calorie number: how much of the food is there. Estimating portion size from a photo is the weakest link in the entire pipeline. A 2025 scoping review in Frontiers in Nutrition found one image-based system was only about 39 percent reliable for portion size across the dishes it tested, and wearable photo tools underestimated portions by roughly 14 percent on average. Once you stack those errors together, published studies put real-world calorie error for photo apps in the range of 15 to 25 percent — and one 2024 trial measured errors swinging anywhere from a fraction of a percent up to 38 percent depending on the meal.
What that means in practice
A meal a photo app calls 600 calories might genuinely be anywhere from 450 to 750. For a plain plated food that's usually close enough. For a mixed bowl, a homemade curry, or anything with oil you can't see, the miss gets bigger — and it's almost always an underestimate, because the camera can't see the tablespoon of oil the pan was cooked in.
Why a photo can't see your portion
This isn't a bug Cal AI can patch — it's baked into the physics of the input. A photo is a flat, two-dimensional image, but calories live in three dimensions. The amount of food on a plate is a question of volume, and a 2D picture doesn't carry the depth information needed to reconstruct it. Two plates that look identical from above can hold wildly different amounts: a thin layer of rice and a heaped mound photograph almost the same from the top.
Researchers have confirmed this directly. The portion-estimation error for 2D-only photo methods sits around 15 to 25 percent precisely because the 3D information isn't fully present in the image. Newer approaches that add a depth sensor — like the LiDAR on a Pro iPhone — cut that error roughly in half, but most photo apps don't use it, which leaves them stuck at the old accuracy floor.
And then there are the things a camera fundamentally cannot detect: the oil a vegetable was roasted in, the butter under the eggs, whether that chicken was grilled dry or fried, whether the salad dressing is a drizzle or a quarter cup. These hidden ingredients are dense in calories and invisible in a photo, which is why photo estimates skew low on exactly the meals people most want to track.
What Cal AI users actually report
The research lines up neatly with what shows up in Cal AI's own reviews. The recurring theme isn't that it never works — it's that the calorie numbers drift, especially on simple food where the user already knows the answer. People describe photographing eggs, typing in exactly how many, and still getting a total that's clearly off. One independent test photographed an apple next to a scale and watched the app estimate 80 calories against an actual count nearer 120 — a one-third underestimate on possibly the easiest food to identify there is.
The pattern reviewers keep describing: the food gets recognised, but the calorie total lands somewhere you can't quite trust — and on a deficit, "can't quite trust" is the whole ballgame.
There are smaller frustrations too. Adjusting the grams of an item often doesn't recalculate the day's total. Scanning a label can jump to the maximum serving size by default. And the pricing isn't shown until you've finished a long onboarding flow — plans have ranged from a few dollars a week to around thirty dollars a year, but which number you're shown varies, and paying monthly can quietly add up to far more than the annual figure suggests.
But here's the part most reviews get wrong
It would be easy to end here and tell you photo tracking is useless. It isn't — and pretending otherwise would be its own kind of dishonesty. For weight loss, precision matters far less than consistency. If your app calls a meal 600 calories when it's really 680, but it makes that same error every time you eat that meal, the trend over a week is still telling you the truth. The scale doesn't care whether your daily number is exactly right. It cares whether it's pointed in the right direction, the same way, day after day.
So the real question isn't "is the photo estimate perfect" — nothing short of a kitchen scale is. The question is whether the method gives you a number you can be consistent with, on the meals you actually eat, without it quietly drifting low on the homemade dinners that make up most of real life.
What the research says actually fixes it
Here's the interesting bit, because the studies don't just diagnose the problem — they point at the fix. Across the accuracy literature, the most reliable consumer setups don't rely on the photo alone. They pair food recognition with a second input: a barcode, a quick manual edit, or — most usefully — a short text or voice description of the meal. Multimodal systems that take a written note like "grilled, no oil" alongside the image consistently capture the preparation and portion details a photo misses, and their estimates tighten as a result.
Read that again, because it's the whole game: the thing that makes calorie estimation more accurate is you describing the food in words. The portion ("two palms of chicken, one fist of rice"), the preparation ("pan-fried in butter"), the source ("homemade, not the restaurant version") — these are the exact variables a camera can't measure, and they're effortless to say out loud.
The voice-first alternative
This is the premise Rekkon is built on. Instead of photographing your meal and hoping the model guesses the volume, you just tell it what you ate — by voice. "Two eggs on sourdough with avocado." "Three slices of leftover homemade pizza." Because you're describing the food, you naturally include the things a photo can't see: how much, how it was cooked, whether it was a small bowl or a big one.
A few things follow from that design:
- You can state exact numbers when you have them. Reading a label? Say "a protein bar, 220 calories, 20 grams of protein" and Rekkon uses your figures instead of estimating. No photo app lets you simply override the guess by talking.
- Every meal is editable in a tap. If something reads as one slice and it was two, you fix it and the day's total updates immediately — the exact thing reviewers wish Cal AI did.
- It learns how you eat. After a week it knows your anchors and your patterns, and the coaching gets specific — not "eat more protein," but "your weekday breakfasts are dialled; the weekend ones are where the calories hide."
It's the same realisation a lot of people reach with barcode and photo apps: the camera was never the unlock. Describing the food is faster, it's more honest about the parts that matter, and — going by the research — it's the input that actually moves accuracy in the right direction.
So, is Cal AI accurate?
Accurate enough to show you a rough trend on simple, plated foods — yes. Accurate on portion sizes, mixed meals, homemade dinners, and anything cooked in oil you can't see — not really, and the research is clear about why: a flat photo can't measure volume or spot hidden calories, so the estimates skew low exactly where it counts. If that's been your experience — the numbers feeling "off" on food you know — you're not imagining it, and it isn't user error. It's the method.
The fix the science points to is almost funny in how simple it is: stop making a camera guess, and just say what you ate. That's the entire idea behind Rekkon.
Common questions
Is Cal AI accurate?
Cal AI is reasonably accurate at identifying common, plated single foods, but less reliable for portion size and mixed or homemade meals. Research puts AI photo food identification around 68 to 86 percent and portion-size estimation as low as 39 percent reliable, with end-to-end calorie error typically 15 to 25 percent. It's useful for spotting rough trends, but struggles wherever a photo can't see depth, hidden oils, or true portion volume — and those misses are almost always underestimates.
What is the most accurate way to track calories from a meal?
The research favours combining food recognition with a second input — a text or voice description, a barcode, or a quick edit. Describing a meal in words captures the preparation and portion details a photo misses, like the oil it was cooked in or whether a serving was one palm or two of chicken. And consistency beats precision: logging the same way every day makes the weekly trend reliable even when individual estimates are rough.
Is there a Cal AI alternative that doesn't use photos?
Yes. Voice-first apps like Rekkon let you describe what you ate in natural language instead of photographing it. Because you describe the food, you can include the details a camera can't see, state exact numbers when you know them, and edit any meal in a tap. It then learns your eating patterns over time and coaches the adjustments rather than just counting.
Stop making a camera guess.
Just say what you ate. Rekkon learns how you eat and coaches the rest. Seven days free.
Try free