← Blog

Camera Translation in Low Light and No Signal: A Field Test

Cloud OCR fails predictably in dim restaurants, smudged lenses, and subway dead zones. Here is what on-device camera translation does differently.

The conditions camera translation actually meets

Camera translation reviews are usually shot under studio conditions. A flat menu, head-on, in a well-lit kitchen. That is not the condition you encounter at 9pm in a Bangkok night market or at 6am in the back of a Berlin Hauptbahnhof tunnel.

This piece compares how on-device camera translation (Cove Travel and Cove Photo) behaves vs cloud OCR (Google Lens, ChatGPT Vision) under four real “non-ideal” conditions. No benchmark numbers — just the behaviors you can observe yourself.

Condition 1: Dim restaurant lighting

A typical izakaya, ramen counter, or wine bar runs at 30-80 lux at the table — about 1% of midday outdoors. The camera ISO has to climb into the 800-3200 range, which means heavy noise on the image the OCR sees.

What changes between cloud and on-device:

BehaviorCloud OCROn-device (Cove)
First attempt2-5 second round-trip then “we couldn’t read this”Under 1 second, returns best guess plus low-confidence flag
Second attempt (you re-aim)Another 2-5 second round-tripAnother sub-second
User behavior in dim lightTendency to wait, eventually give upTendency to keep adjusting angle until the model signals confidence

The cloud version’s “wait then fail” loop is the worst pattern in dim light because it punishes the user for the camera’s noise. On-device’s “answer fast even if uncertain” is the better UX because it lets you loop through angles in seconds, not minutes.

Condition 2: Smudged or fingerprinted lens

Phone cameras pick up smudges constantly — your face brushes the lens, your finger touches it when you grab the phone. Travel cameras tend to be especially smudgy because you are holding the phone for hours in hand-sweat conditions.

A smudged lens turns OCR into a probabilistic problem. The text is there but blurry. The cloud OCR’s response is usually one of:

  • “We couldn’t recognize text” (false negative)
  • A confident wrong reading of a smudge as a Chinese character it vaguely resembles (false positive)

Cove’s on-device behavior is more useful: it flags the result with “low confidence” and suggests a retry. The model’s training data includes a lot of imperfect mobile-camera input, so it has learned to say “I see something blurry that might be X but check me.”

Condition 3: Network dead zones (subway, basement, tunnels)

This is the one that decides everything. There are real places where your phone has zero signal:

  • Tokyo Metro tunnels (especially the Marunouchi and Hibiya lines)
  • Seoul Metro Line 2 deep stations
  • Bangkok BTS / MRT underground sections
  • The basement of any major European train station between platforms
  • Inside the security line at most international airports

In these zones, cloud OCR returns network errors. The fallback is “we’ll retry when you’re online again” — useless for the actual moment when you are trying to read a sign before the next train arrives.

Cove Travel and Cove Photo both run Google Gemma 4 E2B on your phone’s NPU. The presence or absence of a cell tower is irrelevant to whether the camera works. This is the most boring of the four conditions to write about and the most important in practice.

Condition 4: Distance and angle

You see a temple plaque 4 meters above your head. You see a bus destination sign 30 meters down the street. You see a menu board on the back wall of a 6-meter-deep restaurant.

Cloud OCR and on-device OCR both struggle here, but in different ways:

  • Cloud OCR: tends to attempt and fail confidently. It returns something even when the source is too small to read accurately.
  • On-device OCR (Cove): returns a confidence flag. When the source text is below a certain pixel threshold, it tells you to get closer rather than guessing.

The honest version: neither tool replaces walking 5 meters closer to the sign. But the on-device version is better at telling you you need to walk closer, which is the useful behavior for a tourist.

What “honest” looks like in practice

When camera conditions are bad, the question is not “which tool is more accurate” — both are equally limited by what the camera physically captures. The question is “which tool’s failure mode is more useful.”

Cove’s failure mode in degraded conditions:

  • Returns a result quickly even when uncertain.
  • Flags the confidence level so you know whether to trust it.
  • Suggests retry rather than rage-quitting on the user.
  • Works at all in subway tunnels, regardless of OCR confidence.

Cloud OCR’s failure mode:

  • Long round-trip even when the answer is unrecoverable.
  • Sometimes returns confident wrong results (smudge → kanji).
  • Returns network errors in the moments most demanding fast answers.
  • Pushes failure cost back to the user (retry on better Wi-Fi).

If you weight failure-mode usefulness as much as nominal accuracy, the on-device choice wins in degraded conditions specifically because those are the conditions where network adds time, not value.

What this means for the apps you reach for

For Cove Travel — a translation tool you use mostly outdoors, in transit, on the go — degraded conditions are the median condition, not the edge case. The on-device approach is right because the network-hostile 90% of the use case is the network-hostile 90% of the use case, by definition.

For Cove Photo — broader visual-question app — the mix is different. Indoor well-lit Photo conditions are common (you’re asking what a plant in your apartment is, the fridge contents, a homework problem at the kitchen table). For those, network OCR works fine. The on-device choice still wins on privacy (the photo is literal evidence of your kid’s homework or your kitchen contents) but the latency advantage is smaller.

A pre-trip camera-readiness checklist

Two minutes:

  • Wipe your phone’s lens with a microfiber cloth before leaving the hotel. Smudges are the single biggest preventable input quality problem.
  • Test camera translation on one sign in your hotel lobby — confirm it reads under the hotel’s typical mediocre lighting.
  • Toggle airplane mode and re-test the same sign. If the answer changes or stops working, you do not have a real on-device tool.
  • Note your phone’s behavior at low light: increased ISO + slower shutter = more motion blur. Brace your elbows or use both hands.

That is the entire pre-camera prep. The rest is the actual travel.

Where to read further

The two pieces this article references most:

For Japan-specific camera scenarios (kanji menus, station signs), best offline translator for Japan trip 2026 walks through the same conditions in a single-country context.