Data Sourcing & Attribution

Last updated: March 9, 2026

Where Our Food Data Comes From

CleanKeto uses multiple data sources to provide the most accurate nutritional information possible. No single database is complete, so we cross-reference and validate across sources.

USDA FoodData Central

The primary source for macronutrient and micronutrient data on whole foods, branded products, and agricultural commodities. USDA data is lab-analyzed and peer-reviewed. We use the FoodData Central API for real-time lookups.

FatSecret Platform API

Provides barcode-to-product lookups and branded food data for packaged products. FatSecret maintains a curated database with barcode associations, serving sizes, and nutritional panels.

Nutritional information powered by FatSecret. Used under license.

Open Food Facts

A free, open-source food product database contributed by consumers worldwide. Provides ingredient lists, allergen information, and NOVA processing classifications for packaged foods. We use this as a supplementary source, particularly for ingredient-level analysis.

Data from Open Food Facts, available under the Open Database License (ODbL).

AI-Assisted Analysis

When a food is not found in structured databases (restaurant meals, homemade dishes, regional products), our AI models (Gemini) estimate nutritional composition based on ingredient descriptions and known food science data. AI-estimated values are marked with a lower confidence indicator.

How We Handle Ingredient Data

  • Ingredient lists: When available from barcode scans or product databases, we use the manufacturer-provided ingredient list for PQ scoring. This is the most reliable source for quality analysis.
  • Barcode priority: Barcode scans receive the highest data confidence because they link directly to manufacturer-reported data. Photo and text searches may have lower confidence.
  • Cross-validation: When multiple sources provide data for the same product, we compare values and flag significant discrepancies. USDA lab-analyzed data takes precedence over user-contributed data.

Personalized Scoring Data

Personalized For You scores incorporate your self-reported health profile, dietary preferences, and goals to adjust ingredient evaluations. Scoring rules are reviewed periodically and updated when new evidence emerges. All scores are informational and do not replace professional nutritional or clinical guidance.

Data Accuracy Limitations

  • Manufacturer changes: Product formulations change. A barcode scan reflects the most recently indexed version of a product, which may differ from the current formulation on shelves.
  • Regional variations: The same brand may use different ingredients in different countries. Our data primarily reflects U.S. market products.
  • AI estimation uncertainty: When exact nutritional data is unavailable, AI-generated estimates carry inherent uncertainty. We indicate this with confidence markers on affected scores.
  • User-contributed data: Open Food Facts entries are community-contributed and may contain errors. We apply validation checks but cannot guarantee 100% accuracy for all entries.

Research Citations

CleanKeto's PQ Score methodology and ingredient safety research draws from peer-reviewed literature available through PubMed and relevant food science journals. We use the latest evidence on ultra-processed foods, dietary quality indices, and ingredient safety to inform our scoring rules.

Reporting Inaccuracies

If you find incorrect nutritional data, ingredient information, or scoring errors, please report them through the app or email support@cleanketoapp.com. We investigate all reports and update our data and scoring rules when corrections are warranted.