AI Training Data Quality Risk

Potential for poor, biased, or unrepresentative training data to produce unreliable, discriminatory, or harmful artificial intelligence model outputs.

Full definition
AI Training Data Quality Risk arises when the datasets used to develop machine learning models contain errors, biases, or gaps, or fail to represent the population in which the model will be deployed. Low-quality training data produces models that perform poorly in production, make biased decisions, or fail unexpectedly on edge cases. For example, facial recognition systems trained predominantly on images of lighter-skinned individuals have shown higher error rates for darker-skinned individuals, creating discrimination risk. Organizations should implement data quality controls, bias testing, and diversity assessment of their datasets, and should monitor model performance across demographic segments on an ongoing basis. The EU AI Act addresses this directly: Article 10 sets data and data governance requirements for high-risk AI systems.
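
Two of the controls named above, diversity assessment and monitoring across demographic segments, can be illustrated with a minimal Python sketch. It is not a prescribed method. The function names, the group labels "A" and "B", and the sample data are all hypothetical, and a real assessment would need larger samples and a justified reference population.

```python
from collections import Counter, defaultdict


def representation_gap(train_groups, reference_shares):
    """Training-set share of each group minus its expected deployment share.

    A negative value means the group is under-represented in training data.
    `reference_shares` is an assumed mapping of group -> expected share.
    """
    if not train_groups:
        raise ValueError("train_groups must not be empty")
    counts = Counter(train_groups)
    total = len(train_groups)
    return {
        group: counts.get(group, 0) / total - share
        for group, share in reference_shares.items()
    }


def error_rate_by_group(y_true, y_pred, groups):
    """Fraction of wrong predictions within each demographic segment."""
    totals = defaultdict(int)
    errors = defaultdict(int)
    for truth, pred, group in zip(y_true, y_pred, groups):
        totals[group] += 1
        if truth != pred:
            errors[group] += 1
    return {group: errors[group] / totals[group] for group in totals}


def max_error_disparity(rates):
    """Gap between the worst-served and best-served segment."""
    return max(rates.values()) - min(rates.values())


# Hypothetical data: group "B" is 20% of training data but 50% of users.
train_groups = ["A"] * 8 + ["B"] * 2
gaps = representation_gap(train_groups, {"A": 0.5, "B": 0.5})

# Hypothetical evaluation set with four labelled examples per group.
y_true = [1, 0, 1, 0, 1, 0, 1, 0]
y_pred = [1, 0, 1, 1, 0, 1, 1, 0]
groups = ["A", "A", "A", "A", "B", "B", "B", "B"]
rates = error_rate_by_group(y_true, y_pred, groups)
disparity = max_error_disparity(rates)

print(gaps)
print(rates, disparity)
```

In this sketch a large negative representation gap or a large error disparity is only a signal to investigate. Acceptable thresholds depend on the use case and are not defined here.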
Tags: AI, data quality, bias, governance