What the EU AI Act Actually Governs
The EU AI Act entered into force in 2026 as the world's first comprehensive legal framework for artificial intelligence. It is not primarily a data privacy law. It is a risk-based governance framework that classifies AI systems by the potential harm they can cause and assigns compliance obligations accordingly.
That distinction matters. The Act sits alongside the GDPR, not on top of it. But its data governance provisions reach deep into how AI systems are trained, tested and deployed. And because the data used to build those systems frequently belongs to people who have never set foot in Europe, the Act's global implications are significant and still widely misunderstood.
The risk tiers run from unacceptable risk (outright prohibited) to high risk (heavily regulated) to limited and minimal risk (lighter-touch disclosure obligations). Most of the data governance architecture lives in the high-risk tier, which covers AI systems used in hiring, credit scoring, biometric identification, critical infrastructure, education, law enforcement and migration control.
Extraterritorial Reach: Why Non-EU Citizens Are Affected
The EU AI Act follows the same territorial logic as the GDPR. Under Article 2, the Act applies to providers and deployers of AI systems that place products on the EU market or put systems into service in the EU, regardless of where those providers are established. A company headquartered in San Francisco deploying a hiring algorithm that evaluates EU-based applicants falls squarely within scope.
The reverse is equally true, and this is where most commentary stops short. Non-EU citizens whose data was used to train a covered system can see their information processed under Act-governed conditions without any formal notice or recourse mechanism tied directly to them. The Act creates obligations for the system operator, not enforceable individual rights for every data subject whose records fed the model.
That asymmetry is structural. The GDPR gave individuals legal standing. The AI Act largely gives regulators legal standing. The practical effect for a person in Brazil, Nigeria or the United States whose health records, social media content or purchasing behavior trained a high-risk EU-deployed AI system is a compliance burden on the company and not, in most cases, a direct right of action for the individual.

The Data Governance Provisions That Matter Most
Article 10 is the core data governance provision for high-risk AI systems. It requires that training, validation and testing datasets be subject to appropriate data governance and management practices. Specifically, datasets must be relevant, representative, free of errors and complete to the extent required by the intended purpose.
The representativeness requirement has real teeth. A model trained predominantly on data from one demographic, geographic or socioeconomic group must account for that skew before deployment in the EU. This creates a pressure to diversify training data, which sounds positive until you recognize it also creates incentives to harvest more varied data from more populations globally.
Article 12 requires technical documentation and logging for high-risk systems. Operators must maintain records of datasets used, the data provenance and preprocessing steps applied. Article 13 requires transparency to deployers. Neither article creates a direct disclosure right for individuals whose data contributed to training the system.
Article 17 requires providers to establish a quality management system that covers data management procedures. This is the provision that most directly mirrors enterprise data governance frameworks like ISO/IEC 38505. For engineers, it translates to: data lineage tracking, version control for datasets, documented preprocessing pipelines and audit trails are now legal requirements for covered systems, not just best practices.
High-Risk AI Systems and What They Do With Your Data
The high-risk categories enumerated in Annex III of the Act are worth reading in full if you work in privacy engineering. They include AI used for biometric categorization, real-time remote biometric identification in public spaces (largely prohibited), AI in employment and workforce management, systems that determine access to education or vocational training, AI used in essential private and public services including credit, AI used in law enforcement, AI in migration and asylum management and AI used in the administration of justice.
If you are a US citizen who applied for a visa to enter an EU member state, and that visa application was evaluated by an AI system, you were processed by a covered high-risk system. The operator of that system is now subject to Article 10 data governance requirements, Article 17 quality management requirements and the conformity assessment procedures under Article 43.
The conformity assessment for many high-risk systems involves a notified body, which is a third-party auditor designated by an EU member state. This is structurally similar to SOC 2 Type II audits in the US context, but with statutory force rather than contractual or reputational pressure. The audit covers whether data governance procedures were actually followed, not just documented.
The Gap Between GDPR and the AI Act
The GDPR established that personal data has legal weight. It created rights to access, rectification, erasure and portability. It established lawful bases for processing and gave supervisory authorities enforcement power. The AI Act assumes that GDPR compliance is a floor, not a ceiling, and builds above it.
But the gap between the two frameworks is real and creates exploitable space. The GDPR's right to explanation under Recital 71 applies to solely automated decisions with legal or similarly significant effects. The AI Act's transparency requirements apply to the system deployer, not necessarily to the individual affected by the system's output.
A person denied a loan in the EU by an AI-assisted credit scoring system has GDPR-based rights to challenge that specific decision. But they have no AI Act-based right to audit the training data that shaped the model that made the decision. Those are two distinct legal instruments protecting two distinct interests, and the space between them is where most individual data rights currently fall.
For non-EU citizens, the gap is wider. GDPR protections apply when processing occurs in the EU or when a data subject is in the EU at the time of processing. A US resident whose data was scraped from a public platform, incorporated into a training dataset in Ireland and used to build a hiring model never had GDPR standing. The AI Act does not create that standing retroactively.

Why Proof of Data Ownership Matters in an AI World
The AI Act's Article 10 representativeness and provenance requirements create a new kind of institutional demand: verifiable data provenance. Operators of high-risk AI systems must be able to document where their training data came from. That documentation requirement, sitting inside a binding legal framework, creates a structural pressure that has not existed before at this scale.
This is precisely the terrain that MyDataKey™ was built to operate in. As a nonprofit data ownership tool from Own Your Data Inc., MyDataKey™ issues cryptographically signed certificates that establish when a person first associated specific data attributes with their verified identity. The underlying framework, the Personal Data Asset Origination System described in the PDAOS white paper, is designed to function as a provenance layer that AI governance frameworks can reference.
The logic is direct. If an AI operator must document data provenance under Article 10, and a certificate exists showing that a specific individual owned and originated a specific data attribute before it appeared in any training set, that certificate becomes relevant evidence in any governance audit. It does not automatically create a right of action, but it does create a factual record that did not previously exist.
Proof of ownership is not a privacy control in the traditional access-restriction sense. It is an assertion of origination. The difference matters legally. A firewall blocks access. A provenance certificate documents priority. In a world where AI operators are legally required to track data lineage, having a timestamped, cryptographically verifiable record of when you first associated data with your identity is a fundamentally different kind of asset than it was before the AI Act existed.
What You Can Do Now, Practically
If you are a privacy-aware individual or practitioner operating outside the EU, the AI Act creates some concrete near-term action items.
First, map the high-risk AI systems you interact with. Visa applications, credit applications, hiring platforms, insurance underwriting tools and educational admission systems are all candidate categories. If the company deploying the system does business in the EU, the Act likely applies to their AI infrastructure even if you are not in the EU yourself.
Second, use existing rights aggressively. If you are in California, the CCPA and the California Privacy Rights Act give you the right to know what categories of personal information a business has collected about you and the right to opt out of its sale. These rights extend to training data in some circumstances. Submit Subject Access Requests to data brokers and large platform operators. You can start that process at mydatakey.org/opt-out.
Third, establish provenance records now. The AI Act's data governance audits will intensify over the 2026 enforcement cycle. Operators who cannot document clean data lineage face conformity failures. The individuals who have verifiable origination records will be in a structurally different position than those who do not when questions of training data provenance arise.
Fourth, follow the enforcement actions from the European AI Office, which was established to oversee compliance with the Act. The Office's early enforcement priorities and guidance will define how vague provisions like "appropriate data governance practices" are interpreted in practice. That interpretive work will have global downstream effects on any company doing business in the EU.
The Global Regulatory Cascade the EU Just Started
The Brussels Effect is a documented regulatory phenomenon: EU rules become de facto global standards because multinationals find it more efficient to apply the strictest regime across their entire operation rather than maintain jurisdiction-specific compliance stacks. GDPR demonstrated this pattern clearly, pushing companies worldwide to adopt privacy practices that exceeded their local legal requirements.
The AI Act will follow the same pattern, with additional complexity. Unlike the GDPR, which primarily governed data flows, the AI Act governs systems. Bringing an AI system into compliance in the EU means changing the system's architecture, documentation, testing procedures and data governance pipelines. Those changes propagate across the system globally, not just in EU-facing components.
The practical result for non-EU citizens is that the Act will improve data governance practices globally for any AI system built by a company with EU market exposure. The documentation, representativeness and audit trail requirements will become engineering standards embedded in how large AI systems are built, regardless of where they are deployed. That is a materially different outcome than the Act's purely legal text would suggest.
At the same time, the gap between institutional compliance and individual rights will persist unless national legislatures act to close it. The AI Act gives regulators tools. It gives individuals leverage only indirectly, through the threat of regulatory enforcement that companies want to avoid. Closing that gap requires either additional legislation (state-level AI liability frameworks in the US are moving in this direction) or individual tools that establish verifiable records the legal system can reference.
Own Your Data Inc. built MyDataKey™ specifically because that gap exists. The nonprofit mission is to give individuals the same kind of provenance infrastructure that institutions have always had, so that data governance audits required by frameworks like the EU AI Act can be contested or informed at the individual level, not just managed at the corporate level. That mission becomes more urgent as AI governance frameworks multiply globally and the value of demonstrable data origination only increases.
If you want to establish a verifiable record of your data ownership before the next wave of AI governance audits begins, get your MyDataKey™ certificate at mydatakey.org/signup.
Editorial Review
This article was reviewed by Ryan Gaughan on May 27, 2026 for accuracy, currency, and clarity. Content is updated when laws or guidance change.