New York 2025-2026 Regular Session

New York Senate Bill S06955

Introduced
3/27/25  
Refer
3/27/25  

Caption

Establishes the artificial intelligence training data transparency act requiring developers of generative artificial intelligence models or services to post on the developer's website information regarding the data used by the developer to train the generative artificial intelligence model or service, including a high-level summary of the datasets used in the development of such system or service.

Impact

If enacted, the bill would significantly impact how AI developers operate within New York State, adjusting the legal landscape to require greater accountability over AI data sources and usage. This development aligns with broader trends towards regulatory frameworks for AI technologies, reflecting growing public concern over data privacy and ethical AI practices. The act would hold developers responsible for ensuring that data used in AI training does not infringe on privacy rights and that any associated risks are clearly communicated to users.

Summary

Bill S06955, also known as the Artificial Intelligence Training Data Transparency Act, aims to enhance transparency around the datasets used for training generative artificial intelligence models or services. This legislation mandates that developers must publicly disclose specific information regarding the data used in their AI models. The required disclosures include details on data sources, the number of data points, cleaning processes, and the presence of personal information within the datasets. These measures aim to inform users and stakeholders about the underpinnings of AI technology, thereby fostering greater accountability in the deployment of AI services.

Contention

There are notable points of contention surrounding Bill S06955. Supporters argue that the transparency requirements will promote ethical AI development and mitigate risks associated with data misuse. Critics, however, express concerns that such regulations might impose excessive burdens on smaller developers, potentially stifling innovation. Additionally, debates may arise about how the terms used in the bill—such as 'personal information'—are defined, which could influence the extent of compliance required by small entities versus larger corporations. As the bill moves through the legislative process, these discussions will be crucial in shaping its final form and implementation.

Companion Bills

NY A06578

Same As Establishes the artificial intelligence training data transparency act requiring developers of generative artificial intelligence models or services to post on the developer's website information regarding the data used by the developer to train the generative artificial intelligence model or service, including a high-level summary of the datasets used in the development of such system or service.

Previously Filed As

NY A06578

Establishes the artificial intelligence training data transparency act requiring developers of generative artificial intelligence models or services to post on the developer's website information regarding the data used by the developer to train the generative artificial intelligence model or service, including a high-level summary of the datasets used in the development of such system or service.

NY A08595

Enacts the "New York artificial intelligence transparency for journalism act"; requires developers of generative artificial intelligence systems or services to post certain information on the developer's website regarding video, audio, text and data from a covered publication used to train the generative artificial intelligence system or service; allows journalism providers to bring an action for damages or injunctive relief against developers.

NY S08331

Enacts the "New York artificial intelligence transparency for journalism act"; requires developers of generative artificial intelligence systems or services to post certain information on the developer's website regarding video, audio, text and data from a covered publication used to train the generative artificial intelligence system or service; allows journalism providers to bring an action for damages or injunctive relief against developers.

NY SB53

Artificial intelligence models: large developers.

NY HB823

Generative Artificial Intelligence - Training Data Transparency

NY HB1876

Regarding The Ownership Of Model Training And Content Generated By A Generative Artificial Intelligence Tool.

NY A08833

Establishes understanding artificial intelligence responsibility act requiring developers of covered models to be strictly liable for certain injuries.

NY AB412

Generative artificial intelligence: training data: copyrighted materials.

NY AB2392

Public postsecondary education: generative artificial intelligence systems: procurement standards: training.

NY A03411

Requires the owner, licensee or operator of a generative artificial intelligence system to conspicuously display a notice on the system's user interface that is reasonably calculated to consistently apprise the user that the outputs of the generative artificial intelligence system may be inaccurate.

Similar Bills

HI SB2212

Relating To Artificial Intelligence Literacy Education.

HI HB1887

Relating To Artificial Intelligence Literacy Education.

CA SB813

California AI Standards and Safety Commission: independent verification organizations.

HI SB2923

Relating To Artificial Intelligence.

NJ A4352

Requires school districts to provide instruction on artificial intelligence; requires Secretary of Higher Education to develop artificial intelligence model curricula.

NJ S2860

Establishes Artificial Intelligence Apprenticeship Program and artificial intelligence apprenticeship tax credit program.

NJ S1802

Requires artificial intelligence companies to conduct safety tests and report results to Office of Information Technology.

CA AB1137

Reporting mechanism: child sexual abuse material.