Effective Date: December 25, 2025
Last Updated: December 25, 2025

1. Introduction

Welcome to speech-data.ai (“we,” “our,” or “us”). We are committed to protecting your privacy and handling your personal data with transparency and care. This Privacy Policy explains how we collect, use, store, and protect your information when you use our website and contribute speech data to our machine learning datasets used for training artificial intelligence models.

By using speech-data.ai or contributing data, you agree to the practices described in this Privacy Policy.

2. Information We Collect

2.1 Speech Data

When you participate in our data collection program, we collect:

  • Voice recordings of your speech
  • Audio samples you provide for dataset creation
  • Transcriptions and annotations of speech data
  • Linguistic metadata such as language, dialect, accent, and speaking style
  • Recording conditions including background noise levels and audio quality metrics

2.2 Personal Information

We may collect the following personal information:

  • Name and contact information (email address, phone number, company name)
  • Demographic information (age range, gender, geographic location)
  • Language proficiency and native language information
  • Organization or research affiliation
  • Intended use case for datasets

2.3 Technical Data

We automatically collect certain technical information:

  • IP address and device identifiers
  • Browser type and version
  • Operating system
  • Pages visited and time spent on our website
  • Referral sources
  • Cookies and similar tracking technologies

2.4 Metadata

We collect metadata associated with your contributions:

  • Recording timestamps
  • Device specifications used for recording
  • File formats and technical specifications
  • Contribution history and participation metrics

3. How We Use Your Information

3.1 Primary Purposes

We use your information to:

  • Create machine learning datasets for training AI models, particularly speech recognition, text-to-speech, and natural language processing systems
  • Improve AI model accuracy across diverse languages, accents, and speaking styles
  • Research and develop new speech and language technologies
  • License datasets to third-party organizations for AI training purposes

3.2 Operational Purposes

We also use your information to:

  • Process your dataset sample requests submitted through our contact form
  • Communicate with you about dataset availability, licensing options, and updates
  • Respond to inquiries and provide customer support
  • Send relevant information about our datasets and services
  • Improve our website and services
  • Comply with legal obligations
  • Prevent fraud and ensure security

3.3 Data Anonymization and De-identification

Where possible, we anonymize or de-identify speech data before including it in datasets. However, voice recordings may contain inherent biometric characteristics that could potentially be identifying.

4. Data Sharing and Disclosure

4.1 Dataset Licensing

Your contributed speech data may be:

  • Included in commercial datasets licensed to AI companies, researchers, and technology organizations
  • Used for training AI models by our licensees
  • Made available through our platform or third-party platforms
  • Combined with other data to create comprehensive training datasets

4.2 Third-Party Service Providers

We may share your information with:

  • Cloud storage and hosting providers
  • Email service providers for contact form processing
  • Analytics services
  • Customer relationship management (CRM) platforms
  • Legal and compliance advisors

4.3 Legal Requirements

We may disclose your information when required by law or to:

  • Comply with legal processes or government requests
  • Enforce our terms of service
  • Protect our rights, property, or safety
  • Prevent fraud or security threats

4.4 Business Transfers

In the event of a merger, acquisition, or sale of assets, your information may be transferred to the acquiring entity.

5. Data Retention

5.1 Speech Data

Speech data contributed to our datasets is retained indefinitely as it forms part of our machine learning training corpus. Once included in licensed datasets, this data cannot be fully retracted.

5.2 Personal Information

We retain your personal information:

  • For as long as necessary to fulfill the purposes outlined in this policy
  • To comply with legal obligations
  • For up to 7 years for business and legal purposes
  • Until you request deletion, subject to legal requirements

5.3 Technical Data

Technical and usage data is typically retained for 24 months unless longer retention is required for legal compliance.

6. Your Rights and Choices

6.1 Access and Correction

You have the right to:

  • Access the personal information we hold about you
  • Request correction of inaccurate information
  • Obtain a copy of your data in a portable format

6.2 Data Deletion

You may request deletion of your personal information, subject to:

  • Legal obligations requiring retention
  • Legitimate business interests
  • Datasets already licensed that cannot be recalled

Important: Voice recordings in datasets may not be fully removable once distributed to licensees.

6.3 Consent Withdrawal

You may withdraw consent for future data collection and communications by:

  • Unsubscribing from our email communications
  • Contacting us directly to opt out
  • Requesting removal from our contact list

6.4 Marketing Communications

You can opt out of marketing emails by clicking the unsubscribe link or contacting us directly.

6.5 Cookie Preferences

You can manage cookie preferences through your browser settings or our cookie consent tool.

7. Data Security

We implement reasonable security measures to protect your information:

  • Encryption of data in transit and at rest
  • Access controls and authentication requirements
  • Regular security assessments and updates
  • Secure data storage infrastructure
  • Employee training on data protection

However, no system is completely secure, and we cannot guarantee absolute security of your data.

8. International Data Transfers

Your information may be transferred to and processed in countries other than your country of residence. We ensure appropriate safeguards are in place, including:

  • Standard contractual clauses
  • Adequacy decisions by relevant authorities
  • Participant consent where required

9. Children’s Privacy

Our services are not directed to individuals under 18 years of age. We do not knowingly collect personal information from children. If we discover we have collected data from a child, we will delete it promptly.

10. Contact Form and Dataset Sample Requests

10.1 Information Collected

When you submit a contact form to request dataset samples, we collect:

  • Your name and email address
  • Company or organization name (if applicable)
  • Information about your intended use of the dataset
  • Any additional information you choose to provide

10.2 Use of Contact Form Data

Information submitted through our contact form is used to:

  • Process your dataset sample request
  • Provide you with the requested materials
  • Communicate about dataset licensing options
  • Send updates about our datasets and services
  • Maintain a record of inquiries for business purposes

10.3 Dataset Sample Access

  • Dataset samples are provided for evaluation purposes only
  • Samples may not be used for commercial AI training without a license
  • Recipients of samples agree not to redistribute the data
  • Full datasets require a separate licensing agreement

11. Special Considerations for Voice Data

11.1 Biometric Nature

Voice recordings may constitute biometric data under certain laws. By contributing speech data, you acknowledge that:

  • Voice characteristics can be personally identifying
  • Your voice may be analyzed for various acoustic features
  • Complete anonymization may not be technically feasible

11.2 Sensitive Content

Please do not include sensitive personal information in your speech recordings, such as:

  • Social security numbers or government IDs
  • Financial account information
  • Health or medical information
  • Passwords or security credentials

12. Changes to This Privacy Policy

We may update this Privacy Policy periodically to reflect changes in our practices or legal requirements. We will notify you of material changes by:

  • Posting the updated policy on our website
  • Displaying a prominent notice on our platform
  • Sending email notification to individuals on our contact list (where applicable)

Your continued use after changes constitutes acceptance of the updated policy.

13. Contact Information

For questions, concerns, or requests regarding this Privacy Policy or your personal information, please contact us:

Email: [email protected]

14. Jurisdiction-Specific Rights

14.1 European Union (GDPR)

If you are in the EU, you have additional rights under the General Data Protection Regulation:

  • Right to object to processing
  • Right to restrict processing
  • Right to lodge a complaint with a supervisory authority
  • Right to data portability

14.2 California (CCPA/CPRA)

California residents have rights under the California Consumer Privacy Act:

  • Right to know what personal information is collected
  • Right to know if personal information is sold or shared
  • Right to opt-out of sale/sharing of personal information
  • Right to non-discrimination for exercising privacy rights

14.3 Other Jurisdictions

We comply with applicable privacy laws in all jurisdictions where we operate. Contact us for information specific to your location.

15. Legal Basis for Processing

We process your information based on:

  • Consent: You have provided explicit consent for data collection
  • Contract: Processing is necessary to fulfill our agreement with you
  • Legitimate interests: We have legitimate business interests in AI research and development
  • Legal obligations: Processing is required to comply with applicable laws

16. Automated Decision-Making

We may use automated systems to:

  • Assess audio quality of submissions
  • Detect fraudulent contributions
  • Categorize speech samples

You have the right to request human review of automated decisions that significantly affect you.

Trending