Off-the-Shelf Datasets

Sample

"Zonekee has developed many customized databases that cater to the needs of various industries dealing with text, voice, and image data. Each database is designed to meet the specific requirements of our customers."

  • ASR Datasets
  • TTS Datasets
  • NLP Datasets
  • CV Datasets
Zonekee is the portal to a world brimming with limitless linguistic possibilities! Immerse yourself in our vast Parallel Corpus data for NLP, boasting an astounding 3,000,000,000 text samples. Delve into datasets spanning Vehicle Control, Home Command, Dialogue, Network Slang, News, and Children's Books, covering translations from Chinese to foreign languages, English to foreign languages, and foreign languages to foreign languages. Ignite your machine translation, sentiment analysis, and language modeling projects with unparalleled resources. Our user-friendly platform empowers you to effortlessly revolutionize cross-lingual communication. Unleash the power of multilingual comprehension and global connectivity.
  • Zulu

    Zulu General Speech Recognition Corpus

  • Hokkien

    Hokkien General Speech Recognition Corpus with Scripted

  • Mongolian

    Mongolian General Speech Recognition Corpus

  • Finnish

    Finnish General Speech Recognition Corpus with Scripted

  • Chinese

    Chinese audiobook datasets for Speech Recognition

  • Thai

    Thai General Speech Recognition Corpus with Scripted

  • Chinese Sichuan dialect

    Sichuan dialect General Speech Recognition Corpus

  • Swedish

    Swedish General ASR Corpus

  • Changsha dialect

    Changsha dialect General Speech Recognition Corpus

  • Spanish

    Spanish General Speech Recognition datasets

  • Norwegian

    Norwegian General Speech Recognition datasets

  • Italian

    Italian General Speech Recognition datasets

  • Italian

    Italian Female General Speech Synthesis Datasets

  • Hungarian

    Hungarian Male General Speech Synthesis Datasets

  • Greek

    Greek Female General Speech Synthesis Datasets

  • Chinese

    Taiwan Chinese Female General Speech Synthesis Corpora

  • Vietnamese

    Vietnamese Male General Speech Synthesis Corpora

  • Vietnamese

    Vietnamese Kids General Speech Synthesis Corpora

  • Uyghur

    Uyghur Female General Speech Synthesis Corpus

  • Chinese

    Chinese Female General Speech Synthesis Corpus

  • Chinese

    Chinese Pop Songs Speech Synthesis Corpus

  • Sichuan dialect

    Sichuan dialect Female Speech Synthesis Datasets for voice assistant

  • Mongolian

    Mongolian Female General Speech Synthesis Corpus

  • Japanese

    Japanese Female General Speech Synthesis Corpus

  • English - Italian

    English-Italian Parallel Corpus

  • Chinese

    Chinese customer services text corpus

  • Brazilian Portuguese

    Brazilian Portuguese vehicle instruction set text corpus

  • Estonian

    Estonian Dialogue Text Corpus

  • Spanish - English

    Spanish-English Parallel Corpus

  • English - Finnish

    English-Finnish Parallel Corpus

  • Chinese - English

    Chinese-English Parallel Corpus

  • Greek - English

    Greek-English Parallel Corpus

  • English - Hungarian

    English-Hungarian Parallel Corpus

  • Chinese

    Chinese Financial Text Corpus

  • English - Estonian

    English-Estonian Parallel Corpus

  • Chinese

    Southwest Mandarin Lexicon corpus

  • English

    English Restaurant Menu Image Data

  • Driver Behavior with Annotation Images Data

  • OCR & Handwriting Collection & Annotation

  • Face & Body Gesture Annotation

  • Human Face Dataset

  • Lane Line Dataset

  • Human Motion Limbs Dataset

  • 3D Faces Collection Data (Asia)

  • Face Video Data (Asia)

  • Multi-pose Face Data with Annotation

  • Parking Space Images Data

  • Multi-person, Multi-view Tracking Images Data

180+ languages for you to choose from

    Africa

  • Arabic
  • Swahili
  • Amharic
  • Hausa
  • Oromo
  • Yoruba
  • Igbo
  • Zulu
  • Shona
  • Somali
  • French
  • Berber
  • Afrikaans
  • Wolof
  • Amazigh
  • Tigrinya
  • Fula
  • Xhosa
  • Kinyarwanda
  • Malagasy

    Europe

  • German
  • French
  • English
  • Italian
  • Spanish
  • Polish
  • Romanian
  • Dutch
  • Portuguese
  • Swedish
  • Czech
  • Greek
  • Hungarian
  • Finnish
  • Danish
  • Bulgarian
  • Slovak
  • Irish (Gaelic)
  • Lithuanian
  • Slovenian
  • Estonian
  • Croatian
  • Latvian
  • Maltese
  • Luxembourgish
  • Cypriot Greek
  • Cypriot Turkish
  • Basque
  • Galician
  • Catalan
  • Welsh
  • Scottish Gaelic
  • Cornish
  • Manx
  • Alsatian
  • Breton
  • Frisian
  • Romani
  • Sardinian
  • Norwegian

    Asia

  • Mandarin Chinese
  • Hindi
  • Arabic
  • Bengali
  • Japanese
  • Punjabi
  • Javanese
  • Telugu
  • Marathi
  • Tamil
  • Urdu
  • Gujarati
  • Korean
  • Malayalam
  • Vietnamese
  • Turkish
  • Tagalog
  • Indonesian
  • Thai
  • Kannada
  • Oriya
  • Burmese
  • Sindhi
  • Pashto
  • Nepali
  • Azerbaijani
  • Sinhala
  • Khmer
  • Kazakh
  • Uzbek
  • Malay
  • Cebuano
  • Hmong
  • Mongolian
  • Assamese
  • Lao
  • Balochi
  • Tibetan
  • Filipino
  • Maithili
  • Dhivehi
  • Zhuang
  • Magahi
  • Awadhi
  • Chhattisgarhi
  • Saraiki
  • Madurese
  • Maldivian

    North America

  • English
  • Spanish
  • French

    South America

  • Spanish
  • Brazilian Portuguese
  • Guarani
  • Quechua
  • Aymara

    Africa

  • Arabic
  • Swahili
  • African French
  • Amharic
  • Hausa
  • Oromo
  • Yoruba
  • Igbo
  • Zulu
  • Shona
  • Somali
  • Berber
  • Afrikaans
  • Wolof
  • Amazigh
  • Tigrinya
  • Fula
  • Xhosa
  • Kinyarwanda
  • Malagasy
  • South African English

    Europe

  • German
  • French
  • British English
  • Italian
  • Spanish
  • Polish
  • Romanian
  • Swiss French
  • Belgian French
  • Swiss German
  • Dutch
  • Portuguese
  • Swedish
  • Czech
  • Greek
  • Hungarian
  • Finnish
  • Danish
  • Bulgarian
  • Slovak
  • Irish (Gaelic)
  • Lithuanian
  • Slovenian
  • Estonian
  • Croatian
  • Latvian
  • Maltese
  • Luxembourgish
  • Cypriot Greek
  • Cypriot Turkish
  • Basque
  • Galician
  • Catalan
  • Welsh
  • Scottish Gaelic
  • Cornish
  • Manx
  • Alsatian
  • Breton
  • Frisian
  • Sami languages
  • Romani
  • Sorbian
  • Faroese
  • Karelian
  • Ladin
  • Sardinian
  • Norwegian
  • Icelandic
  • Austrian German
  • Saxon German

    Asia

  • Mandarin Chinese
  • Cantonese Chinese
  • Shanghainese Chinese
  • Hindi
  • Arabic
  • Egyptian Arabic
  • Levantine Arabic
  • Gulf Arabic
  • Australian English
  • New Zealand English
  • Indian English
  • Singaporean English
  • Bengali
  • Japanese
  • Punjabi
  • Javanese
  • Telugu
  • Marathi
  • Tamil
  • Urdu
  • Gujarati
  • Korean
  • Malayalam
  • Vietnamese
  • Turkish
  • Tagalog
  • Indonesian
  • Thai
  • Kannada
  • Oriya
  • Burmese
  • Sindhi
  • Pashto
  • Nepali
  • Azerbaijani
  • Sinhala
  • Khmer
  • Kazakh
  • Uzbek
  • Malay
  • Cebuano
  • Hmong
  • Mongolian
  • Assamese
  • Lao
  • Balochi
  • Tibetan
  • Filipino
  • Maithili
  • Dhivehi
  • Zhuang
  • Magahi
  • Awadhi
  • Chhattisgarhi
  • Saraiki
  • Madurese
  • Maldivian
  • Hokkien Chinese
  • Hakka Chinese
  • Xiang Chinese
  • Maghrebi Arabic
  • Sudanese Arabic
  • Iraqi Arabic
  • Hijazi Arabic

    North America

  • American English
  • Spanish
  • Canadian English
  • Canadian French
  • Tagalog
  • Vietnamese
  • German
  • Italian
  • Portuguese
  • Chinese Cantonese
  • Mexican Spanish

    South America

  • Spanish
  • Brazilian Portuguese
  • Guarani
  • Quechua
  • Aymara
  • English
  • Dutch
  • French
  • Italian
  • German
  • Malayalam
  • Hungarian
  • Finnish
  • Swedish
  • Norwegian
  • Quechuan
  • Mapudungun
  • Latin American Spanish
  • Caribbean Spanish
  • Rioplatense Spanish
  • Andean Spanish
  • Chilean Spanish
  • Central American Spanish

Why Zonekee?

Zonekee's off-the-shelf datasets comprise a diverse collection of text, audio, image, and video data, giving machine learning systems a wide range of content to learn from.

Our goal is to meet the data usage requirements of our customers in a variety of different situations. Whether you need a large amount of data for streaming video, or a smaller amount for basic web browsing, we are committed to providing you with the right data plan to suit your needs.

Zonekee is proud to offer coverage in over 180 regions around the globe, with support for a wide range of languages and a team of dedicated personnel ready to assist you. Wherever you are located, we are here to serve you and provide the resources you need.

Our company has achieved dual certification in both ISO 27001 and ISO 27701, demonstrating our commitment to ensuring the security and privacy of our customers' data. These are internationally recognized standards for information security management and privacy information management, and obtaining them reflects our aim to provide the highest levels of security and protection for our customers.

Did you find the data?

Leave a Message & Get a Quote