ACLCLP


Application for Use of Speech Database





MAT-160

  • Database Name: MAT-160
  • Speech File Editing Program: VEDITOR 3.0
  • Database Brief (PDF)
The MAT Speech Database, including the speech file editing program, is stored on 1 DVD-ROM.

The MAT Speech Database (MATDB) is the result of the research program subsidized by the National Science Council of the Executive Yuan, and ACLCLP is authorized to release it. Applicants should apply by signing the license agreement and complying with its terms.

Documents required:

  1. A certificate from the applicant's affiliated institution indicating his/her status at this institution
  2. Two (2) original copies of the License Agreement (one kept by ACLCLP, the other by the applicant).
  3. Price: US$20.-


Please send the documents to:

The Association for Computational Linguistics and Chinese Language Processing (ACLCLP)
c/o Institute of Information Science, Academia Sinica,
128 Sec. 2 Academy Rd., Nankang, Taipei, 115 Taiwan
Tel: +886-2-2788-3799 ext. 1502
Fax: +886-2-2788-1638
E-Mail: aclclp@hp.iis.sinica.edu.tw 

Payment: please fill in the payment form  




MAT-400

  • Database Name: MAT-400
  • Speech File Editing Program: VEDITOR 4.0
  • Database Brief (PDF)
The MAT Speech Database, including the speech file editing program, is stored on 1 DVD-ROM.

The MAT Speech Database (MATDB) is the result of the research program subsidized by the National Science Council of the Executive Yuan, and ACLCLP is authorized to release it. Applicants should apply by signing the license agreement and complying with its terms.

Documents required:

  1. A certificate from the applicant's affiliated institution indicating his/her status at this institution
  2. Two (2) original copies of the License Agreement (one kept by ACLCLP, the other by the applicant).
  3. Price: US$30.-


Please send the documents to:

The Association for Computational Linguistics and Chinese Language Processing (ACLCLP)
c/o Institute of Information Science, Academia Sinica,
128 Sec. 2 Academy Rd., Nankang, Taipei, 115 Taiwan
Tel: +886-2-2788-3799 ext. 1502
Fax: +886-2-2788-1638
E-Mail: aclclp@hp.iis.sinica.edu.tw 

Payment: please fill in the payment form  




MAT-2000Edu

  • Database Name: MAT-2000Edu
  • Speech File Editing Program: VEDITOR 4.1p
  • Database Brief (PDF)
The MAT Speech Database, including the speech file editing program, is stored on 2 DVDs.

The MAT Speech Database (MATDB) is the result of the research program subsidized by the National Science Council of the Executive Yuan, and ACLCLP is authorized to release it. Applicants should apply by signing the license agreement and complying with its terms.

Documents required:

  1. A certificate from the applicant's affiliated institution indicating his/her status at this institution
  2. Two (2) original copies of the License Agreement (one kept by ACLCLP, the other by the applicant).
  3. Price: US$700.-


Please send the documents to:

The Association for Computational Linguistics and Chinese Language Processing (ACLCLP)
c/o Institute of Information Science, Academia Sinica,
128 Sec. 2 Academy Rd., Nankang, Taipei, 115 Taiwan
Tel: +886-2-2788-3799 ext. 1502
Fax: +886-2-2788-1638
E-Mail: aclclp@hp.iis.sinica.edu.tw 

Payment: please fill in the payment form  




MAT-2000Com

  • Database Name: MAT-2000Com
  • Speech File Editing Program: VEDITOR 4.1p
  • Database Brief (PDF)
The MAT Speech Database, including the speech file editing program, is stored on 2 DVDs.

The MAT Speech Database (MATDB) is the result of the research program subsidized by the National Science Council of the Executive Yuan, and ACLCLP is authorized to release it. Applicants should apply by signing the license agreement and complying with its terms.

Documents required:

  1. A certificate from the applicant's affiliated institution indicating his/her status at this institution
  2. Two (2) original copies of the License Agreement (one kept by ACLCLP, the other by the applicant).
  3. Price: US$3,500.-


Please send the documents to:

The Association for Computational Linguistics and Chinese Language Processing (ACLCLP)
c/o Institute of Information Science, Academia Sinica,
128 Sec. 2 Academy Rd., Nankang, Taipei, 115 Taiwan
Tel: +886-2-2788-3799 ext. 1502
Fax: +886-2-2788-1638
E-Mail: aclclp@hp.iis.sinica.edu.tw 

Payment: please fill in the payment form  




MAT-2500ExtV-Edu

  • Database Name: MAT-2500ExtV-Edu
  • Speech File Editing Program: VEDITOR, VAT2WAV
  • Database Brief (PDF)
The MAT Speech Database, including the speech file editing program, is stored on 2 DVDs.

The MAT Speech Database (MATDB) is the result of the research program subsidized by the National Science Council of the Executive Yuan, and ACLCLP is authorized to release it. Applicants should apply by signing the license agreement and complying with its terms.

Documents required:

  1. A certificate from the applicant's affiliated institution indicating his/her status at this institution
  2. Two (2) original copies of the License Agreement (one kept by ACLCLP, the other by the applicant).
  3. Price: US$350.-


Please send the documents to:

The Association for Computational Linguistics and Chinese Language Processing (ACLCLP)
c/o Institute of Information Science, Academia Sinica,
128 Sec. 2 Academy Rd., Nankang, Taipei, 115 Taiwan
Tel: +886-2-2788-3799 ext. 1502
Fax: +886-2-2788-1638
E-Mail: aclclp@hp.iis.sinica.edu.tw 

Payment: please fill in the payment form  




MAT-2500ExtV-Com

  • Database Name: MAT-2500ExtV-Com
  • Speech File Editing Program: VEDITOR, VAT2WAV
  • Database Brief (PDF)
The MAT Speech Database, including the speech file editing program, is stored on 1 DVD.

The MAT Speech Database (MATDB) is the result of the research program subsidized by the National Science Council of the Executive Yuan, and ACLCLP is authorized to release it. Applicants should apply by signing the license agreement and complying with its terms.

Documents required:

  1. A certificate from the applicant's affiliated institution indicating his/her status at this institution
  2. Two (2) original copies of the License Agreement (one kept by ACLCLP, the other by the applicant).
  3. Price: US$3,500.-


Please send the documents to:

The Association for Computational Linguistics and Chinese Language Processing (ACLCLP)
c/o Institute of Information Science, Academia Sinica,
128 Sec. 2 Academy Rd., Nankang, Taipei, 115 Taiwan
Tel: +886-2-2788-3799 ext. 1502
Fax: +886-2-2788-1638
E-Mail: aclclp@hp.iis.sinica.edu.tw 

Payment: please fill in the payment form  




TCC-300Edu

  • Database Name: TCC-300Edu
  • Speech File Editing Program: VEDITOR 5.0p
  • Database Brief (PDF)
The Microphone Speech Database, including the speech file editing program, is stored on 1 DVD.

This is a collection of microphone speech databases produced by National Taiwan University, National Cheng Kung University, and National Chiao Tung University. ACLCLP is authorized to release it. Applicants should apply by signing the license agreement and complying with its terms.

Documents required:

  1. A certificate from the applicant's affiliated institution indicating his/her status at this institution
  2. Two (2) original copies of the License Agreement (one kept by ACLCLP, the other by the applicant).
  3. Price: US$50.-


Please send the documents to:

The Association for Computational Linguistics and Chinese Language Processing (ACLCLP)
c/o Institute of Information Science, Academia Sinica,
128 Sec. 2 Academy Rd., Nankang, Taipei, 115 Taiwan
Tel: +886-2-2788-3799 ext. 1502
Fax: +886-2-2788-1638
E-Mail: aclclp@hp.iis.sinica.edu.tw 

Payment: please fill in the payment form  




TCC-300Com

  • Database Name: TCC-300Com
  • Speech File Editing Program: VEDITOR 5.0p
  • Database Brief (PDF)
The Microphone Speech Database, including the speech file editing program, is stored on 1 DVD.

This is a collection of microphone speech databases produced by National Taiwan University, National Cheng Kung University, and National Chiao Tung University. ACLCLP is authorized to release it. Applicants should apply by signing the license agreement and complying with its terms.

Documents required:

  1. A certificate from the applicant's affiliated institution indicating his/her status at this institution
  2. Two (2) original copies of the License Agreement (one kept by ACLCLP, the other by the applicant).
  3. Price: US$3,500.-


Please send the documents to:

The Association for Computational Linguistics and Chinese Language Processing (ACLCLP)
c/o Institute of Information Science, Academia Sinica,
128 Sec. 2 Academy Rd., Nankang, Taipei, 115 Taiwan
Tel: +886-2-2788-3799 ext. 1502
Fax: +886-2-2788-1638
E-Mail: aclclp@hp.iis.sinica.edu.tw 

Payment: please fill in the payment form  




EAT-ALL

The EAT corpus contains three groups of channels (PSTN, MIC16K, and GSM) and is stored on three DVDs. The PSTN and GSM corpora are stored on the same DVD, which is labeled “PSTN +GSM”. Because the MIC16K speech data has a high sampling rate, its storage requirement is much larger, so the MIC16K speech is stored on two DVDs labeled “Mic16K English” and “Mic16K NonEnglish”, for English-department and non-English-department speech, respectively.
The English Across Taiwan (EAT) corpus was developed jointly by the Association for Computational Linguistics and Chinese Language Processing. Applicants should apply by signing the license agreement and complying with its terms.

Documents required:

  1. A certificate from the applicant's affiliated institution indicating his/her status at this institution
  2. Two (2) original copies of the license agreement, Non-profit Version or Commercial Version (one kept by ACLCLP, the other by the applicant).
  3. Price:
    • Non-profit organizations: US$1,350.-
    • Commercial organizations: US$13,500.-


Please send the documents to:

The Association for Computational Linguistics and Chinese Language Processing (ACLCLP)
c/o Institute of Information Science, Academia Sinica,
128 Sec. 2 Academy Rd., Nankang, Taipei, 115 Taiwan
Tel: +886-2-2788-3799 ext. 1502
Fax: +886-2-2788-1638
E-Mail: aclclp@hp.iis.sinica.edu.tw 

Payment: please fill in the payment form  




EAT-200

The EAT corpus contains three groups of channels (PSTN, MIC16K, and GSM) and is stored on one DVD. The PSTN and GSM corpora are stored on the same DVD, which is labeled “PSTN +GSM”. Because the MIC16K speech data has a high sampling rate, its storage requirement is much larger, so the MIC16K speech is stored on two DVDs labeled “Mic16K English” and “Mic16K NonEnglish”, for English-department and non-English-department speech, respectively.
The English Across Taiwan (EAT) corpus was developed jointly by the Association for Computational Linguistics and Chinese Language Processing. Applicants should apply by signing the license agreement and complying with its terms.

Documents required:

  1. A certificate from the applicant's affiliated institution indicating his/her status at this institution
  2. Two (2) original copies of the license agreement, Non-profit Version or Commercial Version (one kept by ACLCLP, the other by the applicant).
  3. Price:
    • Non-profit organizations: US$350.-
    • Commercial organizations: US$3,500.-


Please send the documents to:

The Association for Computational Linguistics and Chinese Language Processing (ACLCLP)
c/o Institute of Information Science, Academia Sinica,
128 Sec. 2 Academy Rd., Nankang, Taipei, 115 Taiwan
Tel: +886-2-2788-3799 ext. 1502
Fax: +886-2-2788-1638
E-Mail: aclclp@hp.iis.sinica.edu.tw 

Payment: please fill in the payment form  




MATBN

The MATBN Mandarin Chinese broadcast news corpus is the product of a joint project sponsored by the National Science Council, Taiwan. It contains a total of 198 one-hour news shows from the Public Television Service Foundation, Taiwan, with corresponding transcripts. The primary purpose of this collection is to provide training and testing data for continuous speech recognition evaluation in the broadcast news domain.

Documents required:

  1. A certificate from the applicant's affiliated institution indicating his/her status at this institution
  2. Two (2) original copies of the license agreement (Download agreement) (one kept by ACLCLP, the other by the applicant).
  3. Price: US$1,350.-


Please send the documents to:

The Association for Computational Linguistics and Chinese Language Processing (ACLCLP)
c/o Institute of Information Science, Academia Sinica,
128 Sec. 2 Academy Rd., Nankang, Taipei, 115 Taiwan
Tel: +886-2-2788-3799 ext. 1502
Fax: +886-2-2788-1638
E-Mail: aclclp@hp.iis.sinica.edu.tw 

Payment: please fill in the payment form  




COSPRO & Toolkit

The Sinica COSPRO (Mandarin Continuous Speech Prosody Corpora) and Toolkit was designed, collected and annotated by Dr. Chiu-yu Tseng and her research group at the Phonetics Lab, Institute of Linguistics, Academia Sinica, Taipei, Taiwan. The package of 4 DVDs contains 10.5 GB (7.7 GB annotated) of speech corpora and the Toolkit. Funding for corpus collection and toolkit development came exclusively from Academia Sinica, mainly through three Academia Sinica interdisciplinary Theme Projects: “Collaborating Researches on Chinese Information Processing-Subproject on Mandarin Chinese Speech Database” (1994.7-1999.7), “Knowledge Representation and Language Engineering for Mandarin Chinese --- Man-machine Voice Interface Environment and Its Tools” (1997.7-2002.6), and “New Directions for Mandarin Speech Synthesis: From Prosodic Organization to More Natural Output” (January 2003 to December 2005). ACLCLP is authorized to release it. Applicants should apply by signing the license agreement and complying with its terms.

Documents required:

  1. A certificate from the applicant's affiliated institution indicating his/her status at this institution
  2. Two (2) original copies of the Licensing Agreement (one kept by ACLCLP, the other by the applicant).
  3. Price:
    • Nonprofit overseas institution: US$800.-
    • Other overseas organizations: US$2,400.-


Please send the documents to:

The Association for Computational Linguistics and Chinese Language Processing (ACLCLP)
c/o Institute of Information Science, Academia Sinica,
128 Sec. 2 Academy Rd., Nankang, Taipei, 115 Taiwan
Tel: +886-2-2788-3799 ext. 1502
Fax: +886-2-2788-1638
E-Mail: aclclp@hp.iis.sinica.edu.tw 

Payment: please fill in the payment form  




AESOP-ILAS (Asian English Speech cOrpus Project - Institute of Linguistics, Academia Sinica) Corpora

  • Database Name: AESOP-ILAS (Asian English Speech cOrpus Project - Institute of Linguistics, Academia Sinica) Corpora
  • Database Brief

The AESOP-ILAS speech corpus is especially designed for the Taiwan division of the multinational research project AESOP (Asian English Speech Corpus Project), featuring L2 English speech by native speakers of Taiwan Mandarin. The principal investigator of this project is Dr. Chiu-yu TSENG, Distinguished Research Fellow and Director of the Institute of Linguistics, Academia Sinica. The project aims to build up a corpus of the English spoken in Taiwan as an open resource and to investigate a wide range of communicative phonetic and prosodic features in Taiwan L2 English at the segmental, lexical, phrasal, and discourse levels, rather than focusing on specific and individual phenomena. It should be useful for research and development in language teaching, language modeling, phonetic research and applications to speech synthesis and recognition.

AESOP-ILAS was released in April 2015 for non-commercial academic research use only. ACLCLP is authorized to release it. Applicants should apply by signing the license agreement and complying with its terms. For commercial applications, please contact the Department of Intellectual Property and Technology Transfer, Academia Sinica. (Website: http://otl.sinica.edu.tw/en/ ; Tel: +886-2-2787-2509)

Documents required:

  1. A certificate from the applicant's affiliated institution indicating his/her status at this institution
  2. Two (2) original copies of the Licensing Agreement (one kept by ACLCLP, the other by the applicant).
  3. Price: Nonprofit overseas institution: US$1,000.-


Please send the documents to:

The Association for Computational Linguistics and Chinese Language Processing (ACLCLP)
c/o Institute of Information Science, Academia Sinica,
128 Sec. 2 Academy Rd., Nankang, Taipei, 115 Taiwan
Tel: +886-2-2788-3799 ext. 1502
Fax: +886-2-2788-1638
E-Mail: aclclp@hp.iis.sinica.edu.tw 

Payment: please fill in the payment form  




Sinica MCDC8

Sinica MCDC8 includes the sound files (.wav) and transcripts of eight Mandarin Chinese conversations in .TextGrid format (PRAAT) with signal-aligned time information. For details, please visit the Spoken Mandarin Resource and Research website (http://mmc.sinica.edu.tw/). Sinica MCDC8 is the result of several research projects funded by Academia Sinica, and ACLCLP is authorized to release it. Applicants should apply by signing the license agreement and complying with its terms.

Documents required:

  1. A certificate from the applicant's affiliated institution indicating his/her status at this institution.
  2. Three (3) original copies of the Licensing Agreement.
  3. Price:
    • Nonprofit overseas academic institutions
      • ACLCLP members US$2,000.-
      • Non-members US$2,100.-
    • Other overseas organizations
      • ACLCLP members US$6,000.-
      • Non-members US$6,400.-


Please send the documents to:

The Association for Computational Linguistics and Chinese Language Processing (ACLCLP)
c/o Institute of Information Science, Academia Sinica,
128 Sec. 2 Academy Rd., Nankang, Taipei, 115 Taiwan
Tel: +886-2-2788-3799 ext. 1502
Fax: +886-2-2788-1638
E-Mail: aclclp@hp.iis.sinica.edu.tw 

Payment: please fill in the payment form  




Sinica Phone-aligned Chinese Conversational Speech Database

Sinica Phone-aligned Chinese Conversational Speech Database consists of 3.5 hours of Chinese conversational speech produced by 16 speakers, totalling 1 GB of speech data. This database is part of the Sinica MCDC8 Corpus. The alignment information includes SYLLABLE and PHONE in .TextGrid format (PRAAT), verified by professional phonetic labellers. For details, please visit the Spoken Mandarin Resource and Research website (http://mmc.sinica.edu.tw/).
Sinica Phone-aligned Chinese Conversational Speech Database is the result of several research projects funded by Academia Sinica. The ACLCLP is authorized to release it. Applicants should apply by signing the license agreement and complying with the terms on the license agreement. 
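
As a rough illustration of how the SYLLABLE and PHONE alignments might be read, the following Python sketch extracts interval tiers from a TextGrid in Praat's long text format. The function name read_intervals, the sample data, and the assumption that the tiers are literally named SYLLABLE and PHONE are illustrative only and are not taken from the database documentation; the actual files may use a different layout or text encoding.

```python
import re

# One interval in Praat's long-format TextGrid: index, xmin, xmax, text.
INTERVAL_RE = re.compile(
    r'intervals \[\d+\]:\s*'
    r'xmin = (?P<xmin>[\d.eE+-]+)\s*'
    r'xmax = (?P<xmax>[\d.eE+-]+)\s*'
    r'text = "(?P<text>(?:[^"]|"")*)"'
)
TIER_NAME_RE = re.compile(r'name = "(?P<name>[^"]*)"')


def read_intervals(textgrid: str) -> dict[str, list[tuple[float, float, str]]]:
    """Return {tier name: [(start, end, label), ...]} for each interval tier."""
    tiers = {}
    # Each tier starts with "item [n]:"; the header "item []:" has no digits
    # and is therefore not treated as a split point.
    for chunk in re.split(r'item \[\d+\]:', textgrid)[1:]:
        name = TIER_NAME_RE.search(chunk)
        if name is None:
            continue
        tiers[name.group("name")] = [
            (float(m.group("xmin")), float(m.group("xmax")),
             m.group("text").replace('""', '"'))  # Praat doubles inner quotes
            for m in INTERVAL_RE.finditer(chunk)
        ]
    return tiers


# Hypothetical two-tier sample, not taken from the database itself.
SAMPLE = '''File type = "ooTextFile"
Object class = "TextGrid"

xmin = 0
xmax = 0.5
tiers? <exists>
size = 2
item []:
    item [1]:
        class = "IntervalTier"
        name = "SYLLABLE"
        xmin = 0
        xmax = 0.5
        intervals: size = 1
        intervals [1]:
            xmin = 0
            xmax = 0.5
            text = "ni"
    item [2]:
        class = "IntervalTier"
        name = "PHONE"
        xmin = 0
        xmax = 0.5
        intervals: size = 2
        intervals [1]:
            xmin = 0
            xmax = 0.2
            text = "n"
        intervals [2]:
            xmin = 0.2
            xmax = 0.5
            text = "i"
'''

if __name__ == "__main__":
    for tier, intervals in read_intervals(SAMPLE).items():
        print(tier, intervals)
```

The sketch handles interval tiers only. Point tiers and Praat's short text format would need additional handling.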

Documents required:

  1. A certificate from the applicant's affiliated institution indicating his/her status at this institution.
  2. Three (3) original copies of the Licensing Agreement.
  3. Price:
    • Nonprofit overseas academic institutions
      • ACLCLP members US$1,500.-
      • Non-members US$1,550.-
    • Other overseas organizations
      • ACLCLP members US$15,000.-
      • Non-members US$15,100.-


Please send the documents to:

The Association for Computational Linguistics and Chinese Language Processing (ACLCLP)
c/o Institute of Information Science, Academia Sinica,
128 Sec. 2 Academy Rd., Nankang, Taipei, 115 Taiwan
Tel: +886-2-2788-3799 ext. 1502
Fax: +886-2-2788-1638
E-Mail: aclclp@hp.iis.sinica.edu.tw 

Payment: please fill in the payment form  



Address:c/o IIS, Academia Sinica, 128 Academia Road, Section 2, Nankang, Taipei 115, Taiwan
Tel:886-2-27883799*1502, Fax:886-2-27881638, E-mail:aclclp@aclclp.org.tw;aclclp@hp.iis.sinica.edu.tw