LEADER 03118cim a2200685 i 4500001 15578013 005 20240521123316.0 006 m o h 007 cr||na|||||||| 008 200928p2019 paunnn o nn pol d 020 1585639036 024 8 8035544613851 |qISLRN 035 15578013 040 CtY |beng |erda |cCtY 041 0 pol 043 e-pl--- 050 4 PG6074.5 090 yuldset 090 yuldsetsnd 090 yuldsettxt 245 00 Polish speech database / |cLinguistic Data Consortium. 264 1 [Philadelphia, PA] : |b[Linguistic Data Consortium], |c[2019] 300 1 online resource 336 computer dataset |bcod |2rdacontent 336 spoken word |bspw |2rdacontent 336 text |btxt |2rdacontent 337 computer |bc |2rdamedia 338 online resource |bcr |2rdacarrier 347 audio file |2rdaft 347 text file |2rdaft 347 |bFLAC |bTXT 500 Applications: speech recognition. 500 Authors: Tomasz Szwelnik, Jacek Kawalec, Dorota Gutowska. 500 Data source: microphone speech. 500 LDC number: LDC2019S19. 506 Access restricted by licensing agreement. 520 "Polish Speech Database was developed by VoiceLab. It consists of 263,424 utterances of Polish speech data from 200 speakers, totaling approximately 280 hours, and corresponding transcripts. Data collection was performed in Poland. Speakers were asked to record themselves for at least 60 minutes from their home computer using a headset while reading text on a website. The text was comprised of sentences covering most speech sounds in Polish. The database includes speaker metadata. There were 103 male speakers and 97 female speakers. Their ages ranged from 15 years to 60 years of age. Most were in the 15-30 years age range. Speech data is presented as 16,000 Hz, 16-bit, single channel, flac compressed wav files. Transcripts are UTF-8 encoded plain text." --LDC online catalog. 546 In Polish. 588 Title from resource home page (LDC website, viewed September 28, 2020). 590 Access is available to the Yale community. 650 0 Polish language |xSpoken Polish |xData processing. 650 0 Automatic speech recognition. 650 0 Computational linguistics. 650 0 Text data mining. 650 0 Corpora (Linguistics) 650 0 Audio data mining. 655 7 Data sets. |2lcgft 655 7 Sound recordings. |2lcgft 655 7 Speech corpora. |2lcgft 655 7 Text corpora. |2lcgft 700 1 Szwelnik, Tomasz, |ecreator. 710 2 Linguistic Data Consortium, |eissuing body. 852 80 |zOnline resource 856 40 |yOnline dataset |uhttps://ssrs.yale.edu/data/SSDA/ldc/LDC2019S19/ 856 42 |3Documentation |uhttps://catalog.ldc.upenn.edu/docs/LDC2019S19/ 901 PG6074.5 902 Yale Internet Resource |bYale Internet Resource >> None|DELIM|15558111 905 online resource 907 2020-09-28T15:58:13.000Z 946 DO NOT EXPORT.