In:
The Journal of the Acoustical Society of America, Acoustical Society of America (ASA), Vol. 126, No. 4, Supplement (2009-10-01), p. 2221
Abstract:
When discriminating among basic human emotions such as neutral, joy, sadness, and anger, the two states that are most difficult to tell apart automatically are joy and anger. Because these two emotions are similar on the arousal scale, they are hard to distinguish using only pitch- and energy-related feature measurements. This study therefore focuses on additional feature parameters, related to voice quality, that are useful for discriminating between joy and anger. For the voice-quality features, global statistics of normalized spectral band energy, spectral tilt, open quotient, and first-formant bandwidth, along with their respective slopes and convexities, are measured from the happy and angry speech in a Korean emotional database. ANOVA tests indicate that the parameters of normalized spectral band energy, spectral tilt, and open quotient are useful. In addition, the slopes and convexities of the voice-quality measurements appear to be more important than the values themselves. These results are meaningful for classifying emotional states distributed along the valence scale and are expected to contribute to improving overall emotion recognition systems.
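The abstract does not specify how the global statistics, slopes, and convexities are computed. A minimal sketch of one plausible interpretation follows: given a per-frame voice-quality trajectory (e.g., spectral tilt per frame), compute its mean and standard deviation, a least-squares linear slope over the frame index, and a convexity estimate. The function name, the use of the mean second difference as a convexity proxy, and all details below are illustrative assumptions, not the authors' method.

```python
# Hypothetical sketch: global statistics of a framewise voice-quality
# feature track (e.g., spectral tilt values, one per analysis frame).
from statistics import mean, pstdev

def trajectory_stats(x):
    """Return (mean, std, slope, convexity) of a framewise feature track.

    slope: linear least-squares slope over the frame index.
    convexity: mean second difference of the track, used here as a
    simple proxy for the quadratic trend (positive => convex shape).
    """
    n = len(x)
    t = list(range(n))
    mt, mx = mean(t), mean(x)
    # least-squares slope: cov(t, x) / var(t)
    cov = sum((ti - mt) * (xi - mx) for ti, xi in zip(t, x))
    var = sum((ti - mt) ** 2 for ti in t)
    slope = cov / var
    # mean second difference as a convexity proxy
    convexity = mean(x[i + 1] - 2 * x[i] + x[i - 1] for i in range(1, n - 1))
    return mean(x), pstdev(x), slope, convexity
```

On a purely linear track the convexity proxy is zero and the slope recovers the line's gradient; on a quadratic track the proxy is constant and positive, which is the kind of shape distinction the abstract suggests matters more than the raw values.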
Type of Medium:
Online Resource
ISSN:
0001-4966, 1520-8524
Language:
English
Publisher:
Acoustical Society of America (ASA)
Publication Date:
2009
ZDB-ID:
1461063-2