{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2022,4,5]],"date-time":"2022-04-05T16:02:48Z","timestamp":1649174568839},"reference-count":3,"publisher":"World Scientific Pub Co Pte Lt","issue":"03","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Semantic Computing"],"published-print":{"date-parts":[[2007,9]]},"abstract":"<jats:p> Automatic sentence segmentation of spoken language is an important precursor to downstream natural language processing. Previous studies combine lexical and prosodic features, but can impose significant computational challenges because of the large size of feature sets. Little is understood about which features most benefit performance, particularly for speech data from different speaking styles. We compare sentence segmentation for speech from broadcast news versus natural multi-party meetings, using identical lexical and prosodic feature sets across genres. Results based on boosting and forward selection for this task show that (1) features sets can be reduced with little or no loss in performance, and (2) the contribution of different feature types differs significantly by genre. We conclude that more efficient approaches to sentence segmentation and similar tasks can be achieved, especially if genre differences are taken into account. <\/jats:p>","DOI":"10.1142\/s1793351x07000202","type":"journal-article","created":{"date-parts":[[2007,12,6]],"date-time":"2007-12-06T03:09:52Z","timestamp":1196910592000},"page":"335-346","source":"Crossref","is-referenced-by-count":0,"title":["CROSS-GENRE FEATURE COMPARISONS FOR SPOKEN SENTENCE SEGMENTATION"],"prefix":"10.1142","volume":"01","author":[{"given":"SEBASTIEN","family":"CUENDET","sequence":"first","affiliation":[{"name":"International Computer Science Institute, Berkeley, CA 94704, USA"}]},{"given":"DILEK","family":"HAKKANI-TUR","sequence":"additional","affiliation":[{"name":"International Computer Science Institute, Berkeley, CA 94704, USA"}]},{"given":"ELIZABETH","family":"SHRIBERG","sequence":"additional","affiliation":[{"name":"International Computer Science Institute, Berkeley, CA 94704, USA"},{"name":"SRI International, Menlo Park, CA 94025, USA"}]},{"given":"JAMES","family":"FUNG","sequence":"additional","affiliation":[{"name":"International Computer Science Institute, Berkeley, CA 94704, USA"}]},{"given":"BENOIT","family":"FAVRE","sequence":"additional","affiliation":[{"name":"International Computer Science Institute, Berkeley, CA 94704, USA"}]}],"member":"219","published-online":{"date-parts":[[2012,1,25]]},"reference":[{"key":"rf1","doi-asserted-by":"publisher","DOI":"10.1017\/CBO9780511621024"},{"key":"rf8","doi-asserted-by":"publisher","DOI":"10.1023\/A:1007649029923"},{"key":"rf10","first-page":"2037","volume":"3","author":"Shriberg E.","journal-title":"Speech Communication"}],"container-title":["International Journal of Semantic Computing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S1793351X07000202","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2019,8,7]],"date-time":"2019-08-07T15:32:11Z","timestamp":1565191931000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/abs\/10.1142\/S1793351X07000202"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2007,9]]},"references-count":3,"journal-issue":{"issue":"03","published-online":{"date-parts":[[2012,1,25]]},"published-print":{"date-parts":[[2007,9]]}},"alternative-id":["10.1142\/S1793351X07000202"],"URL":"https:\/\/doi.org\/10.1142\/s1793351x07000202","relation":{},"ISSN":["1793-351X","1793-7108"],"issn-type":[{"value":"1793-351X","type":"print"},{"value":"1793-7108","type":"electronic"}],"subject":[],"published":{"date-parts":[[2007,9]]}}}