Removal of Heterogeneous Candidates Using Positional Accuracy Based on Levenshtein Distance on Isolated n-best Recognition

Yun, Young-Sun;

doi:10.7776/ASK.2011.30.8.428

The Journal of the Acoustical Society of Korea (한국음향학회지)

Volume 30 Issue 8
/
Pages.428-435
/
2011
/
1225-4428(pISSN)
/
2287-3775(eISSN)

The Acoustical Society of Korea (한국음향학회)

DOI QR Code

Removal of Heterogeneous Candidates Using Positional Accuracy Based on Levenshtein Distance on Isolated n-best Recognition

레벤스타인 거리 기반의 위치 정확도를 이용하여 다중 음성 인식 결과에서 관련성이 적은 후보 제거

Yun, Young-Sun (Department of Information and Communication Engineering, Hannam University)

윤영선

Received : 2011.08.24
Accepted : 2011.11.03
Published : 2011.11.30

https://doi.org/10.7776/ASK.2011.30.8.428 Citation PDF KSCI

Download PDF

⟨ Previous Next ⟩

Abstract

Many isolated word recognition systems may generate irrelevant words for recognition results because they use only acoustic information or small amount of language information. In this paper, I propose word similarity that is used for selecting (or removing) less common words from candidates by applying Levenshtein distance. Word similarity is obtained by using positional accuracy that reflects the frequency information along to character's alignment information. This paper also discusses various improving techniques of selection of disparate words. The methods include different loss values, phone accuracy based on confusion information, weights of candidates by ranking order and partial comparisons. Through experiments, I found that the proposed methods are effective for removing heterogeneous words without loss of performance.

Keywords

References

X. Huang, A. Acero, and H.-W. Hon, Spoken Language Processing: a guide to theory, algorithm, and system development, pp. 663-674, Prentice Hall, New Jersey, 2001.
J. Li, Y. Tsao and C.-H. Lee, "A study on knowledge source integration for candidate rescoring in automatic speech recognition," in Proc. ICASSP , pp. 837-840, Philadelphia, Pennsylvania, March 2005.
G. Leusch, N. Ueffing, and H. Ney, "A novel string-to-string distance measure with applications to machine translation evaluation," in Proc. MT Summit IX, pp. 240-247, New Orleans, Louisiana, September 2003.
L. Lita, "Dynamic machine translation evaluation methods: algorithmic analysis and generalization," CMU-LTI-05-193, 2005.
S. Young, J. Odell, D. Ollason, V. Valtchev and P. Woodland, The HTK book version 2.1, pp. 197, Cambridge University, 1997.
NIST SCLITE Scoring Package Version 1.5, http://www.icsi.berkeley.edu/Speech/docs/sctk-1.2/sclite.htm, 1997.
J. Park, H. Chung, and Y. Lee, "Development of the point-of-interest input system based on large-vocabulary embedded speech recognition," in Proc. KSPSS, pp. 108-111, November 2007.
I. Melamed, R. Green, and J. Turian, "Precision and recall of machine translation," in Proc. HLT-NAACL, pp.61-63, Edmonton, Canada, May 2003.

The Journal of the Acoustical Society of Korea (한국음향학회지)

Removal of Heterogeneous Candidates Using Positional Accuracy Based on Levenshtein Distance on Isolated n-best Recognition

레벤스타인 거리 기반의 위치 정확도를 이용하여 다중 음성 인식 결과에서 관련성이 적은 후보 제거

Abstract

Keywords

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)