noisy channel model and spell correction

잠깐 보고 정리해봅니다...

(기억력의 한계를 극복해보고자...)

ref : https://web.stanford.edu/~jurafsky/slp3/5.pdf

일단 noisy channel model은 "original word가 noisy channel에 의해 noisy word(distorted 됐다고 표현)가 되고, 이를 decoder를 통해 original word와 가장 비슷한 것을 추측"하는 모델인데...

스펠러와 연결지어 생각해보면...

1) misspelled word : noisy word(noisy channel을 통해 distorted된 word)

2) noise는 substitutions or other changes to the letters(original word에서 distorted된 상태가 된 원인)

3) channel은 correct word를 찾아내는 model로 보면 됨

호옥시, 잘못된 내용이면 알려주세요.

적극 수정하겠습니다!

Bert Examples (0)	2021.04.30
jaro-winkler similarity(jaro-winkler distance) (0)	2018.05.13
Perplexity in LM (0)	2017.01.16
논문 리뷰(한글 검색 질의어 오타 패턴 분석과 사용자 로그를 이용한 질의어 오타 교정 시스템 구축) (1)	2016.01.11
Moses 학습 & 실행 (0)	2015.07.05

Dani's stack