International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
|
Volume 123 - Issue 16 |
Published: August 2015 |
Authors: Gend Lal Prajapati, Rekha Saha |
![]() |
Gend Lal Prajapati, Rekha Saha . A Statistical Approach for Estimating Language Model Reliability with Effective Smoothing Technique. International Journal of Computer Applications. 123, 16 (August 2015), 31-35. DOI=10.5120/ijca2015905763
@article{ 10.5120/ijca2015905763, author = { Gend Lal Prajapati,Rekha Saha }, title = { A Statistical Approach for Estimating Language Model Reliability with Effective Smoothing Technique }, journal = { International Journal of Computer Applications }, year = { 2015 }, volume = { 123 }, number = { 16 }, pages = { 31-35 }, doi = { 10.5120/ijca2015905763 }, publisher = { Foundation of Computer Science (FCS), NY, USA } }
%0 Journal Article %D 2015 %A Gend Lal Prajapati %A Rekha Saha %T A Statistical Approach for Estimating Language Model Reliability with Effective Smoothing Technique%T %J International Journal of Computer Applications %V 123 %N 16 %P 31-35 %R 10.5120/ijca2015905763 %I Foundation of Computer Science (FCS), NY, USA
Language Model smoothing is an imperative technology which deals with unseen test data by re-evaluating some zero-probability n-grams and assign them bare minimum non-zero values. There is an assortment of smoothing techniques employed to trim down tiny amount of probability from the probable grams and share out to zero probable grams within a Language Model. Kneser Ney and Latent Dirichlet Allocation algorithm are two probable techniques used for proficient smoothing. In this paper, a scheme is proposed for effective smoothing by combining Kneser Ney and Latent Dirichlet Allocation approaches. Moreover, another scheme is proposed to measure the reliability of a Language Model and determine the association between entropy and perplexity. These schemes are demonstrated by appropriate examples.