|
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
|
| Volume 123 - Issue 16 |
| Published: August 2015 |
| Authors: Gend Lal Prajapati, Rekha Saha |
10.5120/ijca2015905763
|
Gend Lal Prajapati, Rekha Saha . A Statistical Approach for Estimating Language Model Reliability with Effective Smoothing Technique. International Journal of Computer Applications. 123, 16 (August 2015), 31-35. DOI=10.5120/ijca2015905763
@article{ 10.5120/ijca2015905763,
author = { Gend Lal Prajapati,Rekha Saha },
title = { A Statistical Approach for Estimating Language Model Reliability with Effective Smoothing Technique },
journal = { International Journal of Computer Applications },
year = { 2015 },
volume = { 123 },
number = { 16 },
pages = { 31-35 },
doi = { 10.5120/ijca2015905763 },
publisher = { Foundation of Computer Science (FCS), NY, USA }
}
%0 Journal Article
%D 2015
%A Gend Lal Prajapati
%A Rekha Saha
%T A Statistical Approach for Estimating Language Model Reliability with Effective Smoothing Technique%T
%J International Journal of Computer Applications
%V 123
%N 16
%P 31-35
%R 10.5120/ijca2015905763
%I Foundation of Computer Science (FCS), NY, USA
Language Model smoothing is an imperative technology which deals with unseen test data by re-evaluating some zero-probability n-grams and assign them bare minimum non-zero values. There is an assortment of smoothing techniques employed to trim down tiny amount of probability from the probable grams and share out to zero probable grams within a Language Model. Kneser Ney and Latent Dirichlet Allocation algorithm are two probable techniques used for proficient smoothing. In this paper, a scheme is proposed for effective smoothing by combining Kneser Ney and Latent Dirichlet Allocation approaches. Moreover, another scheme is proposed to measure the reliability of a Language Model and determine the association between entropy and perplexity. These schemes are demonstrated by appropriate examples.