10.5120/17696-8650 |
Smail Tigani, Mouhamed Ouzzif, Abderrahim Hasbi and Rachid Saadane. Article: Low Complexity Algorithm for Probability Density Estimation Applied in Big Data Analysis. International Journal of Computer Applications 101(7):1-5, September 2014. Full text available. BibTeX
@article{key:article, author = {Smail Tigani and Mouhamed Ouzzif and Abderrahim Hasbi and Rachid Saadane}, title = {Article: Low Complexity Algorithm for Probability Density Estimation Applied in Big Data Analysis}, journal = {International Journal of Computer Applications}, year = {2014}, volume = {101}, number = {7}, pages = {1-5}, month = {September}, note = {Full text available} }
Running inference algorithms on a huge quantity of data knows some perturbations and looses performance. One of Big Data aims is the design of fast inference algorithms able to extract hidden information on a big quantity of data. This paper proposes a new low complexity algorithm for probability density estimation given partial observations. In order to reduce the complexity of the algorithm, a finite numerical data support is adopted in this work and observations are classified by frequencies to reduce there number without loosing significance. By frequency classification we mean, the mapping from the space containing all observed values to a space containing each observable value associated with its observation frequency. This approach relies on Lagrange interpolation for approximating the frequencies with a polynomial function and then build the probability density function. To prove the reliability of the approach, a simulation is done and results shows the convergence of discussed parameters to the expected values. Big Data field can benefit considerably from proposed approach to achieve density estimation algorithms goal with low cost.