International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 66 - Issue 20
Published: March 2013
Authors: Jeby K Luthiya, C. Umamaheswari
Jeby K Luthiya, C. Umamaheswari. Development of Replica Free Repositories using Particle Swarm Optimization Algorithm. International Journal of Computer Applications. 66, 20 (March 2013), 8-13. DOI=10.5120/11198-6213
@article{ 10.5120/11198-6213,
  author = { Jeby K Luthiya and C. Umamaheswari },
  title = { Development of Replica Free Repositories using Particle Swarm Optimization Algorithm },
  journal = { International Journal of Computer Applications },
  year = { 2013 },
  volume = { 66 },
  number = { 20 },
  pages = { 8-13 },
  doi = { 10.5120/11198-6213 },
  publisher = { Foundation of Computer Science (FCS), NY, USA }
}
%0 Journal Article
%D 2013
%A Jeby K Luthiya
%A C. Umamaheswari
%T Development of Replica Free Repositories using Particle Swarm Optimization Algorithm
%J International Journal of Computer Applications
%V 66
%N 20
%P 8-13
%R 10.5120/11198-6213
%I Foundation of Computer Science (FCS), NY, USA
The increasing volume of information available in digital media has become a challenging problem for data administrators. Usually built on data gathered from different sources, data repositories such as those used by digital libraries and e-commerce brokers present records with disparate schemata and structures. The growth in volume has also introduced redundant data into these databases, so a method for controlling redundancy and duplication has become essential. The proposed approach uses the Particle Swarm Optimization (PSO) algorithm to generate an optimal similarity measure for deciding whether a record is a duplicate. PSO first generates the optimal similarity measure from the training datasets. Once this measure is obtained, it is used to deduplicate the remaining datasets.
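The abstract does not spell out how the similarity measure is encoded or scored, so the following Python sketch is only one plausible reading, not the authors' implementation. It assumes each candidate record pair has already been reduced to per-field similarity scores in [0, 1], that a particle holds one weight per field plus a decision threshold, and that fitness is plain accuracy on labelled training pairs. The function names, the toy data, and the PSO parameters (`inertia`, `c1`, `c2`) are all hypothetical choices made for illustration.

```python
import random

def classify(weights, threshold, sims):
    """Flag a record pair as duplicate when its weighted similarity reaches the threshold."""
    total = sum(weights) or 1.0  # avoid dividing by zero if every weight is 0
    score = sum(w * s for w, s in zip(weights, sims)) / total
    return score >= threshold

def fitness(particle, pairs):
    """Fraction of labelled training pairs the particle classifies correctly (assumed objective)."""
    *weights, threshold = particle
    hits = sum(classify(weights, threshold, sims) == label for sims, label in pairs)
    return hits / len(pairs)

def pso(pairs, n_fields, n_particles=20, iters=60, seed=0,
        inertia=0.7, c1=1.5, c2=1.5):
    """Standard global-best PSO over [0, 1]^(n_fields + 1)."""
    rng = random.Random(seed)
    dim = n_fields + 1  # one weight per field plus the decision threshold
    pos = [[rng.random() for _ in range(dim)] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    best_pos = [p[:] for p in pos]                      # personal bests
    best_fit = [fitness(p, pairs) for p in pos]
    g = max(range(n_particles), key=best_fit.__getitem__)
    g_pos, g_fit = best_pos[g][:], best_fit[g]          # global best
    for _ in range(iters):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                vel[i][d] = (inertia * vel[i][d]
                             + c1 * r1 * (best_pos[i][d] - pos[i][d])
                             + c2 * r2 * (g_pos[d] - pos[i][d]))
                # keep every coordinate inside the unit interval
                pos[i][d] = min(1.0, max(0.0, pos[i][d] + vel[i][d]))
            f = fitness(pos[i], pairs)
            if f > best_fit[i]:
                best_fit[i], best_pos[i] = f, pos[i][:]
                if f > g_fit:
                    g_fit, g_pos = f, pos[i][:]
    return g_pos, g_fit

# Toy training set: (name similarity, address similarity), is_duplicate
train = [
    ((0.95, 0.30), True), ((0.90, 0.85), True), ((0.88, 0.10), True),
    ((0.20, 0.90), False), ((0.35, 0.80), False), ((0.10, 0.15), False),
]

best, best_fit = pso(train, n_fields=2)
*learned_weights, learned_threshold = best

# Deduplicate an unseen pair with the measure learned from the training data
is_dup = classify(learned_weights, learned_threshold, (0.92, 0.40))
```

Accuracy is used here only because it is the simplest objective to read. On real repositories, where duplicates are rare, an F-measure over the training pairs would likely be a better fitness function.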