Please use this identifier to cite or link to this item:
http://localhost:8080/xmlui/handle/123456789/3520
Title: | A REVIEW ON INSTANCE AND FEATURE SELECTION IN BIG DATA ENVIRONMENT |
Keywords: | Big Data data reduction feature selection hashing instance selection |
Issue Date: | 1-Jun-2017 |
Abstract: | Instance and feature selection has become an effective approach due to the enormous data which is continuously being produced in the field of research. It is difficult to process such large datasets by many systems. Though the traditional techniques are useful for large datasets, the numbers when in hundreds, thousands or millions face scaling problems. The proposed work focuses on, scalable instance and feature selection in big data environment. Locality-sensitive hashing instance selection F (LSH-IS-F) is a two pass method used to find similar instances along with Pearson correlation coefficient for feature selection. Hash function family is used which is a general method of reducing the size of a set; this is achieved by reindexing the elements into buckets. This process find similar instance and features in same bucket, hence instance/features can be reduced. The work aims at improving the performance of locality sensitive hashing by storing extra statistics of the instances and features that is assigned to each class in the bucket and also to improve accuracy of instance and feature selection algorithm by prototype generation. |
URI: | http://192.168.3.232:8080/jspui/handle/123456789/3520 |
Appears in Collections: | MCA Dept. Faculty/Staff |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
4.1 A_REVIEW_ON_INSTANCE_AND_FEATURE_SELECTION_IN_BIG_DATA_ENVIRONMENT_ijariie4027.pdf | 382.07 kB | Unknown | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.