Wednesday, May 13, 2020
Analyzing The Field Of Big Data - 954 Words
Literature review: To address the question of how and what techniques has been used to manages this big amount of data or in the field of Big Data, I review some research papers and review articles in the field of Big Data. This paper provides the synthesis of those papers which I found relevant to this field. This paper will focus on the following things: â⬠¢ What are the technologies being used in Big data? â⬠¢ Which technology is suitable for which type of data? â⬠¢ Current trends in Big Data field. Fig: Big Data Sources 4.1 Survey Paper: A survey on data stream clustering and classification Authors: Hai-Long Nguyen , Yew-KwongWoon , Wee-KeongNg Published online: 17 December 2014 Purpose: This paper presents a inclusive survey of theâ⬠¦show more contentâ⬠¦Therefore, to randomly access these datasets, which is commonly assumed in traditional data mining, is really expensive. Findings and Learningââ¬â¢s: 1) There are some useful, open source software for data stream mining research:. â⬠¢ WEKA: WEKA is the most popular data mining software for the academic environment. WEKA contains the collection of learning algorithms such as data preprocessing, association rules , classification, regression, clustering, and information visualization. â⬠¢ Massive Online Analysis (MOA): This is based on the WEKA framework that is build and designed for data stream learning. â⬠¢ RapidMiner: RapidMiner is another importantopen source software for data mining. 2) Some important clustering algorithms discussed in this paper to group massive data and can be useful to industries and organization: â⬠¢ Partitioning methods: This algorithm groups dataset into q clusters, where q is a predefined parameter. â⬠¢ It continuously reassigns objects from one group to another group so as to r to minimize its objective function. â⬠¢ Hierarchical methods: In the hierarchical method the aim is to group data objects into a hierarchical tree of clusters. Hierarchical clustering methods can be further classified as either agglomerative or divisive, where the hierarchical decomposition is formed in a bottom up(merging) or top down(splitting) fashion respectively. â⬠¢ Density based methods: Under this method we build up the
Subscribe to:
Post Comments (Atom)
No comments:
Post a Comment
Note: Only a member of this blog may post a comment.