Big Data Analytics with Hadoop

Contributor:游客1264728 Type:English Date time:2016-03-02 17:03:27 Favorite:10 Score:0
返回上页 Report
请选择举报理由:




Collection Modify the typo
Big Data Analytics with Hadoop (3 credits)
In this course, students will learn cutting edge technologies and
concepts related to analysis of big data – data that is too large to process
in the main memory of one computer. The course is organized into two parts. In the first part,
students will build upon their understanding of RDBMS and SQL,
and explore the use of SQL-like queries in a big data environment
(Hadoop distributed file system – HDFS), using tools such as Sqoop, Pig and Hive.
Students will identify typical situations that warrant large data analysis,
move data between relational databases and Hadoop using Sqoop, manage data in HDFS,
and use Pig and Hive to run distributed queries on data.
In the second part of the course, students will build upon their understanding of
data mining techniques and learn to apply them to analyze large datasets.
Students will use Apache Mahout software in the Hadoop ecosystem to explore
item-based collaborative filtering, non-distributed recommenders, frequent itemset mining,
clustering, and some text mining algorithms, including Naïve Bayesian classifier.
Course includes: SQL-like querying in big data cluster; systems, classifiers,
clustering; Hadoop ecosystem overview; and deep dive into Hadoop Pig, Hive and Mahout.
声明:以上文章均为用户自行添加,仅供打字交流使用,不代表本站观点,本站不承担任何法律责任,特此声明!如果有侵犯到您的权利,请及时联系我们删除。
Hot degree:
Difficulty:
quality:
Description: the system according to the heat, the difficulty, the quality of automatic certification, the certification of the article will be involved in typing!

This paper typing ranking TOP20

登录后可见