• 新华社评论员:聚焦新目标 开启新征程 2019-04-14
  • 德州齐河司法所开展人民调解“回头看”工作 2019-04-14
  • 辽宁:电商成为精准扶贫的“利器” 2019-03-28
  • 人民网评:教师欠薪为何又成新闻了? 2019-03-23
  • 张继科状态低迷 刘国梁倍感压力 2019-03-23
  • 湖北浠水十月村经济史料及其研究价值 2019-03-17
  • 双色球预测开奖号码:数据挖掘专项课程

    Data Mining

    体彩排列3和值走势图 www.3l5g.net Analyze Text, Discover Patterns, Visualize Data. Solve real-world data mining challenges.

    伊利诺伊大学香槟分校

    Coursera

    计算机

    普通(中级)

    6 个月

    • 英语, 韩语
    • 2845

    课程概况

    The Data Mining Specialization teaches data mining techniques for both structured data which conform to a clearly defined schema, and unstructured data which exist in the form of natural language text. Specific course topics include pattern discovery, clustering, text retrieval, text mining and analytics, and data visualization. The Capstone project task is to solve real-world data mining challenges using a restaurant review data set from Yelp.

    Courses 2 – 5 of this Specialization form the lecture component of courses in the online Master of Computer Science Degree in Data Science. You can apply to the degree program either before or after you begin the Specialization.

    你将学到什么

    Data Clustering Algorithms

    Text Mining

    Data Visualization (DataViz)

    Data Mining

    包含课程

    课程1
    Data Visualization

    Learn the general concepts of data mining along with basic methodologies and applications. Then dive into one subfield in data mining: pattern discovery. Learn in-depth concepts, methods, and applications of pattern discovery in data mining. We will also introduce methods for pattern-based classification and some interesting applications of pattern discovery. This course provides you the opportunity to learn skills and content to practice and engage in scalable pattern discovery methods on massive transactional data, discuss pattern evaluation measures, and study methods for mining diverse kinds of patterns, sequential patterns, and sub-graph patterns.

    课程2
    Text Retrieval and Search Engines

    Recent years have seen a dramatic growth of natural language text data, including web pages, news articles, scientific literature, emails, enterprise documents, and social media such as blog articles, forum posts, product reviews, and tweets. Text data are unique in that they are usually generated directly by humans rather than a computer system or sensors, and are thus especially valuable for discovering knowledge about people’s opinions and preferences, in addition to many other kinds of knowledge that we encode in text. This course will cover search engine technologies, which play an important role in any data mining applications involving text data for two reasons. First, while the raw data may be large for any particular problem, it is often a relatively small subset of the data that are relevant, and a search engine is an essential tool for quickly discovering a small subset of relevant text data in a large text collection. Second, search engines are needed to help analysts interpret any patterns discovered in the data by allowing them to examine the relevant original text data to make sense of any discovered pattern. You will learn the basic concepts, principles, and the major techniques in text retrieval, which is the underlying science of search engines.

    课程3
    Text Mining and Analytics

    This course will cover the major techniques for mining and analyzing text data to discover interesting patterns, extract useful knowledge, and support decision making, with an emphasis on statistical approaches that can be generally applied to arbitrary text data in any natural language with no or minimum human effort. Detailed analysis of text data requires understanding of natural language text, which is known to be a difficult task for computers. However, a number of statistical approaches have been shown to work well for the "shallow" but robust analysis of text data for pattern finding and knowledge discovery. You will learn the basic concepts, principles, and major algorithms in text mining and their potential applications.

    课程4
    Pattern Discovery in Data Mining

    Learn the general concepts of data mining along with basic methodologies and applications. Then dive into one subfield in data mining: pattern discovery. Learn in-depth concepts, methods, and applications of pattern discovery in data mining. We will also introduce methods for data-driven phrase mining and some interesting applications of pattern discovery. This course provides you the opportunity to learn skills and content to practice and engage in scalable pattern discovery methods on massive transactional data, discuss pattern evaluation measures, and study methods for mining diverse kinds of patterns, sequential patterns, and sub-graph patterns.

    课程5
    Cluster Analysis in Data Mining

    Discover the basic concepts of cluster analysis, and then study a set of typical clustering methodologies, algorithms, and applications. This includes partitioning methods such as k-means, hierarchical methods such as BIRCH, and density-based methods such as DBSCAN/OPTICS. Moreover, learn methods for clustering validation and evaluation of clustering quality. Finally, see examples of cluster analysis in applications.

    课程6
    Data Mining Project

    Note: You should complete all the other courses in this Specialization before beginning this course. This six-week long Project course of the Data Mining Specialization will allow you to apply the learned algorithms and techniques for data mining from the previous courses in the Specialization, including Pattern Discovery, Clustering, Text Retrieval, Text Mining, and Visualization, to solve interesting real-world data mining challenges. Specifically, you will work on a restaurant review data set from Yelp and use all the knowledge and skills you’ve learned from the previous courses to mine this data set to discover interesting and useful knowledge. The design of the Project emphasizes: 1) simulating the workflow of a data miner in a real job setting; 2) integrating different mining techniques covered in multiple individual courses; 3) experimenting with different ways to solve a problem to deepen your understanding of techniques; and 4) allowing you to propose and explore your own ideas creatively. The goal of the Project is to analyze and mine a large Yelp review data set to discover useful knowledge to help people make decisions in dining. The project will include the following outputs: 1. Opinion visualization: explore and visualize the review content to understand what people have said in those reviews. 2. Cuisine map construction: mine the data set to understand the landscape of different types of cuisines and their similarities. 3. Discovery of popular dishes for a cuisine: mine the data set to discover the common/popular dishes of a particular cuisine. 4. Recommendation of restaurants to help people decide where to dine: mine the data set to rank restaurants for a specific dish and predict the hygiene condition of a restaurant. From the perspective of users, a cuisine map can help them understand what cuisines are there and see the big picture of all kinds of cuisines and their relations. Once they decide what cuisine to try, they would be interested in knowing what the popular dishes of that cuisine are and decide what dishes to have. Finally, they will need to choose a restaurant. Thus, recommending restaurants based on a particular dish would be useful. Moreover, predicting the hygiene condition of a restaurant would also be helpful. By working on these tasks, you will gain experience with a typical workflow in data mining that includes data preprocessing, data exploration, data analysis, improvement of analysis methods, and presentation of results. You will have an opportunity to combine multiple algorithms from different courses to complete a relatively complicated mining task and experiment with different ways to solve a problem to understand the best way to solve it. We will suggest specific approaches, but you are highly encouraged to explore your own ideas since open exploration is, by design, a goal of the Project. You are required to submit a brief report for each of the tasks for peer grading. A final consolidated report is also required, which will be peer-graded.

    声明:MOOC中国发布之课程均源自下列机构,版权均归他们所有。本站仅作报道收录并尊重其著作权益,感谢他们对MOOC事业做出的贡献!(排名不分先后)
    • Coursera
    • edX
    • OpenLearning
    • FutureLearn
    • iversity
    • Udacity
    • NovoEd
    • Canvas
    • Open2Study
    • Google
    • ewant
    • FUN
    • IOC-Athlete-MOOC
    • World-Science-U
    • Codecademy
    • CourseSites
    • opencourseworld
    • ShareCourse
    • gacco
    • MiriadaX
    • JANUX
    • openhpi
    • Stanford-Open-Edx
    • 网易云课堂
    • 中国大学MOOC
    • 学堂在线
    • 顶你学堂
    • 华文慕课
    • 好大学在线CnMooc
    • 以及更多...

    © 2008-2018 www.3l5g.net 慕课改变你,你改变世界

  • 新华社评论员:聚焦新目标 开启新征程 2019-04-14
  • 德州齐河司法所开展人民调解“回头看”工作 2019-04-14
  • 辽宁:电商成为精准扶贫的“利器” 2019-03-28
  • 人民网评:教师欠薪为何又成新闻了? 2019-03-23
  • 张继科状态低迷 刘国梁倍感压力 2019-03-23
  • 湖北浠水十月村经济史料及其研究价值 2019-03-17
  • 刮刮乐甜蜜蜜规则 上海时时彩是骗局吗 新时时彩停售 开奖号码 90足球比分网 pk10牛牛计划群 178彩票走势图表大全 二分彩全天在线计划网 幸运飞艇彩票是哪里的? 幸运28评测网 海南彩票论坛 121彩票走势图 双色球坚持守号中千万大奖 体彩20选5下期杀号 qq彩票混合过关 云南时时彩平台