2.39M
Category: databasedatabase

Data Mining

1.

DATA MINING
Borodulin V. A. KI19-01

2.

DEFINITION
Data mining is the process of extracting and discovering patterns in large data sets involving methods at
the intersection of machine learning, statistics, and database systems. Data mining is an interdisciplinary
subfield of computer science and statistics with an overall goal of extracting information (with intelligent
methods) from a data set and transforming the information into a comprehensible structure for further
use. Data mining is the analysis step of the "knowledge discovery in databases" process, or KDD. Aside
from the raw analysis step, it also involves database and data management aspects, data pre-processing,
model and inference considerations, interestingness metrics, complexity considerations, post-processing
of discovered structures, visualization, and online updating.

3.

RAPIDMINER

4.

OVERVIEW
RapidMiner is a data science platform that analyses the collective impact
of an organization's data. It was acquired by Altair Engineering in September
2022.
RapidMiner uses a client/server model with the server offered either onpremises or in public or private cloud infrastructures.

5.

PURPOSE
RapidMiner is an enterprise-ready data science platform, that
amplifies the collective impact of organization’s people, expertise
and data for great advantage.

6.

TARGET AUDIENCE
RapidMiner is focused on commercial companies that need
ready-to-go data analysis tool for marketing purpose.

7.

TECHNICAL SUPPORT
RapidMiner has its own technical support that works well, but
also there’s a platform called RapidMiner Marketplace. It offers
different plugins and solutions for every possible needs.

8.

USABILITY
The main advantage of RapidMiner is that analyst using this
tool don’t need to learn programming, he can just use end-product
for analyzing.

9.

10.

11.

LINKS
https://rapidminer.com/
https://marketplace.rapidminer.com/UpdateServer/

12.

WEKA

13.

OVERVIEW
Waikato Environment for Knowledge Analysis (Weka) is a
collection of machine learning and data analysis free software
licensed under the GNU General Public License. It was developed
at the University of Waikato, New Zealand and is the companion
software to the book "Data Mining: Practical Machine Learning
Tools and Techniques"

14.

PURPOSE
Weka is a free educational researching tool.

15.

TARGET AUDIENCE
Target audience is a students, universities and anyone willing to
learn data mining.

16.

EFFICIENCY
Quite efficient program for everyone to use it with no
problems.

17.

TECHNICAL SUPPORT
As it is free software you can find a lot of guides and solutions
online, but as well there are several official free online courses
posted on YouTube.
https://www.cs.waikato.ac.nz/ml/weka/courses.html
https://www.youtube.com/user/WekaMOOC

18.

USABILITY
It’s free software with simple interface but using it requires
knowledge of Java programming language.

19.

20.

21.

LINKS
https://www.cs.waikato.ac.nz/ml/index.html
https://www.cs.waikato.ac.nz/ml/weka/courses.html
https://www.youtube.com/user/WekaMOOC

22.

THANK YOU FOR YOUR ATTENTION
English     Русский Rules