- Data Labeling for Machine Learning: Market Overview, Approaches, and Tools - Dec 13, 2021.
So much of data science and machine learning is founded on having clean and well-understood data sources that it is unsurprising that the data labeling market is growing faster than ever. Here, we highlight many of the top players in this industry and the techniques they use to help you consider which might make a good partner for your needs.
Big Data, Crowdsourcing, Data Classification, Data Labeling, Data Mining, Data Platform
- One question to make your data project 10x more valuable - Feb 1, 2021.
If you are the "data person" for your organization, then providing meaningful results to stakeholder data requests can sometimes feel like shots in the dark. However, you can make sure your data analysis is actionable by asking one magic question before getting started.
Advice, Business, Data Analysis, Data Mining, Data Science, Deployment, Problem Definition
- Data Mining and Machine Learning: Fundamental Concepts and Algorithms: The Free eBook - Jul 21, 2020.
The second edition of Data Mining and Machine Learning: Fundamental Concepts and Algorithms is available to read freely online, and includes a new part on regression with chapters on linear regression, logistic regression, neural networks, deep learning and regression assessment.
Algorithms, Data Mining, Free ebook, Machine Learning
- A Holistic Framework for Managing Data Analytics Projects - May 22, 2020.
Agile project management for Data Science development continues to be an effective framework that enables flexibility and productivity in a field that can experience continuous changes in data and evolving stakeholder expectations. Learn more about the leading approaches for developing Data Science models, and apply them to your next project.
Agile, CRISP-DM, Data Analytics, Data Management, Data Mining, Decision Management, Development, Software Engineering
- The Data Science Puzzle — 2020 Edition - Feb 7, 2020.
The data science puzzle is once again re-examined through the relationship between several key concepts of the landscape, incorporating updates and observations since last time. Check out the results here.
AI, Big Data, Data Mining, Data Science, Deep Learning, Machine Learning
- Market Basket Analysis: A Tutorial - Dec 24, 2019.
This article covers Market Basket Analysis and the Apriori algorithm behind it.
Apriori, Association Rules, Data Mining, Python
- Top Active Blogs on AI, Analytics, Big Data, Data Science, Machine Learning – updated - Jan 14, 2019.
Stay up-to-date with the latest technological advancements using our extensive list of active blogs; this is a list of 100 recently active blogs on Big Data, Data Science, Data Mining, Machine Learning, and Artificial intelligence.
AI, Analytics, Big Data, Blogs, Data Mining, Data Science, Data Visualization, Machine Learning
- Data Mining Book – Chapter Download - Dec 4, 2018.
Download this immediately useful book chapter, and learn how to create derived variables, which allow statistical and Data Science modeling to incorporate human insights.
Data Mining, Data Visualization, Derived Variables, Feature Engineering, JMP, Michael Berry
- Data Mining Book – Chapter Download - Nov 2, 2018.
Download this immediately useful book chapter, and learn how to create derived variables, which allow statistical and Data Science modeling to incorporate human insights.
Data Mining, Data Visualization, Derived Variables, Feature Engineering, JMP, Michael Berry
- What on earth is data science? - Sep 4, 2018.
An overview and discussion around data science, covering the history behind the term, data mining, statistical inference, machine learning, data engineering and more.
Data Mining, Data Science, Decision Making, Statistics
- Every time someone runs a correlation coefficient on two time series, an angel loses their wings - Jun 18, 2018.
We all know correlation doesn’t equal causality at this point, but when working with time series data, correlation can lead you to come to the wrong conclusion.
Correlation, Data Mining, Statistics, Time Series
- 5 Things You Need To Know About Data Science - Feb 19, 2018.
Here are 5 useful things to know about Data Science, including its relationship to BI, Data Mining, Predictive Analytics, and Machine Learning; Data Scientist job prospects; where to learn Data Science; and which algorithms/methods are used by Data Scientists.
Algorithms, BI, Data Analytics, Data Mining, Data Science, Data Science Education, Data Scientist, Google Trends, Jobs, Machine Learning
- Training Sets, Test Sets, and 10-fold Cross-validation - Jan 9, 2018.
More generally, in evaluating any data mining algorithm, if our test set is a subset of our training data the results will be optimistic and often overly optimistic. So that doesn’t seem like a great idea.
Cross-validation, Data Mining, Datasets, Machine Learning
- Process Mining with R: Introduction - Nov 2, 2017.
In recent years, several niche tools have appeared to mine organizational business processes. In this article, we’ll show you that it is possible to get started with “process mining” using well-known data science programming languages as well.
Data Mining, Data Science, Process Mining, R
- Videos for Business Analytics using Data Mining course - Sep 12, 2017.
Here we present links to useful videos for the Business Analytics using Data Mining course.
Business Analytics, Data Mining, Galit Shmueli, Online Education, R, Youtube
- Data Science Primer: Basic Concepts for Beginners - Aug 11, 2017.
This collection of concise introductory data science tutorials covers topics including the difference between data mining and statistics, supervised vs. unsupervised learning, and the types of patterns we can mine from data.
Bias, Data Mining, Data Science, Distribution, Ensemble Methods, Statistics
- Insights from Data mining of Airbnb Listings - Aug 4, 2017.
AirBnB has 2 million listings and operates in 65,000 cities. Here we look at insights related to vacation rental space in the sharing economy using the property listings data for Texas, US.
AirBnB, Data Mining, R, TX
- Top 15 Python Libraries for Data Science in 2017 - Jun 13, 2017.
Since all of the libraries are open source, we have added commits, contributor counts, and other metrics from GitHub, which can serve as proxy metrics for library popularity.
Data Mining, Data Science, Deep Learning, Machine Learning, Natural Language Processing, Python, Visualization
- Data Mining Techniques, Free Chapter: Derived Variables – Making the Data Mean More - Jun 12, 2017.
Download this chapter by Gordon Linoff and Michael Berry, and learn how to create derived variables, which allow the statistical modeling process to incorporate human insights.
Data Mining, Derived Variables, Feature Engineering, JMP, Michael Berry
- Do We Need Balanced Sampling? - May 4, 2017.
Resampling is a very popular solution for dealing with class imbalance. Our research on churn prediction shows that balanced sampling is unnecessary.
Customer Analytics, Data Mining, Data Science
- Fixing Deployment and Iteration Problems in CRISP-DM - Feb 1, 2017.
Many analytic models are not deployed effectively into production while others are not maintained or updated. Applying decision modeling and decision management technology within CRISP-DM addresses this.
Analytics, CRISP-DM, Data Mining, Data Science, Decision Modeling, IIA, Methodology
- Bringing Business Clarity To CRISP-DM - Jan 24, 2017.
Many analytic projects fail to understand the business problem they are trying to solve. Correctly applying decision modeling in the Business Understanding phase of CRISP-DM brings clarity to the business problem.
CRISP-DM, Data Mining, Data Science, Decision Modeling, Methodology, Predictive Analytics
- The Data Science Puzzle, Revisited - Jan 20, 2017.
The data science puzzle is re-examined through the relationship between several key concepts in the realm, and incorporates important updates and observations from the past year. The result is a modified explanatory graphic and rationale.
AI, Big Data, Data Mining, Data Science, Deep Learning, Machine Learning
- Four Problems in Using CRISP-DM and How To Fix Them - Jan 18, 2017.
CRISP-DM is the leading approach for managing data mining, predictive analytic and data science projects. CRISP-DM is effective but many analytic projects neglect key elements of the approach.
CRISP-DM, Data Mining, Methodology
- 90 Active Blogs on Analytics, Big Data, Data Mining, Data Science, Machine Learning (updated) - Jan 17, 2017.
Stay up to date in data science with active blogs. This is a list of 90 recently active blogs on Big Data, Data Science, Data Mining, Machine Learning, and Artificial Intelligence.
Big Data, Blogs, Data Mining, Data Science, Machine Learning
- Top 10 Amazon Books in Data Mining, 2016 Edition - Nov 11, 2016.
Given the ongoing explosion in interest for all things Data Mining, Data Science, Analytics, Big Data, etc., we have updated our Amazon top books lists from last year. Here are the 10 most popular titles in the Data Mining category.
Amazon, Books, Data Mining, Data Science
- Data Science Basics: Data Mining vs. Statistics - Sep 28, 2016.
As a beginner, I was confused about the relationship between data mining and statistics. This is my attempt to help straighten out this connection for others who may now be in my old shoes.
Beginners, Data Mining, Statistics
- Data Science of Reviews: ReviewMeta tool Automatically Detects Unnatural Reviews on Amazon - Aug 23, 2016.
ReviewMeta is a tool that analyzes millions of reviews and helps customers decide which ones to trust. As the dataset grows, so do the insights on unbiased reviews.
Amazon, Analytics, Customer Analytics, Data Mining, Trends
- Short course: Statistical Learning and Data Mining IV, Washington, DC, Oct 19-20 - Aug 8, 2016.
This new two-day course gives a detailed and modern overview of statistical models used by data scientists for prediction and inference, including sparse models and deep learning.
Data Mining, DC, R, Robert Tibshirani, Statistical Learning, Trevor Hastie, Washington
- History of Data Mining - Jun 22, 2016.
Data mining is a subfield of computer science which blends many techniques from statistics, data science, database theory and machine learning. Here are the major milestones and “firsts” in the history of data mining plus how it’s evolved and blended with data science and big data.
About Gregory Piatetsky, Alan Turing, Bayes Theorem, Data Mining, DJ Patil, History, Vladimir Vapnik
- 5 Ways in Which Big Data Can Help Leverage Customer Data - May 25, 2016.
Every business enterprise realizes the importance of big data but rarely puts the customer data it possesses to good use. Here are a few ways enterprises can leverage customer data.
Analytics, Big Data, Data Management, Data Mining
- The Data Science Puzzle, Explained - Mar 10, 2016.
The puzzle of data science is examined through the relationship between several key concepts in the data science realm. As we will see, far from being concrete concepts etched in stone, divergent opinions are inevitable; this is but another opinion to consider.
Artificial Intelligence, Data Mining, Data Science, Deep Learning, Explained, Machine Learning
- scikit-feature: Open-Source Feature Selection Repository in Python - Mar 3, 2016.
scikit-feature is an open-source feature selection repository in Python, with around 40 popular algorithms from feature selection research. It is developed by the Data Mining and Machine Learning Lab at Arizona State University.
Data Mining, Data Science, Feature Extraction, Feature Selection, Machine Learning, Python
- Top New Features in Orange 3 Data Mining Platform - Dec 10, 2015.
The main technical advantage of Orange 3 is its integration with NumPy and SciPy libraries. Other improvements include reading online data, working through queries for SQL and pre-processing.
Data Mining, Data Visualization, numpy, Orange, Python, scikit-learn
- Amazon Top 20 Books in Data Mining - Oct 27, 2015.
These are the most popular data mining books on Amazon. As you look to increase your knowledge, is there something listed here that is missing from your collection?
Amazon, Book, Data Mining
- 60+ Free Books on Big Data, Data Science, Data Mining, Machine Learning, Python, R, and more - Sep 4, 2015.
Here is a great collection of eBooks written on the topics of Data Science, Business Analytics, Data Mining, Big Data, Machine Learning, Algorithms, Data Science Tools, and Programming Languages for Data Science.
Book, Brendan Martin, Data Mining, Data Science, Free ebook, Machine Learning, Python, R, SQL
- New Standard Methodology for Analytical Models - Aug 3, 2015.
Traditional methods for analytical modelling, like CRISP-DM, have several shortcomings. Here we describe these friction points in CRISP-DM and introduce a new approach, the Standard Methodology for Analytics Models, which overcomes them.
CRISP-DM, Data Mining, Modeling, Olav Laudy, ROI
- Top 10 Data Mining Algorithms, Explained - May 21, 2015.
The top 10 data mining algorithms, selected by top researchers, are explained here, including what they do, the intuition behind each algorithm, available implementations, why to use them, and interesting applications.
Algorithms, Apriori, Bayesian, Boosting, C4.5, CART, Data Mining, Explained, K-means, K-nearest neighbors, Naive Bayes, Page Rank, Support Vector Machines, Top 10
- Most Viewed Data Mining Videos on YouTube - May 18, 2015.
The top Data Mining YouTube videos, from sources such as Google and Revolution Analytics, cover topics ranging from statistics in data mining to using R for data mining to data mining in sports.
Ayasdi, Data Mining, Google, Grant Marshall, R, Rattle, Revolution Analytics, Statistica, Text Mining, Weka, Youtube
- Data Mining: New Comprehensive Textbook by Charu Aggarwal - Apr 23, 2015.
This comprehensive data mining textbook explores the different aspects of data mining, from basics to advanced, and their applications, and may be used for both introductory and advanced data mining courses.
Book, Charu Aggarwal, Data Mining
- More Free Data Mining, Data Science Books and Resources - Mar 25, 2015.
More free resources and online books by leading authors about data mining, data science, machine learning, predictive analytics and statistics.
Book, Data Mining, Data Science, Free ebook, Machine Learning
- Differential Privacy: How to make Privacy and Data Mining Compatible - Jan 9, 2015.
Can privacy coexist with machine learning and data mining? Differential privacy allows the learning of general characteristics of populations while guaranteeing the privacy of individual records.
arXiv, Big Data, Cynthia Dwork, Data Mining, Differential Privacy, Zachary Lipton
- CRISP-DM, still the top methodology for analytics, data mining, or data science projects - Oct 28, 2014.
CRISP-DM remains the most popular methodology for analytics, data mining, and data science projects, with a 43% share in the latest KDnuggets Poll, but a replacement for the unmaintained CRISP-DM is long overdue.
CRISP-DM, Data Mining, James Taylor, Methodology, Poll
- Hiring Data Scientists: What to look for? - Sep 9, 2014.
Learn the key characteristics of a good data scientist, based on the three authors’ consulting and research experience, having collaborated with many companies worldwide on the topics of big data and analytics.
Analytics, Big Data, Business, Data Mining, Data Scientist, Hiring, Programming, Skills, Statistics
- Most Viewed Data Mining Talks at Videolectures - Sep 9, 2014.
Watch the 25 most viewed data mining lectures on VideoLectures.NET to learn about topics ranging from general big-data tutorials to monetizing data mining startups.
Big Data, Data Mining, Data Mining Training, Data Science, Tutorials, Videolectures
- Four main languages for Analytics, Data Mining, Data Science - Aug 18, 2014.
A new KDnuggets Poll shows the growing dominance of four main languages for Analytics, Data Mining, and Data Science (R, SAS, Python, and SQL, used by 91% of data scientists) and a decline in the popularity of other languages, except for Julia and Scala.
Analytics Languages, Data Mining, Data Science, Julia, Poll, Python, R, SAS, Scala, SQL
- Top Research Leaders in Data Mining, Data Science, and KDD - Aug 16, 2014.
We identify the top researchers in Data Mining, Data Science, and KDD. Jiawei Han, Philip Yu, and Christos Faloutsos remain the leaders, but they are joined by many fast-rising young researchers, the leaders of tomorrow.
Christos Faloutsos, Data Mining, Hans-Peter Kriegel, Jian Pei, Jiawei Han, KDD, Philip S. Yu, Researchers, Top list
- XLMiner solves Big Data Problems in Excel - Jun 26, 2014.
XLMiner, part of the Analytic Solver Platform integrated software for predictive and prescriptive analytics (forecasting, data mining, optimization, and simulation), lets you solve small or Big Data problems in Excel.
Data Mining, Excel, Forecasting, Optimization, XLMiner
- 9 Free Books for Learning Data Mining and Data Analysis - Apr 29, 2014.
Whether you are learning data science for the first time, refreshing your memory, or catching up on the latest trends, these free books will help you excel through self-study.
Alex Ivanovs, Algorithms, Analysis, Data Mining, Free ebook, Programming