Data Mining
Data Mining
What is Data Mining?
Data mining is the process of discovering patterns in large data sets involving methods at the intersection of machine learning, statistics, and database systems.It is an essential process where intelligent methods are applied to extract data patterns.It is an interdisciplinary subfield of computer science.The overall goal of the data mining process is to extract information from a data set and transform it into an understandable structure for further use.
How did Data Mining come into existence?
The manual extraction of patterns from data has occurred for centuries. Early methods of identifying patterns in data include Bayes' theorem (1700s) and regression analysis (1800s). The proliferation, ubiquity and increasing power of computer technology has dramatically increased data collection, storage, and manipulation ability. As data sets have grown in size and complexity, direct "hands-on" data analysis has increasingly been augmented with indirect, automated data processing, aided by other discoveries in computer science, such as neural networks, cluster analysis, genetic algorithms (1950s), decision trees and decision rules (1960s), and support vector machines (1990s). Data mining is the process of applying these methods with the intention of uncovering hidden patterns[14] in large data sets. It bridges the gap from applied statistics and artificial intelligence (which usually provide the mathematical background) to database management by exploiting the way data is stored and indexed in databases to execute the actual learning and discovery algorithms more efficiently, allowing such methods to be applied to ever larger data sets.
The process of data analysis, discovery, and model-building is often iterative as you target and identify the different information that you can extract. You must also understand how to relate, map, associate, and cluster it with other data to produce the result. Identifying the source data and formats, and then mapping that information to our given result can change after you discover different elements and aspects of the data.
How does Data Mining work?
Data Mining Techniques:
Data mining involves six common classes of tasks:
- Anamoly Detection(outlier/change/deviation detection) – The identification of unusual data records, that might be interesting or data errors that require further investigation.
- Association rule learning (dependency modelling) – Searches for relationships between variables. For example, a supermarket might gather data on customer purchasing habits. Using association rule learning, the supermarket can determine which products are frequently bought together and use this information for marketing purposes. This is sometimes referred to as market basket analysis.
- Clustering – is the task of discovering groups and structures in the data that are in some way or another "similar", without using known structures in the data.
- Classification– is the task of generalizing known structure to apply to new data. For example, an e-mail program might attempt to classify an e-mail as "legitimate" or as "spam".
- Regression – attempts to find a function which models the data with the least error that is, for estimating the relationships among data or datasets.
- Summarization– providing a more compact representation of the data set, including visualization and report generation.
Advantages of Data Mining
- Marketing / Retail:Data mining helps marketing companies build models based on historical data to predict who will respond to the new marketing campaigns such as direct mail, online marketing campaign…etc. Through the results, marketers will have an appropriate approach to selling profitable products to targeted customers.
- Data mining brings a lot of benefits to retail companies in the same way as marketing. Through market basket analysis, a store can have an appropriate production arrangement in a way that customers can buy frequent buying products together with pleasant. In addition, it also helps the retail companies offer certain discounts for particular products that will attract more customers.
- Finance / Banking:Data mining gives financial institutions information about loan information and credit reporting. By building a model from historical customer’s data, the bank, and financial institution can determine good and bad loans. In addition, data mining helps banks detect fraudulent credit card transactions to protect credit card’s owner.
- Manufacturing:By applying data mining in operational engineering data, manufacturers can detect faulty equipment and determine optimal control parameters. For example, semiconductor manufacturers have a challenge that even the conditions of manufacturing environments at different wafer production plants are similar, the quality of wafer are a lot the same and some for unknown reasons even has defects. Data mining has been applying to determine the ranges of control parameters that lead to the production of the golden wafer. Then those optimal control parameters are used to manufacture wafers with desired quality.
- Governments:Data mining helps government agency by digging and analyzing records of the financial transaction to build patterns that can detect money laundering or criminal activities.
Disadvantages of data mining
- Privacy Issues:The concerns about the personal privacy have been increasing enormously recently especially when the internet is booming with social networks, e-commerce, forums, blogs…. Because of privacy issues, people are afraid of their personal information is collected and used in an unethical way that potentially causing them a lot of troubles. Businesses collect information about their customers in many ways for understanding their purchasing behaviors trends. However businesses don’t last forever, some days they may be acquired by other or gone. At this time, the personal information they own probably is sold to other or leak.
- Security issues:Security is a big issue. Businesses own information about their employees and customers including social security number, birthday, payroll and etc. However how properly this information is taken care is still in questions. There have been a lot of cases that hackers accessed and stole big data of customers from the big corporation such as Ford Motor Credit Company, Sony… with so much personal and financial information available, the credit card stolen and identity theft become a big problem.
- Misuse of information/inaccurate information:Information is collected through data mining intended for the ethical purposes can be misused. This information may be exploited by unethical people or businesses to take benefits of vulnerable people or discriminate against a group of people.
In addition, data mining technique is not perfectly accurate.Therefore, if inaccurate information is used for decision-making, it will cause serious consequence.
Conclusion:
Advantages and Disadvantages of Data Mining.Data mining brings a lot of benefits to businesses, society, governments as well as the individual. However, privacy, security, and misuse of information are the big problems if they are not addressed and resolved properly.
Bitcoin is the greatest investment ever I was so ambitious in getting btc but due to income I was not able to gain much I have worked with different mining programs and am more than convinced since I found a new trustworthy hardware Antminer S9 and D3 seemed the best bitcoin trading to earn more money online with bitcoin. was introduced to the cyber hackers who multiplied my btc it was like a dream but definitely came true so unbelievable until I got my profit in my wallet am happy am blessed I started my investment with the minimum of 300$ which i borrowed and when I started this I received my first profit within 12hrs and I still continue and now I have more than enough in my wallet and I can pay all my bill thanks to bitminers180@gmail.com if you are interested in this platform you can contact this email: bitminers180@gmail.com or DM / watsapp @ +2348147054320
ReplyDeleteHello, You have posted such precious and informative article which gave me lot of information. I hope that you will keep it up and we will have more informative and helping news from you. Thanks Data Mining PhD Dissertation
ReplyDeleteThanks for elaborating and letting us know about data mining in detail. your post for informative and very helpful . but for effective data mining the data should be segregated and there should not be any data redundancy so data migration services can be used to that .
ReplyDeleteCloud Migration Services
AWS Cloud Migration Services
Azure Cloud Migration Services
VMware Cloud Migration Services
Cloud Migration tool
Database Migration Services
Cloud Migration Services
Positive site, where did u come up with the information on this posting?I have read a few of the articles on your website now, and I really like your style. Thanks a million and please keep up the effective work. Goldshell LT5 Doge LTC Miner
ReplyDeleteThanks, for such a great post. I have tried and found it really helpful. For more details to visit Internet research services
ReplyDeleteThanks for suggestion......... For more visit manual data mining services
ReplyDeleteReally nice information thanks for sharing very useful.......... For more visit reverse email appending services/reverse email appending
ReplyDeletethanks for such beautiful information given me keep updating
ReplyDeleteLeadsProspex, attorney email list
gynecologist email list
Great work, thank you so much for this useful information.for more details visit
ReplyDeleteLeadsProspex, hubspot crm users email list, insightly crm users email list
Yes, this is a good post without any doubt. You really do a great job. I am inspired by you. So keep it up for more visit LeadsProspex, hubspot crm users email list/a> , insightly crm users email list
ReplyDeleteExcellent work on the article. I really appreciate your work on this blog. for more details visit cmo email list, chief and vp of it email list, hospital email list
ReplyDeleteThanks, for such a great post. I have tried it and found it really helpful. For more details to visit
ReplyDeletegermany business email list, greece business email list, nimble users email list