Tuesday, October 15, 2024

What Is Machine Learning In AI? Deep Learning?


Artificial intelligence is everywhere these days, but the fundamentals of how this influential new technology works can be confusing. Two of the most important fields in AI development are “machine learning” and its sub-field, “deep learning.” Here’s a quick explanation of what these two important disciplines are, and how they’re contributing to the evolution of automation. …

By: Lucas Ropek / Gizmodo

Source: Quartz


Critics:

A machine learning model is a type of mathematical model that, after being “trained” on a given dataset, can be used to make predictions or classifications on new data. During training, a learning algorithm iteratively adjusts the model’s internal parameters to minimize errors in its predictions.
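
As an illustration of that loop, here is a minimal sketch (not from the article) that fits a linear model with gradient descent using NumPy; the dataset, learning rate, and step count are arbitrary choices:

```python
import numpy as np

# Toy dataset: 100 examples whose targets follow a known linear rule plus noise.
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))
true_w = np.array([2.0, -1.0, 0.5])
y = X @ true_w + 0.1 * rng.normal(size=100)

# Internal parameters, adjusted iteratively to reduce prediction error.
w = np.zeros(3)
learning_rate = 0.1

for step in range(500):
    errors = X @ w - y                    # current prediction errors
    gradient = 2 * X.T @ errors / len(y)  # gradient of mean squared error
    w -= learning_rate * gradient         # step downhill on the loss

print(w)  # approaches [2.0, -1.0, 0.5] as training reduces the error
```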

By extension, the term “model” can refer to several levels of specificity, from a general class of models and their associated learning algorithms to a fully trained model with all its internal parameters tuned. Various types of models have been used and researched for machine learning systems; picking the best model for a task is called model selection.

As a scientific endeavor, machine learning grew out of the quest for artificial intelligence (AI). In the early days of AI as an academic discipline, some researchers were interested in having machines learn from data. They attempted to approach the problem with various symbolic methods, as well as what were then termed “neural networks”; these were mostly perceptrons and other models that were later found to be reinventions of the generalized linear models of statistics. 

Probabilistic reasoning was also employed, especially in automated medical diagnosis. However, an increasing emphasis on the logical, knowledge-based approach caused a rift between AI and machine learning. Probabilistic systems were plagued by theoretical and practical problems of data acquisition and representation. By 1980, expert systems had come to dominate AI, and statistics was out of favor.

Work on symbolic/knowledge-based learning did continue within AI, leading to inductive logic programming (ILP), but the more statistical line of research was now outside the field of AI proper, in pattern recognition and information retrieval. Neural network research had been abandoned by AI and computer science around the same time. This line, too, was continued outside the AI/CS field, as “connectionism”, by researchers from other disciplines including John Hopfield, David Rumelhart, and Geoffrey Hinton. Their main success came in the mid-1980s with the reinvention of backpropagation.

Machine learning (ML), reorganized and recognized as its own field, started to flourish in the 1990s. The field changed its goal from achieving artificial intelligence to tackling solvable problems of a practical nature. It shifted focus away from the symbolic approaches it had inherited from AI, and toward methods and models borrowed from statistics, fuzzy logic, and probability theory.

There is a close connection between machine learning and compression. A system that predicts the posterior probabilities of a sequence given its entire history can be used for optimal data compression (by using arithmetic coding on the output distribution). Conversely, an optimal compressor can be used for prediction (by finding the symbol that compresses best, given the previous history). This equivalence has been used as a justification for using data compression as a benchmark for “general intelligence”.
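
The link runs through code lengths: under arithmetic coding, a symbol the model predicts with probability p costs about -log2(p) bits, so better predictions compress better. A small illustrative sketch (the probabilities are invented, and this is not a full arithmetic coder):

```python
import math

def ideal_code_length(probabilities):
    """Total bits an ideal arithmetic coder needs, given the probability the
    model assigned to each symbol that actually occurred."""
    return sum(-math.log2(p) for p in probabilities)

# Invented per-symbol probabilities from two predictors over the same sequence.
sharp_model = [0.9, 0.8, 0.95, 0.7]       # confident, mostly correct predictions
uniform_model = [0.25, 0.25, 0.25, 0.25]  # pure guessing over 4 symbols

print(ideal_code_length(sharp_model))    # ~1.06 bits in total
print(ideal_code_length(uniform_model))  # 8.0 bits: worse prediction, worse compression
```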

An alternative view shows that compression algorithms implicitly map strings into implicit feature space vectors, and that compression-based similarity measures compute similarity within these feature spaces. For each compressor C(·), one can define an associated vector space ℵ such that C(·) maps an input string x to a feature vector whose norm ‖x‖ corresponds to the compressed length of x. An exhaustive examination of the feature spaces underlying all compression algorithms is beyond the scope here; instead, three representative lossless compression methods are typically examined: LZW, LZ77, and PPM.
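
One widely used instance of this idea is the normalized compression distance (NCD). The sketch below uses Python's zlib (an LZ77-family compressor) as a stand-in for LZW, LZ77, or PPM; the sample strings are invented:

```python
import zlib

def c(data: bytes) -> int:
    """Compressed size in bytes; zlib stands in for LZW, LZ77, or PPM."""
    return len(zlib.compress(data, level=9))

def ncd(x: bytes, y: bytes) -> float:
    """Normalized compression distance: small when x and y share structure
    in the compressor's implicit feature space."""
    cx, cy, cxy = c(x), c(y), c(x + y)
    return (cxy - min(cx, cy)) / max(cx, cy)

a = b"the quick brown fox jumps over the lazy dog" * 20
b = b"the quick brown fox leaps over the lazy cat" * 20
z = b"lorem ipsum dolor sit amet, consectetur adipiscing" * 20

print(ncd(a, b))  # relatively small: the strings share most of their structure
print(ncd(a, z))  # larger: little shared structure to exploit
```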

According to AIXI theory, a connection more directly explained by the Hutter Prize, the best possible compression of x is the smallest possible software that generates x. For example, in that model, a zip file’s compressed size includes both the zip file and the unzipping software, since you cannot unzip it without both, but there may be an even smaller combined form.

Examples of AI-powered audio/video compression software include NVIDIA Maxine and AIVC. Examples of software that can perform AI-powered image compression include OpenCV, TensorFlow, MATLAB’s Image Processing Toolbox (IPT), and High-Fidelity Generative Image Compression. In unsupervised machine learning, k-means clustering can be used to compress data by grouping similar data points into clusters.

This technique simplifies handling extensive datasets that lack predefined labels and finds widespread use in fields such as image compression. Data compression aims to reduce the size of data files, enhancing storage efficiency and speeding up data transmission. K-means clustering, an unsupervised machine learning algorithm, is employed to partition a dataset into a specified number of clusters, k, each represented by the centroid of its points.

This process condenses extensive datasets into a more compact set of representative points. Particularly beneficial in image and signal processing, k-means clustering aids in data reduction by replacing groups of data points with their centroids, thereby preserving the core information of the original data while significantly decreasing the required storage space.
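
A minimal sketch of k-means as a compressor, assuming scikit-learn is available; the random “pixel” data and the palette size k = 16 are illustrative:

```python
import numpy as np
from sklearn.cluster import KMeans  # assumes scikit-learn is installed

# Toy stand-in for an image: 1,000 RGB pixels with values in [0, 1].
rng = np.random.default_rng(0)
pixels = rng.random((1000, 3))

# Partition the pixels into k clusters; the centroids become a color palette.
k = 16
kmeans = KMeans(n_clusters=k, n_init=10, random_state=0).fit(pixels)

# Compressed form: a small palette plus one palette index per pixel.
palette = kmeans.cluster_centers_   # k x 3 centroid colors
indices = kmeans.labels_            # one small integer per pixel

# Reconstruction replaces each pixel with its cluster's centroid.
reconstructed = palette[indices]
print(np.mean((pixels - reconstructed) ** 2))  # distortion introduced by compression
```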

Large language models (LLMs) are also capable of lossless data compression, as demonstrated by DeepMind’s research with the Chinchilla 70B model. Developed by DeepMind, Chinchilla 70B effectively compressed data, outperforming conventional methods such as Portable Network Graphics (PNG) for images and Free Lossless Audio Codec (FLAC) for audio.

It achieved compression of image and audio data to 43.4% and 16.4% of their original sizes, respectively.

Machine learning and data mining often employ the same methods and overlap significantly, but while machine learning focuses on prediction, based on known properties learned from the training data, data mining focuses on the discovery of (previously) unknown properties in the data (this is the analysis step of knowledge discovery in databases).

Data mining uses many machine learning methods, but with different goals; on the other hand, machine learning also employs data mining methods as “unsupervised learning” or as a preprocessing step to improve learner accuracy. Much of the confusion between these two research communities (which do often have separate conferences and separate journals, ECML PKDD being a major exception) comes from the basic assumptions they work with:

In machine learning, performance is usually evaluated with respect to the ability to reproduce known knowledge, while in knowledge discovery and data mining (KDD) the key task is the discovery of previously unknown knowledge. Evaluated with respect to known knowledge, an uninformed (unsupervised) method will easily be outperformed by supervised methods, while in a typical KDD task, supervised methods cannot be used due to the unavailability of training data.

Machine learning also has intimate ties to optimization: Many learning problems are formulated as minimization of some loss function on a training set of examples. Loss functions express the discrepancy between the predictions of the model being trained and the actual problem instances (for example, in classification, one wants to assign a label to instances, and models are trained to correctly predict the preassigned labels of a set of examples).
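
For instance, the cross-entropy loss commonly used in classification measures exactly this discrepancy. A small NumPy sketch with invented predictions and labels:

```python
import numpy as np

def cross_entropy(predicted_probs, labels):
    """Average discrepancy between predicted class probabilities and the
    preassigned labels; training searches for parameters that minimize it."""
    p_true = predicted_probs[np.arange(len(labels)), labels]
    return -np.mean(np.log(p_true))

# Three examples, two classes; the labels are the "preassigned" answers.
probs = np.array([[0.9, 0.1],
                  [0.2, 0.8],
                  [0.6, 0.4]])
labels = np.array([0, 1, 0])

print(cross_entropy(probs, labels))  # ~0.28; shrinks as predictions match labels
```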

Characterizing the generalization of various learning algorithms is an active topic of current research, especially for deep learning algorithms. Machine learning and statistics are closely related fields in terms of methods, but distinct in their principal goal: statistics draws population inferences from a sample, while machine learning finds generalizable predictive patterns. 

According to Michael I. Jordan, the ideas of machine learning, from methodological principles to theoretical tools, have had a long pre-history in statistics. He also suggested the term “data science” as a placeholder for the overall field. Conventional statistical analyses require the a priori selection of a model most suitable for the study data set. In addition, only significant or theoretically relevant variables based on previous experience are included for analysis.

In contrast, machine learning is not built on a pre-structured model; rather, the data shape the model by detecting underlying patterns. The more variables (inputs) used to train the model, the more accurate the final model can be. Leo Breiman distinguished two statistical modeling paradigms: data model and algorithmic model, wherein “algorithmic model” refers more or less to machine learning algorithms such as random forests.

Some statisticians have adopted methods from machine learning, leading to a combined field that they call statistical learning. Analytical and computational techniques derived from the physics of disordered systems can be extended to large-scale problems, including machine learning, e.g., to analyze the weight space of deep neural networks. Statistical physics is thus finding applications in the area of medical diagnostics.

Machine learning approaches are traditionally divided into three broad categories, which correspond to learning paradigms, depending on the nature of the “signal” or “feedback” available to the learning system (a short sketch contrasting the first two follows the list):

  • Supervised learning: The computer is presented with example inputs and their desired outputs, given by a “teacher”, and the goal is to learn a general rule that maps inputs to outputs.
  • Unsupervised learning: No labels are given to the learning algorithm, leaving it on its own to find structure in its input. Unsupervised learning can be a goal in itself (discovering hidden patterns in data) or a means towards an end (feature learning).
  • Reinforcement learning: A computer program interacts with a dynamic environment in which it must perform a certain goal (such as driving a vehicle or playing a game against an opponent). As it navigates its problem space, the program is provided feedback that’s analogous to rewards, which it tries to maximize.
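
A short sketch contrasting the first two paradigms, assuming scikit-learn; the synthetic data and the particular models (logistic regression for supervised, k-means for unsupervised) are illustrative choices:

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2))
y = (X[:, 0] + X[:, 1] > 0).astype(int)  # labels supplied by a "teacher"

# Supervised: learn a general rule mapping inputs to the given outputs.
classifier = LogisticRegression().fit(X, y)
print(classifier.predict(X[:5]))

# Unsupervised: no labels; the algorithm finds structure in the inputs alone.
clusters = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X)
print(clusters[:5])
```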

Although each algorithm has advantages and limitations, no single algorithm works for all problems.


