A Brief History of AI

A Brief History of AI

Updated through June 2026

Before the 1940s, the idea of artificial intelligence existed mostly in mythology, philosophy, mathematics, and early mechanical invention. Ancient stories such as Talos in Greek mythology and the Golem in Jewish folklore imagined artificial beings created by humans and animated with intelligence, power, or purpose. Philosophers and mathematicians, including Aristotle, Leibniz, and later George Boole, explored formal logic and symbolic reasoning, laying early foundations for the idea that thought itself might be represented through rules, symbols, and systems.

By the late nineteenth and early twentieth centuries, mechanical calculators, logic machines, and early theories of computation began turning these ideas into engineering problems. The question gradually shifted from “Can intelligence be imagined?” to “Can intelligence be mechanized?”

1940s–1950s: Theoretical Foundations

The modern history of AI begins with the rise of programmable computers. During and after World War II, scientists began to see computation not only as arithmetic, but as a possible model for reasoning.

Alan Turing and the Turing Test (1950):
In his paper “Computing Machinery and Intelligence,” Alan Turing asked whether machines could think and proposed what became known as the Turing Test: a practical way to judge whether a machine could imitate human conversation convincingly enough to be considered intelligent.

Early AI Programs (1951):
Christopher Strachey and Dietrich Prinz created early programs capable of playing checkers and chess. These systems were limited, but they showed that computers could perform tasks that appeared to require reasoning, planning, and decision-making.

1956: The Birth of Artificial Intelligence

The Dartmouth Conference (1956):
The field of artificial intelligence was formally born at the Dartmouth Summer Research Project on Artificial Intelligence. John McCarthy, Marvin Minsky, Claude Shannon, Allen Newell, Herbert A. Simon, and others gathered to explore the possibility that “every aspect of learning or any other feature of intelligence” could, in principle, be described so precisely that a machine could simulate it.

This moment gave the field its name and set the research agenda for decades: symbolic reasoning, problem-solving, language, learning, and machine intelligence.

1960s: Early AI Experiments

Logic Theorist and Symbolic AI:
Allen Newell and Herbert A. Simon’s Logic Theorist demonstrated that computers could prove mathematical theorems. This helped establish symbolic AI, an approach based on explicit rules, logic, and representation.

ELIZA (1966):
Joseph Weizenbaum created ELIZA, an early natural language program that simulated a psychotherapist by responding to user input with scripted patterns. Although ELIZA did not truly understand language, it revealed how easily people could attribute intelligence and empathy to machines.

1970s: AI Winter and Research Setbacks

The early optimism around AI ran into major limits. Systems worked in narrow demonstrations but struggled with ambiguity, real-world complexity, common sense, and scale. Funding declined, expectations cooled, and the field entered what became known as the first AI winter.

This period showed one of AI’s recurring patterns: moments of excitement followed by disappointment when systems fail to generalize beyond controlled environments.

1980s: Expert Systems and Neural Networks Return

Expert Systems:
In the 1980s, AI found commercial use through expert systems: programs that encoded specialist knowledge into rule-based systems. Examples included DENDRAL for chemistry and MYCIN for medical diagnosis. These systems showed that AI could be useful in specialized domains, but they were expensive to build, difficult to maintain, and brittle outside their narrow rules.

Backpropagation and Neural Networks (1986):
Geoffrey Hinton, David Rumelhart, and Ronald Williams helped popularize backpropagation, a method for training neural networks by adjusting internal weights based on error. This revived interest in machine learning and laid the foundation for later deep learning.

Convolutional Neural Networks (1989):
Yann LeCun applied convolutional neural networks to handwritten digit recognition, showing how neural networks could learn visual patterns. This work became an important precursor to modern computer vision.

1990s: AI in Games and Real-World Systems

Deep Blue Defeats Garry Kasparov (1997):
IBM’s Deep Blue defeated world chess champion Garry Kasparov, demonstrating that machines could surpass even the strongest humans in a complex but well-defined intellectual domain. Deep Blue was not general intelligence, but it proved that computers could dominate specific forms of strategic reasoning.

During this period, AI also became increasingly embedded in search, logistics, finance, fraud detection, and recommendation systems, often without being labeled as “AI” in the consumer imagination.

2000s: The AI Renaissance

In the 2000s, AI advanced through a combination of better algorithms, more digital data, and more powerful hardware.

Search, Advertising, and Recommendation Systems:
Companies such as Google, Amazon, Netflix, and Facebook used machine learning to rank search results, recommend products, personalize feeds, and optimize advertising. This was one of AI’s most important commercial phases: intelligence became a hidden layer powering the internet economy.

Speech Recognition and Mobile AI:
Voice recognition systems improved, and products such as Siri brought AI into everyday consumer devices.

Self-Driving Vehicles:
AI also became central to autonomous driving research. Companies such as Google, later Waymo, and Tesla used computer vision, sensor fusion, machine learning, and real-time decision systems to pursue autonomous mobility.

Deep Learning Foundations:
Researchers such as Geoffrey Hinton, Yann LeCun, Yoshua Bengio, and Andrew Ng helped drive the transition from hand-designed features to systems that could learn representations from data. GPUs made it possible to train larger neural networks faster and more effectively.

2012: The Big Bang of Modern Deep Learning

AlexNet (2012):
Alex Krizhevsky, Ilya Sutskever, and Geoffrey Hinton won the ImageNet competition with AlexNet, a deep convolutional neural network trained using GPUs. AlexNet dramatically improved image classification performance and proved that deep learning could outperform traditional computer vision methods at scale.

This moment is often seen as the beginning of the modern AI boom. It showed that with enough data, compute, and neural network depth, AI systems could learn powerful representations and outperform older techniques.

2014–2016: Deep Learning Breakthroughs

Generative Adversarial Networks (2014):
Ian Goodfellow introduced generative adversarial networks, or GANs, which trained two neural networks against each other: one generating data and the other judging whether it was real or fake. GANs became important for image generation and synthetic media.

AlphaGo Defeats Lee Sedol (2016):
DeepMind’s AlphaGo defeated Go champion Lee Sedol, a landmark achievement because Go has far more possible board states than chess. AlphaGo combined deep neural networks with reinforcement learning and search, showing that AI could master problems requiring intuition, strategy, and long-term planning.

2015: The Founding of OpenAI

OpenAI was founded in 2015 by Sam Altman, Elon Musk, Greg Brockman, Ilya Sutskever, John Schulman, Wojciech Zaremba, and others. Its original mission was to ensure that artificial general intelligence would benefit humanity broadly.

OpenAI began as a nonprofit research organization but later created a capped-profit structure to raise the capital needed for large-scale AI research. This reflected a broader shift in the field: frontier AI increasingly required enormous compute, specialized talent, and cloud infrastructure.

2017: The Transformer Revolution

“Attention Is All You Need” (2017):
Researchers at Google introduced the Transformer architecture, based on the self-attention mechanism. Transformers made it possible to process language more efficiently and scale models much larger than before.

This was one of the most important breakthroughs in AI history. Transformers became the foundation for modern large language models, including BERT, GPT, Gemini, Claude, Llama, and many others.

BERT (2018):
Google’s BERT improved language understanding by training models to understand words in context. It became widely used in search and natural language processing.

2018–2020: The Rise of Large Language Models

GPT-1 and GPT-2:
OpenAI’s early GPT models showed that a model trained on large amounts of text could generate coherent language and then be adapted to many tasks.

GPT-3 (2020):
GPT-3, with 175 billion parameters, showed that scaling language models could produce surprising general-purpose abilities. It could write essays, summarize text, answer questions, translate, generate code, and perform tasks from only a few examples. GPT-3 made large language models visible to developers and businesses through an API.

This period introduced a major idea: instead of building a separate AI system for every task, a large pre-trained model could act as a flexible foundation for many tasks.

2020: AlphaFold and Scientific AI

AlphaFold (2020):
DeepMind’s AlphaFold achieved a major breakthrough in protein structure prediction, a long-standing challenge in biology. This showed that AI could contribute not only to language, images, and games, but also to scientific discovery.

AlphaFold marked a shift in how AI was perceived: from a tool for automation and prediction to a possible engine for accelerating science.

2021–2022: Multimodal and Creative AI

DALL·E and DALL·E 2:
OpenAI introduced systems capable of generating images from text prompts. This helped popularize multimodal AI: models that connect language with images and other forms of data.

Stable Diffusion (2022):
Stable Diffusion made high-quality text-to-image generation widely accessible, including through open-source tools. It democratized image generation and accelerated creative AI adoption.

Midjourney (2022):
Midjourney became popular among artists, designers, marketers, and creators for producing visually striking AI-generated images.

Together, these systems showed that AI was no longer only analytical. It was becoming creative, visual, and increasingly accessible to ordinary users.

2022: ChatGPT and the Consumer AI Explosion

ChatGPT (2022):
OpenAI released ChatGPT in November 2022. Built on GPT-3.5, it transformed public understanding of AI by making conversational AI easy to use. Instead of needing to code or understand machine learning, users could simply chat with an AI system.

ChatGPT became one of the fastest-growing consumer applications in history and triggered a global AI race. Businesses, schools, governments, creators, and software companies began rethinking how work, learning, customer service, content creation, and coding might change.

This was the moment generative AI entered mainstream culture.

2023: GPT-4 and the Professionalization of Generative AI

GPT-4 (2023):
GPT-4 represented a major leap in reasoning, reliability, coding, and professional usefulness. It also introduced stronger multimodal capabilities, including image understanding in some versions.

Generative AI began moving from novelty to infrastructure. Lawyers used AI to review documents, doctors experimented with AI support tools, students used AI tutors, developers used coding assistants, and companies began integrating AI into internal workflows.

Claude, Gemini, Llama, Mistral, and the Model Race:
Anthropic’s Claude, Google’s Gemini, Meta’s Llama models, Mistral’s open and commercial models, Cohere’s enterprise models, and xAI’s Grok all contributed to a rapidly expanding competitive landscape. AI was no longer a single-company story. It became an ecosystem.

2024: Multimodal AI, Open Models, and Reasoning Systems

By 2024, AI development accelerated across three major fronts: multimodality, open-weight models, and advanced reasoning.

GPT-4o and Real-Time Multimodality:
OpenAI introduced GPT-4o, bringing faster and more natural interaction across text, image, and voice. AI began to feel less like a chatbot and more like a live assistant.

Gemini 1.5 and Long Context:
Google pushed long-context multimodal models, enabling AI systems to process much larger documents, videos, audio files, and codebases.

Claude 3 and Claude 3.5:
Anthropic advanced Claude’s usefulness in writing, coding, analysis, and enterprise applications, with a strong emphasis on safety and reliability.

Llama 3.1:
Meta released Llama 3.1 405B, one of the strongest open-weight foundation models at the time. This strengthened the open AI ecosystem and allowed developers, researchers, and companies to build more customized AI systems.

OpenAI o1-preview:
OpenAI introduced o1-preview, a new class of reasoning model designed to spend more time thinking through difficult problems. This marked a shift from models that mainly generated fluent answers toward models optimized for complex reasoning, math, coding, and multi-step problem solving.

2025: Reasoning, Agents, and the Open-Weight Shock

In 2025, AI moved from chat toward action.

DeepSeek-R1:
DeepSeek-R1 demonstrated that advanced reasoning capabilities could be trained using reinforcement learning and released in a way that made powerful reasoning models more accessible. It shocked the industry because it suggested that frontier-style reasoning might not remain limited to a small number of closed labs.

GPT-5:
OpenAI released GPT-5, improving reasoning, coding, writing, and everyday reliability. GPT-5 represented a further step toward models that could act as general-purpose collaborators across work, learning, and creative tasks.

Operator and Computer-Using Agents:
OpenAI introduced Operator, an early agent capable of using a browser to complete tasks by clicking, typing, scrolling, and interacting with websites. This showed a new direction: AI systems that do not only answer questions, but can operate software interfaces on behalf of users.

Codex and Coding Agents:
AI coding tools advanced from autocomplete toward agentic software engineering. Instead of only suggesting snippets, coding agents could plan tasks, edit files, run tests, fix bugs, and work across larger codebases. Developers began supervising multiple AI agents rather than only writing code line by line.

Llama 4:
Meta introduced Llama 4 Scout and Llama 4 Maverick, open-weight natively multimodal models using mixture-of-experts architecture. This continued the trend toward more efficient, specialized, and accessible AI systems.

Sora 2 and Video Generation:
OpenAI’s Sora 2 pushed video and audio generation forward, with more realistic motion, stronger controllability, and synchronized sound. Video generation became part of the broader move toward AI systems that model not just language, but scenes, motion, and aspects of the physical world.

2026: AI Becomes Agentic Infrastructure

By June 2026, the frontier of AI had shifted again. The most important question was no longer only “Can AI generate text, images, code, or video?” but “Can AI carry out long, complex work?”

GPT-5.5 and Frontier Work Models:
OpenAI’s GPT-5.5 became part of a new generation of models built for complex reasoning, coding, professional work, long context, tool use, web search, file search, and computer use. These systems are increasingly designed to operate as collaborators inside real workflows.

Gemini 3.5:
Google introduced Gemini 3.5 as a family of models focused on agentic workflows, coding, complex long-horizon tasks, and real-world utility. This reflected a broader industry direction: models are becoming action-oriented systems rather than passive assistants.

Claude Fable 5 and Claude Mythos 5:
Anthropic launched Claude Fable 5 for general use and Claude Mythos 5 for more restricted, high-risk domains through trusted-access programs. These models emphasized long-running work, advanced software engineering, knowledge work, vision, scientific research, and cybersecurity capabilities, while also highlighting the increasing importance of safety controls around powerful AI systems.

Agents as the New Interface:
By 2026, AI agents were becoming a major interface for work. Instead of simply prompting a chatbot, users could delegate tasks to systems that plan, use tools, interact with software, generate documents, write code, search files, browse the web, and coordinate multi-step workflows.

This represents a major shift in the history of AI. Earlier AI systems were tools that performed specific tasks. Generative AI systems became conversational collaborators. The newest systems are becoming agentic infrastructure: persistent, tool-using, context-aware systems that can help individuals and organizations execute work.

The Current Moment

The history of AI is not a straight line from weak machines to human-like intelligence. It is a cycle of ideas, technical breakthroughs, scaling laws, commercial pressure, public excitement, safety concerns, and new interfaces.

From myths of artificial beings to symbolic logic, from expert systems to neural networks, from deep learning to Transformers, from ChatGPT to multimodal models, and now from chatbots to agents, AI has repeatedly expanded what machines can do.

As of 2026, AI is entering a new phase. It is becoming less like a single product and more like a general layer of intelligence embedded across software, science, education, entertainment, business, creativity, and daily life.

The central question is no longer whether AI will matter. It already does.

The question now is how societies, companies, creators, workers, governments, and individuals will reorganize around a world where intelligence is increasingly abundant, interactive, multimodal, and capable of action.