Cloud Computing

Cloud computing is the on-demand delivery of computing power, database storage, applications, and other IT resources via the internet with pay-as-you-go pricing.

cloud computinginfrastructureAWSGoogle CloudAzureSaaSPaaSIaaSAI infrastructure

Definition

Cloud computing is the delivery of different services through the Internet. These resources include tools and applications like data storage, servers, databases, networking, and software.

Cloud Computing in AI

The AI revolution is built on the backbone of cloud computing. Modern Large Language Models require thousands of interconnected chips to train, a scale only possible through massive cloud data centers.

Key Providers

  • Amazon Web Services (AWS): The market leader, offering services like Amazon Bedrock and SageMaker for AI.
  • Google Cloud Platform (GCP): Known for its custom TPUs and Vertex AI platform.
  • Microsoft Azure: Strategic partner of OpenAI, providing the infrastructure for ChatGPT and Azure AI Studio.

Deployment Models

  1. Public Cloud: Owned and operated by third-party cloud service providers.
  2. Private Cloud: Dedicated to a single organization.
  3. Hybrid Cloud: Combines on-premises infrastructure with public clouds.

Benefits for AI Development

  • Scalability: Easily add more computing power as model size grows.
  • Accessibility: Developers can access state-of-the-art hardware without upfront capital investment.
  • Global Reach: Deploy AI models close to users across the world for low latency.

Frequently Asked Questions

Cloud computing provides the massive computational power ([GPUs](/glossary/gpu-computing) and [TPUs](/glossary/tensor-processing-unit)) required to train and serve large-scale AI models that would be too expensive for most companies to run locally.
Infrastructure as a Service (IaaS), Platform as a Service (PaaS), and Software as a Service (SaaS).

Continue Learning

Explore our lessons and prompts to deepen your AI knowledge.