Cloud Infrastructure Basics

Why Cloud Infrastructure Exists

Cloud infrastructure replaces physical server management with on-demand resources delivered over the internet.

Traditional Infrastructure

Capacity (fixed)

built for peak → waste or overload

Cloud Infrastructure

scales with demand → no guesswork

Traditionally, companies had to purchase and maintain their own hardware.
Scaling required manual setup and planning for peak capacity.
Cloud platforms provide compute, storage, and networking on demand.

Details

Before cloud computing, companies had to build and manage their own infrastructure. This meant purchasing physical servers, setting up data centers, handling cooling and power, and maintaining hardware over time.

Scaling systems was especially difficult. If traffic increased, companies needed to buy and install more servers in advance. If demand dropped, those resources would sit unused, leading to wasted cost.

Cloud infrastructure solves this by providing resources over the internet. Instead of managing physical machines, developers request compute power, storage, and networking through cloud platforms.

This allows systems to scale dynamically based on demand, without upfront hardware investment. Infrastructure becomes something that can be provisioned instantly rather than physically built and maintained.

This shift is a major foundation of modern backend systems, enabling faster development, lower operational overhead, and more flexible scaling.

Compute Services

Compute services provide the CPU, memory, and runtime needed to execute application code in the cloud.

🖥️ VM

Hardware

Runtime

App Code

you manage most layers

📦 Container

Hardware

Runtime

App Code

platform handles OS

⚡ Serverless

Hardware

Runtime

App Code

only write code

more control →← less management

They run application code using virtual machines, containers, or serverless functions.
Compute services provide the core resources required to execute applications.
Different models offer tradeoffs between control and abstraction.

Details

Compute services are the foundation of cloud platforms—they are responsible for actually running application code. Without compute, other services like storage or databases would have nothing to interact with.

Cloud providers offer different ways to run code. Virtual machines give full control over the operating system, container runtimes provide lightweight and portable environments, and serverless functions allow code to run without managing infrastructure at all.

All of these models provide essential resources such as CPU, memory, and execution environments. The difference lies in how much responsibility the developer has in managing the system.

For example, services like AWS EC2, Google Compute Engine, and Azure Virtual Machines allow users to configure and manage virtual servers. More abstract services, like serverless platforms, remove much of that responsibility.

Choosing the right compute model depends on the level of control needed and the complexity of the application being built.

Managed Databases

Managed databases handle setup, maintenance, and scaling so applications can focus on storing and accessing data.

Self-Managed

🧑‍💻 Setup DB

🧑‍💻 Configure Backup

🧑‍💻 Handle Scaling

🧑‍💻 Manage Failover

you handle everything

Managed Database

💻

App

↓

🗄️

managed system

🔁 backup

📈 scaling

🛡 failover

platform handles infrastructure

Cloud providers manage database setup, backups, and replication.
They automatically handle scaling and high availability.
Applications interact with the database without managing infrastructure.

Details

Databases are critical to most applications, but managing them manually is complex and time-consuming. Traditionally, teams had to install database software, configure backups, handle failures, and manage scaling themselves.

Managed database services shift this responsibility to the cloud provider. Services like AWS RDS, Google Cloud SQL, and Azure SQL Database automatically handle tasks such as provisioning, patching, and monitoring.

They also provide built-in features like automated backups, replication across multiple locations, and scaling based on demand. This improves reliability and ensures data is available even during failures.

As a result, developers can focus on application logic and data modeling rather than infrastructure management. This significantly reduces operational complexity while maintaining high performance and availability.

Object Storage

Object storage stores unstructured data like files and media with high durability and scalability.

💻

Client

🖼️

image.png

🎬

video.mp4

📄

report.pdf

📊

logs.json

🗂️

Object Storage

🖼️

image.png

object + metadata

🎬

video.mp4

object + metadata

📄

report.pdf

object + metadata

📊

logs.json

object + metadata

📦

Zone 1

📦

Zone 2

📦

Zone 3

Objects are replicated across zones for durability

It is used to store files such as images, videos, logs, and backups.
Data is stored as objects rather than traditional file systems or databases.
Object storage is designed for massive scale and high durability.

Details

Object storage is designed for storing large amounts of unstructured data. Unlike databases that store structured records, object storage handles files such as images, videos, logs, and backups.

Each file is stored as an object, which includes the data itself along with metadata and a unique identifier. This structure allows systems to efficiently retrieve and manage large volumes of data.

Cloud object storage services like AWS S3, Google Cloud Storage, and Azure Blob Storage are built for durability and scalability. They automatically replicate data across multiple locations to prevent data loss.

Because of this design, object storage can scale to handle massive datasets without requiring complex infrastructure management. It is a core component in modern applications that need reliable and scalable file storage.

Cloud Networking

Cloud networking connects services securely and enables communication between components in a cloud system.

🌐

External

🚪

⚖️

🖥️

📦

Gateway controls entry → load balancer distributes → service responds

Virtual networks isolate and organize cloud resources.
Load balancers distribute traffic across multiple services.
Gateways control access between internal and external systems.

Details

Cloud networking provides the infrastructure that allows different services to communicate within a cloud environment. Without it, compute instances, databases, and storage systems would not be able to interact effectively.

Virtual networks act as isolated environments where resources can be grouped and controlled. They define how services are connected and who can access them.

Load balancers distribute incoming traffic across multiple compute instances. This improves performance and ensures systems remain available even if one instance fails.

Gateways manage communication between internal cloud resources and external clients or networks. They help enforce security policies and control how data flows in and out of the system.

Together, these components enable secure, reliable, and scalable communication across distributed cloud applications.

Regions

A region is a geographic location where cloud providers deploy infrastructure across multiple data centers.

Region A

🖥️

multiple zones

Region B

🖥️

standby region

Each region contains multiple data centers for availability.

Regions consist of multiple data centers in a specific geographic area.
Deploying closer to users reduces latency.
Using multiple regions improves disaster recovery and resilience.

Details

Cloud providers operate infrastructure across many geographic locations worldwide. Each of these locations is called a region, and it contains multiple data centers that host cloud services.

Regions are important because physical distance affects performance. Deploying applications closer to users reduces network latency, resulting in faster response times.

They also play a key role in reliability. If one region experiences a failure, systems can be designed to fail over to another region, ensuring continued operation.

By distributing systems across regions, applications can achieve both better global performance and stronger disaster recovery capabilities.

Availability Zones

Availability zones are isolated data center groups within a region that improve reliability and fault tolerance.

Region

AZ 1

⚙️

isolated zone

AZ 2

⚙️

isolated zone

AZ 3

⚙️

isolated zone

Load shared→zones operate independently

Workloads are distributed across multiple zones for reliability.

Each region is divided into multiple isolated availability zones.
Failures in one zone do not affect others.
Running across zones increases availability and redundancy.

Details

Within a cloud region, infrastructure is further divided into availability zones. Each zone consists of one or more data centers that are physically separated from other zones in the same region.

This separation is intentional. If a failure occurs in one availability zone—such as a power outage or hardware issue—it does not impact the others. This provides fault isolation within a region.

Applications can be deployed across multiple availability zones to ensure high availability. If one zone becomes unavailable, traffic can be redirected to systems running in other zones.

This design allows cloud systems to maintain uptime even when parts of the infrastructure fail, making availability zones a critical component of resilient system architecture.

Major Cloud Providers

Major cloud providers offer infrastructure services that allow applications to run without owning physical servers.

🧑‍💻

Application

deploys to

☁️ AWS

EC2

RDS

VPC

global infrastructure

☁️ GCP

Compute

Storage

BigQuery

VPC

global infrastructure

☁️ Azure

Blob

SQL

Network

global infrastructure

Applications run on distributed services managed by cloud providers — no physical servers needed.

Leading providers include AWS, Google Cloud Platform, and Microsoft Azure.
They offer services for compute, storage, databases, networking, and monitoring.
Applications run on globally distributed infrastructure managed by these platforms.

Details

Cloud providers are companies that operate large-scale infrastructure and make it available to users over the internet. Instead of building and maintaining their own data centers, organizations rely on these platforms to run their applications.

The three largest providers are Amazon Web Services (AWS), Google Cloud Platform (GCP), and Microsoft Azure. While each platform has its own tools and naming conventions, they all offer similar core services such as compute, storage, databases, networking, and monitoring.

These providers operate data centers around the world, allowing applications to run on globally distributed infrastructure. This enables systems to scale, improve performance for users in different locations, and maintain reliability.

By abstracting away physical hardware, cloud providers allow developers to focus on building and running applications rather than managing infrastructure.

Question Section

Try to answer in your own words first, then flip the card to check.

1 / 5

Why Cloud Infrastructure Exists

Cloud infrastructure replaces physical server management with on-demand resources delivered over the internet.

Traditional Infrastructure

Capacity (fixed)

built for peak → waste or overload

Cloud Infrastructure

scales with demand → no guesswork

Traditionally, companies had to purchase and maintain their own hardware.
Scaling required manual setup and planning for peak capacity.
Cloud platforms provide compute, storage, and networking on demand.

Details

Cloud infrastructure solves this by providing resources over the internet. Instead of managing physical machines, developers request compute power, storage, and networking through cloud platforms.

This shift is a major foundation of modern backend systems, enabling faster development, lower operational overhead, and more flexible scaling.

Compute Services

Compute services provide the CPU, memory, and runtime needed to execute application code in the cloud.

🖥️ VM

Hardware

Runtime

App Code

you manage most layers

📦 Container

Hardware

Runtime

App Code

platform handles OS

⚡ Serverless

Hardware

Runtime

App Code

only write code

more control →← less management

They run application code using virtual machines, containers, or serverless functions.
Compute services provide the core resources required to execute applications.
Different models offer tradeoffs between control and abstraction.

Details

All of these models provide essential resources such as CPU, memory, and execution environments. The difference lies in how much responsibility the developer has in managing the system.

Choosing the right compute model depends on the level of control needed and the complexity of the application being built.

Managed Databases

Managed databases handle setup, maintenance, and scaling so applications can focus on storing and accessing data.

Self-Managed

🧑‍💻 Setup DB

🧑‍💻 Configure Backup

🧑‍💻 Handle Scaling

🧑‍💻 Manage Failover

you handle everything

Managed Database

💻

App

↓

🗄️

managed system

🔁 backup

📈 scaling

🛡 failover

platform handles infrastructure

Cloud providers manage database setup, backups, and replication.
They automatically handle scaling and high availability.
Applications interact with the database without managing infrastructure.

Details

Object Storage

Object storage stores unstructured data like files and media with high durability and scalability.

💻

Client

🖼️

image.png

🎬

video.mp4

📄

report.pdf

📊

logs.json

🗂️

Object Storage

🖼️

image.png

object + metadata

🎬

video.mp4

object + metadata

📄

report.pdf

object + metadata

📊

logs.json

object + metadata

📦

Zone 1

📦

Zone 2

📦

Zone 3

Objects are replicated across zones for durability

It is used to store files such as images, videos, logs, and backups.
Data is stored as objects rather than traditional file systems or databases.
Object storage is designed for massive scale and high durability.

Details

Object storage is designed for storing large amounts of unstructured data. Unlike databases that store structured records, object storage handles files such as images, videos, logs, and backups.

Each file is stored as an object, which includes the data itself along with metadata and a unique identifier. This structure allows systems to efficiently retrieve and manage large volumes of data.

Cloud Networking

Cloud networking connects services securely and enables communication between components in a cloud system.

🌐

External

🚪

⚖️

🖥️

📦

Gateway controls entry → load balancer distributes → service responds

Virtual networks isolate and organize cloud resources.
Load balancers distribute traffic across multiple services.
Gateways control access between internal and external systems.

Details

Virtual networks act as isolated environments where resources can be grouped and controlled. They define how services are connected and who can access them.

Load balancers distribute incoming traffic across multiple compute instances. This improves performance and ensures systems remain available even if one instance fails.

Gateways manage communication between internal cloud resources and external clients or networks. They help enforce security policies and control how data flows in and out of the system.

Together, these components enable secure, reliable, and scalable communication across distributed cloud applications.

Regions

A region is a geographic location where cloud providers deploy infrastructure across multiple data centers.

Region A

🖥️

multiple zones

Region B

🖥️

standby region

Each region contains multiple data centers for availability.

Regions consist of multiple data centers in a specific geographic area.
Deploying closer to users reduces latency.
Using multiple regions improves disaster recovery and resilience.

Details

Cloud providers operate infrastructure across many geographic locations worldwide. Each of these locations is called a region, and it contains multiple data centers that host cloud services.

Regions are important because physical distance affects performance. Deploying applications closer to users reduces network latency, resulting in faster response times.

They also play a key role in reliability. If one region experiences a failure, systems can be designed to fail over to another region, ensuring continued operation.

By distributing systems across regions, applications can achieve both better global performance and stronger disaster recovery capabilities.

Availability Zones

Availability zones are isolated data center groups within a region that improve reliability and fault tolerance.

Region

AZ 1

⚙️

isolated zone

AZ 2

⚙️

isolated zone

AZ 3

⚙️

isolated zone

Load shared→zones operate independently

Workloads are distributed across multiple zones for reliability.

Each region is divided into multiple isolated availability zones.
Failures in one zone do not affect others.
Running across zones increases availability and redundancy.

Details

Within a cloud region, infrastructure is further divided into availability zones. Each zone consists of one or more data centers that are physically separated from other zones in the same region.

This separation is intentional. If a failure occurs in one availability zone—such as a power outage or hardware issue—it does not impact the others. This provides fault isolation within a region.

Applications can be deployed across multiple availability zones to ensure high availability. If one zone becomes unavailable, traffic can be redirected to systems running in other zones.

This design allows cloud systems to maintain uptime even when parts of the infrastructure fail, making availability zones a critical component of resilient system architecture.

Major Cloud Providers

Major cloud providers offer infrastructure services that allow applications to run without owning physical servers.

🧑‍💻

Application

deploys to

☁️ AWS

EC2

RDS

VPC

global infrastructure

☁️ GCP

Compute

Storage

BigQuery

VPC

global infrastructure

☁️ Azure

Blob

SQL

Network

global infrastructure

Applications run on distributed services managed by cloud providers — no physical servers needed.

Leading providers include AWS, Google Cloud Platform, and Microsoft Azure.
They offer services for compute, storage, databases, networking, and monitoring.
Applications run on globally distributed infrastructure managed by these platforms.

Details

By abstracting away physical hardware, cloud providers allow developers to focus on building and running applications rather than managing infrastructure.

Question Section

Try to answer in your own words first, then flip the card to check.

1 / 5

Cloud Infrastructure Basics

Why Cloud Infrastructure Exists

Compute Services

Managed Databases

Object Storage

Cloud Networking

Regions

Availability Zones

Major Cloud Providers

Question Section

Related lessons

Cookie Consent

Cloud Infrastructure Basics

Why Cloud Infrastructure Exists

Compute Services

Managed Databases

Object Storage

Cloud Networking

Regions

Availability Zones

Major Cloud Providers

Question Section

Related lessons