Cloud Infrastructure Basics
Understand foundational cloud infrastructure concepts that power modern backend deployments.
Why Cloud Infrastructure Exists
Cloud infrastructure replaces physical server management with on-demand resources delivered over the internet.
- Traditionally, companies had to purchase and maintain their own hardware.
- Scaling required manual setup and planning for peak capacity.
- Cloud platforms provide compute, storage, and networking on demand.
Details
Before cloud computing, companies had to build and manage their own infrastructure. This meant purchasing physical servers, setting up data centers, handling cooling and power, and maintaining hardware over time.
Scaling systems was especially difficult. If traffic increased, companies needed to buy and install more servers in advance. If demand dropped, those resources would sit unused, leading to wasted cost.
Cloud infrastructure solves this by providing resources over the internet. Instead of managing physical machines, developers request compute power, storage, and networking through cloud platforms.
This allows systems to scale dynamically based on demand, without upfront hardware investment. Infrastructure becomes something that can be provisioned instantly rather than physically built and maintained.
This shift is a major foundation of modern backend systems, enabling faster development, lower operational overhead, and more flexible scaling.
Compute Services
Compute services provide the CPU, memory, and runtime needed to execute application code in the cloud.
- They run application code using virtual machines, containers, or serverless functions.
- Compute services provide the core resources required to execute applications.
- Different models offer tradeoffs between control and abstraction.
Details
Compute services are the foundation of cloud platforms—they are responsible for actually running application code. Without compute, other services like storage or databases would have nothing to interact with.
Cloud providers offer different ways to run code. Virtual machines give full control over the operating system, container runtimes provide lightweight and portable environments, and serverless functions allow code to run without managing infrastructure at all.
All of these models provide essential resources such as CPU, memory, and execution environments. The difference lies in how much responsibility the developer has in managing the system.
For example, services like AWS EC2, Google Compute Engine, and Azure Virtual Machines allow users to configure and manage virtual servers. More abstract services, like serverless platforms, remove much of that responsibility.
Choosing the right compute model depends on the level of control needed and the complexity of the application being built.
Managed Databases
Managed databases handle setup, maintenance, and scaling so applications can focus on storing and accessing data.
- Cloud providers manage database setup, backups, and replication.
- They automatically handle scaling and high availability.
- Applications interact with the database without managing infrastructure.
Details
Databases are critical to most applications, but managing them manually is complex and time-consuming. Traditionally, teams had to install database software, configure backups, handle failures, and manage scaling themselves.
Managed database services shift this responsibility to the cloud provider. Services like AWS RDS, Google Cloud SQL, and Azure SQL Database automatically handle tasks such as provisioning, patching, and monitoring.
They also provide built-in features like automated backups, replication across multiple locations, and scaling based on demand. This improves reliability and ensures data is available even during failures.
As a result, developers can focus on application logic and data modeling rather than infrastructure management. This significantly reduces operational complexity while maintaining high performance and availability.
Object Storage
Object storage stores unstructured data like files and media with high durability and scalability.
- It is used to store files such as images, videos, logs, and backups.
- Data is stored as objects rather than traditional file systems or databases.
- Object storage is designed for massive scale and high durability.
Details
Object storage is designed for storing large amounts of unstructured data. Unlike databases that store structured records, object storage handles files such as images, videos, logs, and backups.
Each file is stored as an object, which includes the data itself along with metadata and a unique identifier. This structure allows systems to efficiently retrieve and manage large volumes of data.
Cloud object storage services like AWS S3, Google Cloud Storage, and Azure Blob Storage are built for durability and scalability. They automatically replicate data across multiple locations to prevent data loss.
Because of this design, object storage can scale to handle massive datasets without requiring complex infrastructure management. It is a core component in modern applications that need reliable and scalable file storage.
Cloud Networking
Cloud networking connects services securely and enables communication between components in a cloud system.
- Virtual networks isolate and organize cloud resources.
- Load balancers distribute traffic across multiple services.
- Gateways control access between internal and external systems.
Details
Cloud networking provides the infrastructure that allows different services to communicate within a cloud environment. Without it, compute instances, databases, and storage systems would not be able to interact effectively.
Virtual networks act as isolated environments where resources can be grouped and controlled. They define how services are connected and who can access them.
Load balancers distribute incoming traffic across multiple compute instances. This improves performance and ensures systems remain available even if one instance fails.
Gateways manage communication between internal cloud resources and external clients or networks. They help enforce security policies and control how data flows in and out of the system.
Together, these components enable secure, reliable, and scalable communication across distributed cloud applications.
Regions
A region is a geographic location where cloud providers deploy infrastructure across multiple data centers.
- Regions consist of multiple data centers in a specific geographic area.
- Deploying closer to users reduces latency.
- Using multiple regions improves disaster recovery and resilience.
Details
Cloud providers operate infrastructure across many geographic locations worldwide. Each of these locations is called a region, and it contains multiple data centers that host cloud services.
Regions are important because physical distance affects performance. Deploying applications closer to users reduces network latency, resulting in faster response times.
They also play a key role in reliability. If one region experiences a failure, systems can be designed to fail over to another region, ensuring continued operation.
By distributing systems across regions, applications can achieve both better global performance and stronger disaster recovery capabilities.
Availability Zones
Availability zones are isolated data center groups within a region that improve reliability and fault tolerance.
- Each region is divided into multiple isolated availability zones.
- Failures in one zone do not affect others.
- Running across zones increases availability and redundancy.
Details
Within a cloud region, infrastructure is further divided into availability zones. Each zone consists of one or more data centers that are physically separated from other zones in the same region.
This separation is intentional. If a failure occurs in one availability zone—such as a power outage or hardware issue—it does not impact the others. This provides fault isolation within a region.
Applications can be deployed across multiple availability zones to ensure high availability. If one zone becomes unavailable, traffic can be redirected to systems running in other zones.
This design allows cloud systems to maintain uptime even when parts of the infrastructure fail, making availability zones a critical component of resilient system architecture.
Major Cloud Providers
Major cloud providers offer infrastructure services that allow applications to run without owning physical servers.
- Leading providers include AWS, Google Cloud Platform, and Microsoft Azure.
- They offer services for compute, storage, databases, networking, and monitoring.
- Applications run on globally distributed infrastructure managed by these platforms.
Details
Cloud providers are companies that operate large-scale infrastructure and make it available to users over the internet. Instead of building and maintaining their own data centers, organizations rely on these platforms to run their applications.
The three largest providers are Amazon Web Services (AWS), Google Cloud Platform (GCP), and Microsoft Azure. While each platform has its own tools and naming conventions, they all offer similar core services such as compute, storage, databases, networking, and monitoring.
These providers operate data centers around the world, allowing applications to run on globally distributed infrastructure. This enables systems to scale, improve performance for users in different locations, and maintain reliability.
By abstracting away physical hardware, cloud providers allow developers to focus on building and running applications rather than managing infrastructure.
Question Section
1 / 5
This track is locked
Buy this track once to unlock all of its lessons.