Compute Requirements

The following table lists the recommended server specifications for deploying the core components of the MLAngles LLMOps Platform. They are sized for stable performance and efficient resource use across the platform's frontend, backend, database, pipeline processing, and VPN access layers.

Note: Each server is listed with its AWS instance type and the equivalent vCPU and RAM figures, so the same sizing can be applied to on-premises or other cloud deployments.

| Server | OS Type | Instance Type (AWS) | Equivalent Specs (vCPU/RAM) | Root Volume (GB) | Secondary Volume (GB) |
|---|---|---|---|---|---|
| Frontend | Ubuntu 22.04 LTS | t3a.large | 2 vCPUs, 8 GB RAM | 8 | 10 |
| Backend | Ubuntu 22.04 LTS | r5a.xlarge | 4 vCPUs, 32 GB RAM | 15 | 50 |
| Database | Ubuntu 22.04 LTS | m6a.large | 2 vCPUs, 8 GB RAM | 10 | - |
| OpenVPN | Ubuntu 24.04 LTS | t3a.micro | 2 vCPUs, 1 GB RAM | 8 | - |
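
For capacity planning, it can help to hold the table as data and total it. The sketch below is illustrative only: `ServerSpec`, `SPECS`, and `totals` are hypothetical names, not part of the platform. It assumes the volume columns read as 8 GB root / 10 GB secondary for Frontend and 15 GB root / 50 GB secondary for Backend, and treats "-" as no secondary volume.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ServerSpec:
    """One row of the compute requirements table (hypothetical helper type)."""
    name: str
    os: str
    instance_type: str
    vcpus: int
    ram_gb: int
    root_gb: int
    secondary_gb: Optional[int]  # None where the table shows "-"


# Values transcribed from the table above (volume split is an assumption).
SPECS = [
    ServerSpec("Frontend", "Ubuntu 22.04 LTS", "t3a.large", 2, 8, 8, 10),
    ServerSpec("Backend", "Ubuntu 22.04 LTS", "r5a.xlarge", 4, 32, 15, 50),
    ServerSpec("Database", "Ubuntu 22.04 LTS", "m6a.large", 2, 8, 10, None),
    ServerSpec("OpenVPN", "Ubuntu 24.04 LTS", "t3a.micro", 2, 1, 8, None),
]


def totals(specs):
    """Sum vCPUs, RAM, and disk (root + secondary) across all servers."""
    return {
        "vcpus": sum(s.vcpus for s in specs),
        "ram_gb": sum(s.ram_gb for s in specs),
        "storage_gb": sum(s.root_gb + (s.secondary_gb or 0) for s in specs),
    }


if __name__ == "__main__":
    print(totals(SPECS))  # {'vcpus': 10, 'ram_gb': 49, 'storage_gb': 101}
```

Under these assumptions the four servers together need 10 vCPUs, 49 GB of RAM, and 101 GB of disk, which is a useful lower bound when sizing a single hypervisor or a cloud quota.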

Hardware Specification Details:

  • Frontend Server: Requires moderate compute resources (2 CPU cores, 8 GB memory) suitable for serving web interfaces and handling user interactions
  • Backend Server: Memory-optimized configuration (4 CPU cores, 32 GB memory) to handle API requests, data processing, and model inference tasks
  • Database Server: Balanced compute and memory (2 CPU cores, 8 GB memory) for efficient database operations
  • VPN Server: Minimal resources (2 CPU cores, 1 GB memory) sufficient for secure network access management
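
For on-premises hosts, the figures above can be checked against what the machine actually reports. The following is a minimal sketch, not an official preflight tool: `host_resources` and `meets_spec` are hypothetical helpers, the RAM query assumes a Linux or Unix host where `os.sysconf` exposes `SC_PHYS_PAGES`, and the 10% RAM tolerance is an assumption that allows for memory reserved by firmware and the kernel.

```python
import os


def host_resources():
    """Return (logical CPU count, total RAM in GiB) for this host.

    Relies on os.sysconf, so it is expected to work on Linux and most Unix
    systems, but not on Windows.
    """
    cpus = os.cpu_count() or 0
    ram_bytes = os.sysconf("SC_PAGE_SIZE") * os.sysconf("SC_PHYS_PAGES")
    return cpus, ram_bytes / 1024**3


def meets_spec(cpus, ram_gib, required_vcpus, required_ram_gb, ram_tolerance=0.1):
    """Check measured resources against a required spec.

    ram_tolerance (assumed default 10%) accepts hosts that report slightly
    less usable memory than their nominal size.
    """
    enough_cpu = cpus >= required_vcpus
    enough_ram = ram_gib >= required_ram_gb * (1 - ram_tolerance)
    return enough_cpu and enough_ram


if __name__ == "__main__":
    cpus, ram = host_resources()
    # Backend server requirement from the list above: 4 cores, 32 GB memory.
    print("Backend-capable host:", meets_spec(cpus, ram, 4, 32))
```

Keeping the comparison in a pure function makes it easy to reuse the same check for each server role by passing in that role's core and memory figures.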

Storage Notes:

  • Root volumes host the operating system and application binaries
  • Secondary volumes provide additional storage for logs, temporary files, and application data
  • All storage volumes should use SSD or equivalent high-performance storage to avoid I/O bottlenecks
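
On a Linux host, one way to sanity-check the SSD recommendation is to read the kernel's rotational flag for a block device. This is a sketch under stated assumptions: `is_ssd` is a hypothetical helper, the device name (for example `sda` or `nvme0n1`) depends on the host, and on virtualized or cloud volumes the flag reflects what the hypervisor exposes rather than the physical medium, so treat the result as a hint.

```python
from pathlib import Path


def is_ssd(device, sys_block="/sys/block"):
    """Return True if Linux reports the block device as non-rotational.

    Reads <sys_block>/<device>/queue/rotational, where "0" means
    non-rotational (SSD or NVMe) and "1" means a spinning disk.
    Returns None when the flag cannot be read, e.g. the device does not
    exist or the host is not Linux.
    """
    flag = Path(sys_block) / device / "queue" / "rotational"
    try:
        return flag.read_text().strip() == "0"
    except OSError:
        return None


if __name__ == "__main__":
    # "sda" is only an example device name; adjust it for the actual host.
    print("sda is SSD:", is_ssd("sda"))
```

The `sys_block` parameter exists only so the function can be pointed at a test directory; in normal use the default path applies.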