LLamaMe 2.0: Introducing Multi-Model Support

We are happy to announce the release of LLamaMe v2.0, which expands the service in the CZ and RZ to support access to multiple on-prem large language models (LLMs). Existing LLamaMe API keys provisioned through LaunchIT now provide access to all of the models listed below for the applicable network zone.

For more information, see the LLamaMe documentation: https://hpc.llnl.gov/services/cloud-services/ai-ml-services/llamame-llm…

CZ/RZ LLamaMe Supported Models
Network	Models	GPUs
CZ	meta-llama/Llama-3.3-70B-Instruct, openai/gpt-oss-120b	4 NVIDIA A100 80GB
RZ	meta-llama/Meta-Llama-3.1-8B-Instruct	2 NVIDIA A100 40GB
RZ	Llama-3.3-70B-Instruct, Codestral-22B-v0.1, Llama-4-Scout-17B-16E-Instruct, gpt-oss-120b	16 AMD MI250 120GB
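
Because model availability now differs by network zone, a script may want to check a model name against the table before sending a request. The following is a minimal sketch of such a check. It only transcribes the table above into a dictionary; the names `SUPPORTED_MODELS`, `models_for_zone`, and `is_supported` are hypothetical helpers for illustration and are not part of the LLamaMe service or its API. Model names are copied exactly as they appear in the table, so the list would need updating if the service changes.

```python
# Hypothetical client-side lookup transcribed from the table above.
# This is NOT a LLamaMe API; it only mirrors the published model list.
SUPPORTED_MODELS = {
    "CZ": [
        "meta-llama/Llama-3.3-70B-Instruct",
        "openai/gpt-oss-120b",
    ],
    "RZ": [
        "meta-llama/Meta-Llama-3.1-8B-Instruct",
        "Llama-3.3-70B-Instruct",
        "Codestral-22B-v0.1",
        "Llama-4-Scout-17B-16E-Instruct",
        "gpt-oss-120b",
    ],
}


def models_for_zone(zone):
    """Return a copy of the model names listed for a network zone ("CZ" or "RZ")."""
    key = zone.strip().upper()
    if key not in SUPPORTED_MODELS:
        raise ValueError(f"Unknown network zone: {zone!r}")
    return list(SUPPORTED_MODELS[key])


def is_supported(zone, model):
    """Return True if the exact model name is listed for the given zone."""
    return model in models_for_zone(zone)
```

For example, `is_supported("CZ", "openai/gpt-oss-120b")` is true, while `is_supported("CZ", "Codestral-22B-v0.1")` is false because that model is listed only for the RZ.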