Skip to content

docs: add AI Cloud LLM service documentation#522

Open
swordqiu wants to merge 2 commits intoyunionio:masterfrom
swordqiu:feat/add-aicloud-llm-docs
Open

docs: add AI Cloud LLM service documentation#522
swordqiu wants to merge 2 commits intoyunionio:masterfrom
swordqiu:feat/add-aicloud-llm-docs

Conversation

@swordqiu
Copy link
Copy Markdown
Member

@swordqiu swordqiu commented Mar 2, 2026

添加 Cloudpods LLM 服务的完整文档,包括功能介绍、使用指南和最佳实践。

文档内容

  • LLM 服务概述和核心特性
  • 架构设计和核心组件说明
  • 核心功能详解(LLM实例管理、即时模型、Dify集成、MCP Agent)
  • CLI 命令使用指南
  • API 参考
  • 最佳实践建议
  • 故障排查指南

文档路径

该文档基于 cloudpods/pkg/llm 源码的实际功能编写。

- **弹性扩缩容**:基于容器化技术,支持灵活的弹性伸缩
- **Dify 集成**:内置 Dify 知识库和 AI 工作流平台
- **MCP Agent**:支持 Model Context Protocol,实现 AI 智能体与云平台的无缝交互
- **GPU 调度优化**:支持 NVIDIA GPU 等异构计算资源的智能调度
Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

GPU调度优化不在LLM服务内体现


- **多模型支持**:支持 Ollama、vLLM 等多种 LLM 运行框架
- **即时模型管理**:提供预置模型镜像,实现秒级模型部署
- **弹性扩缩容**:基于容器化技术,支持灵活的弹性伸缩
Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

弹性伸缩不在LLM服务内体现

- **MCP Agent**:支持 Model Context Protocol,实现 AI 智能体与云平台的无缝交互
- **GPU 调度优化**:支持 NVIDIA GPU 等异构计算资源的智能调度

## 架构设计
Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

请移除架构设计章节

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant