"降到" ("dropped to") - WWW.YOUINFO.SITE

LinuxDo Latest Topics · 2026-06-09 11:07:39+08:00 · tech

The personal verification fee has dropped. 6 posts - 6 participants


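Downgraded Claude Code from Max to Pro; Codex will be my main tool from now on

LinuxDo Latest Topics · 2026-06-07 18:10:18+08:00 · tech

Claude Code has been performing terribly lately, far worse than Codex. Going forward Codex will be my main tool, and I'm keeping Claude Code Pro only for occasional cross-model validation. 3 posts - 2 participants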


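Does driving an external 4K monitor really put this much load on an M5 Air?

LinuxDo Latest Topics · 2026-06-04 18:21:38+08:00 · tech

Does driving an external 4K monitor really put this much load on an M5 Air? At a 165 Hz refresh rate the GPU went straight to over 80 °C. Dropping to 60 Hz brought it down to 60 °C right away, and adding one of those USB router cooling fans lowered it further to 40 °C (with the air conditioner running at 28 °C the whole time). 3 posts - 2 participants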
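
For a rough sense of scale, the sketch below compares how many pixels per second the display pipeline has to produce at the two refresh rates mentioned in the post. It assumes "4K" means 3840x2160 (the post does not state the exact resolution), and it says nothing about temperature itself, only about the relative pixel throughput.

```python
# Assumption: "4K" is taken to mean 3840x2160; the post does not give the exact mode.
WIDTH, HEIGHT = 3840, 2160


def pixels_per_second(refresh_hz: int) -> int:
    """Pixels the display pipeline must deliver each second at a given refresh rate."""
    return WIDTH * HEIGHT * refresh_hz


high = pixels_per_second(165)
low = pixels_per_second(60)
ratio = high / low

print(f"165 Hz: {high:,} px/s")
print(f" 60 Hz: {low:,} px/s")
print(f"ratio:  {ratio:.2f}x")
```

Under that assumption, 165 Hz asks for 2.75 times the pixel throughput of 60 Hz, which is at least consistent with the direction of the temperature change the poster saw.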


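Cutting Pod Ready for a 10.8 GB vLLM image from 4m35s to 14s: Hermes + SOCI lazy loading, measured

V2EX - Tech · 2026-05-28 16:17:58+08:00 · tech

I have recently been looking at the cold-start problem for AI inference services on Kubernetes, and found that the slow part is often not just model loading: the container image itself is enormous. Images such as vLLM bundle PyTorch, CUDA, Python dependencies, and system libraries, and easily exceed 10 GB. On the traditional containerd / overlayfs path, a node has to download and unpack the entire image before the Pod can actually start. In elastic scale-out scenarios such as Karpenter, this time is very noticeable.

We built a small project, Hermes: https://github.com/cloudpilot-ai/hermes

The idea: application teams do not change their Dockerfile, do not rebuild images, do not change CI/CD, and do not change the original image reference. The platform side defines a HermesPolicy, a controller automatically builds and caches a SOCI index in-cluster for each matching image, and a daemon on the node then uses these indexes for lazy loading.

This time we ran a simple comparison on EKS + Karpenter. The image, about 10.8 GB, is:

763104351884.dkr.ecr.us-east-1.amazonaws.com/vllm:0.9-gpu-py312-ec2

On a regular node, from the Pod being scheduled onto the node until the container is Running/Ready:

5m04s - 29s = 4m35s

On a Hermes-enabled node, with the HermesPolicy already Ready and the SOCI artifact already built:

44s - 30s = 14s

So in this scenario, the stretch from image pull/mount to container start dropped from 4m35s to 14s.

One caveat needs stressing: this result excludes the first-time index build, and it is not the same as vLLM first token latency. A faster Pod Ready only shows that the container image path was optimized by lazy loading. We still need to measure vLLM readiness, first-request TTFT, and real request latency after warmup.

Hermes is currently positioned more as a cluster-side capability: applications keep shipping the original OCI image, and the platform decides by policy which images should be lazy loaded. Something like:

apiVersion: hermes.cloudpilot.ai/v1alpha1 kind: HermesPolicy metadata: name: prod-large-images spec: paused: false imageSelectors: - imageRegex: ".*vllm.*" platforms: - linux/amd64

The project is still at an early stage; you are welcome to follow it: https://github.com/cloudpilot-ai/hermes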
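
The two subtractions in the post can be reproduced with a few lines of Python. This is only a sketch of the arithmetic: it assumes each pair of numbers is a "Ready" mark and a "scheduled" mark read off the same clock (my reading of the post, not something it states explicitly), and `parse_duration` and `format_duration` are hypothetical helpers written for this example, not part of Hermes or Kubernetes.

```python
import re
from datetime import timedelta


def parse_duration(text: str) -> timedelta:
    """Parse a duration written like '5m04s' or '29s' (hypothetical helper)."""
    match = re.fullmatch(r"(?:(\d+)m)?(\d+)s", text)
    if match is None:
        raise ValueError(f"unrecognized duration: {text!r}")
    minutes = int(match.group(1) or 0)
    seconds = int(match.group(2))
    return timedelta(minutes=minutes, seconds=seconds)


def format_duration(delta: timedelta) -> str:
    """Format a timedelta back into the '4m35s' / '14s' style used in the post."""
    minutes, seconds = divmod(int(delta.total_seconds()), 60)
    return f"{minutes}m{seconds:02d}s" if minutes else f"{seconds}s"


# Figures quoted in the post; their interpretation as (ready, scheduled) is an assumption.
baseline = parse_duration("5m04s") - parse_duration("29s")
with_hermes = parse_duration("44s") - parse_duration("30s")
speedup = baseline / with_hermes  # timedelta / timedelta yields a float

print(format_duration(baseline))     # 4m35s
print(format_duration(with_hermes))  # 14s
print(f"{speedup:.1f}x faster")      # 19.6x faster
```

The ratio covers only the image pull/mount-to-Ready window with a prebuilt SOCI index, so it should not be read as an end-to-end inference speedup.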


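iPhone 17 Pro Max 256 GB drops to 7999 yuan under Pinduoduo's "Ten Billion Subsidy"

LinuxDo Latest Topics · 2026-05-27 17:33:42+08:00 · tech

Anyone who has been waiting can probably go ahead and buy now. It sat at 8299 for many days and was just changed to 7999! When I checked a moment ago it also came with 10-day price protection. You can also look in your "Free Services" section for "180-day full-coverage replacement"; if you have it, the deal is even better. 6 posts - 4 participants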


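Question: why are Claude and similar channels cutting prices everywhere today?

LinuxDo Latest Topics · 2026-05-25 20:30:07+08:00 · tech

Question: why are Claude and similar channels cutting prices everywhere today? AWS Bedrock was at 2.5 to 2.8 yesterday, and today it is down to around 2. 3 posts - 3 participants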


I had a dream

LinuxDo Latest Topics · 2026-05-25 13:26:59+08:00 · tech

I dreamed that a Lenovo 16 GB DDR5 memory module dropped to ¥559 on JD.com. 7 posts - 5 participants


GPT 5.5's recent "dumbing down" feels truly outrageous

V2EX - Tech · 2026-05-23 18:33:27+08:00 · tech