Inference Performance – NVIDIA Technical Blog http://www.open-lab.net/ko-kr/blog Mon, 07 Jul 2025 12:13:12 +0000 ko-KR hourly 1 NVIDIA Jetson? RTX?? Google DeepMind? Gemma 3n ???? http://www.open-lab.net/ko-kr/blog/run-google-deepminds-gemma-3n-on-nvidia-jetson-and-rtx/ Fri, 04 Jul 2025 11:46:07 +0000 http://www.open-lab.net/ko-kr/blog/?p=3917 Reading Time: 3 minutes ?? NVIDIA? RTX ? Jetson ????? Gemma 3n? ?? ?????. ??? Google I/O?? Google DeepMind? ??? Gemma?, ???? ????? ??? ???? ? ?? ?? ??? ???? ????. ?? Gemma?? ?? 3.5 ???? ???? ???? ?? ??? ?? ??? ?? ??? ??? ???????. ????? Universal Speech Model, ??? MobileNet v4, ???? MatFormer ? ???? ? ??? … Continued]]> Reading Time: 3 minutes ?? NVIDIA? RTX ? Jetson ????? Gemma 3n? ?? ?????. ??? Google I/O?? Google DeepMind? ??? Gemma?, ???? ????? ??? ???? ? ?? ?? ??? ???? ????. ?? Gemma?? ?? 3.5 ???? ???? ???? ?? ??? ?? ??? ?? ??? ??? ???????. ????? Universal Speech Model, ??? MobileNet v4, ???? MatFormer ? ???? ? ??? ???? ?? ?? ??? ????? ???? ????. ?? ??? ?? ??? Per-Lay Embeddings?? ???? ??? ?????.

Source

]]>
3917
NVIDIA Blackwell ??? DeepSeek-R1 ?? ???? ?? ?? ?? http://www.open-lab.net/ko-kr/blog/nvidia-blackwell-delivers-world-record-deepseek-r1-inference-performance/ Wed, 28 May 2025 06:22:07 +0000 http://www.open-lab.net/ko-kr/blog/?p=3813 Reading Time: 9 minutes NVIDIA? NVIDIA GTC 2025?? DeepSeek-R1 ?? ?? ?? ??? ??????. 8?? NVIDIA Blackwell GPU? ??? ?? NVIDIA DGX ???? ??? ??? 6,710? ? ???? DeepSeek-R1 ???? ???? ?? 250?? ?? ?? ?? ?? 30,000?? ??? ???? ??? ? ????. ??? ?? ???? ????? ??? ??? NVIDIA Blackwell ????? ???? NVIDIA? ??? ?? ??? ?? ??? … Continued]]> Reading Time: 9 minutes NVIDIA? NVIDIA GTC 2025?? DeepSeek-R1 ?? ?? ?? ??? ??????. 8?? NVIDIA Blackwell GPU? ??? ?? NVIDIA DGX ???? ??? ??? 6,710? ? ???? DeepSeek-R1 ???? ???? ?? 250?? ?? ?? ?? ?? 30,000?? ??? ???? ??? ? ????. ??? ?? ???? ????? ??? ??? NVIDIA Blackwell ????? ???? NVIDIA? ??? ?? ??? ?? ??? ?? ??? ??????. ??? ?? ??? NVIDIA ???? ?? NVIDIA Blackwell Ultra GPU? NVIDIA Blackwell…

Source

]]>
3813
Blackwell, Meta? Llama 4 Maverick? ??? ???? 1,000 TPS ?? ?? http://www.open-lab.net/ko-kr/blog/blackwell-breaks-the-1000-tps-user-barrier-with-metas-llama-4-maverick/ Wed, 28 May 2025 04:05:13 +0000 http://www.open-lab.net/ko-kr/blog/?p=3821 Reading Time: 6 minutes NVIDIA? ?? ?? ??? ?? ?? ??(LLM) ?? ??? ??????. NVIDIA Blackwell GPU 8?? ??? ?? NVIDIA DGX B200 ??? Llama 4 ??? ? ?? ?? ??? ??? 4?? ???? ??? Llama 4 Maverick ???? ???? ?? 1,000??(TPS)? ?? ??? ??? ? ????. ? ??? AI ???? ???? Artificial Analysis? ?? ????? ???????. ?? ????, … Continued]]> Reading Time: 6 minutes NVIDIA? ?? ?? ??? ?? ?? ??(LLM) ?? ??? ??????. NVIDIA Blackwell GPU 8?? ??? ?? NVIDIA DGX B200 ??? Llama 4 ??? ? ?? ?? ??? ??? 4?? ???? ??? Llama 4 Maverick ???? ???? ?? 1,000??(TPS)? ?? ??? ??? ? ????. ? ??? AI ???? ???? Artificial Analysis? ?? ????? ???????. ?? ????, NVIDIA Blackwell? Llama 4? ?? ?? ?? ?????? ??? ????? ?? ?????. ???? ?????, ?? ??? ????? ????…

Source

]]>
3821
NVIDIA Dynamo, ??? ?? ?? ??? ?? llm-d ???? ????? ??? http://www.open-lab.net/ko-kr/blog/nvidia-dynamo-accelerates-llm-d-community-initiatives-for-advancing-large-scale-distributed-inference/ Wed, 21 May 2025 02:52:29 +0000 http://www.open-lab.net/ko-kr/blog/?p=3808 Reading Time: 3 minutes 2025? Red Hat Summit?? ??? llm-d ????? ???? ????? ??? AI ?? ??? ???? ??? ?????.llm-d? vLLM? Inference Gateway ?? ?????, Kubernetes ?? ????? ?? ??? ?? ??? ?? vLLM? ??? ?????. ? ???? llm-d ????? ???? ?? NVIDIA Dynamo ?? ??? ?????. ?? ?? ??? ?? ??? ?? ??? ??, ?????, ??? ?? ??? … Continued]]> Reading Time: 3 minutes 2025? Red Hat Summit?? ??? llm-d ????? ???? ????? ??? AI ?? ??? ???? ??? ?????.llm-d? vLLM? Inference Gateway ?? ?????, Kubernetes ?? ????? ?? ??? ?? ??? ?? vLLM? ??? ?????. ? ???? llm-d ????? ???? ?? NVIDIA Dynamo ?? ??? ?????. ??? ?? ??? ??, ?????, ??? ?? ??? ?? ?? ?? ??? ????, ?? ?? ? ?? ?? ?? ??? ???? ??? ?????. ??, ??? ??? ????? prefill? decode ??? GPU…

Source

]]>
3808
NVIDIA ??? ???? ?? AI ?? ?? ??? http://www.open-lab.net/ko-kr/blog/optimize-ai-inference-performance-with-nvidia-full-stack-solutions/ Thu, 15 May 2025 03:31:05 +0000 http://www.open-lab.net/ko-kr/blog/?p=3729 Reading Time: 5 minutes 2025? 3? 18??? NVIDIA Triton Inference Server? ?? NVIDIA Dynamo? ??? ???????. AI ?? ??????? ???? ???, ??? ?? ??? ?? ??? ? ?? ???? ??? ??? ???? ?? ???? AI ??? ???? ?? ?? ??? ?? ????. NVIDIA? ?, ???, ?????? ??? ??? ??? ?? ????? AI ???? ????? ???? ??? ??? ??? ? … Continued]]> Reading Time: 5 minutes 2025? 3? 18??? NVIDIA Triton Inference Server? ?? NVIDIA Dynamo? ??? ???????. AI ?? ??????? ???? ???, ??? ?? ??? ?? ??? ? ?? ???? ??? ??? ???? ?? ???? AI ??? ???? ?? ?? ??? ?? ????. NVIDIA? ?, ???, ?????? ??? ??? ??? ?? ????? AI ???? ????? ???? ??? ??? ??? ? ??? ???? ????, ??? AI ??? ? ?? ??? ? ???, ?????, ?? ???????. ???? ?? ?? ??? ?? ??? ??? ?????.

Source

]]>
3729
NVSwitch? TensorRT-LLM ????? 3? ?? AllReduce ?? http://www.open-lab.net/ko-kr/blog/3x-faster-allreduce-with-nvswitch-and-tensorrt-llm-multishot/ http://www.open-lab.net/ko-kr/blog/3x-faster-allreduce-with-nvswitch-and-tensorrt-llm-multishot/#respond Fri, 15 Nov 2024 05:54:47 +0000 http://www.open-lab.net/ko-kr/blog/?p=3278 Reading Time: 3 minutes ??? ?? ?? ??? ??? ??? ???? ? ??? ?? ??? ??? ?? ???? ???? ??? AI ????? ???? ?? ?? ???? ????. ?? ???? ?? ?? ???? ????? GPU ??? ??? ??? ???? ?? GPU ??? ??????. ???? ??? ??? ?? NVIDIA NVLink Switch? ??? ?? ??? ?? 3??? ??? ??? ?? GPU ?? … Continued]]> Reading Time: 3 minutes ??? ?? ?? ??? ??? ??? ???? ? ??? ?? ??? ??? ?? ???? ???? ??? AI ????? ???? ?? ?? ???? ????. ?? ???? ?? ?? ???? ????? GPU ??? ??? ??? ???? ?? GPU ??? ??????. ???? ??? ??? ?? NVIDIA NVLink Switch? ??? ?? ??? ?? 3??? ??? ??? ?? GPU ?? ????, TensorRT-LLM ???? ?????. ? ?????? ? ??? ?? ?? GPU ??? ??? ??? ????? ??? ?????. ?? ??? ?? ??? ???? ?? GPU? ??? ??? ???? ?? GPU…

Source

]]>
http://www.open-lab.net/ko-kr/blog/3x-faster-allreduce-with-nvswitch-and-tensorrt-llm-multishot/feed/ 0 3278
人人超碰97caoporen国产