Commit f017cf8

vllm korea meetup
Signed-off-by: rebel-jiwonk <[email protected]>
1 parent ab0a471 commit f017cf8

1 file changed: 0 additions, 1 deletion
_posts/2025-09-10-vllm-meetup.md

Lines changed: 0 additions & 1 deletion
@@ -53,7 +53,6 @@ He also explained the need for hardware-specific compilation, noting how memory
</picture><br>
</p>

-
Hong-Seok Kim, Chief Software Architect at Rebellions, spoke about the growing importance of vLLM for AI accelerator startups and shared how Rebellions is contributing to the broader AI inference serving ecosystem. He highlighted how vLLM’s hardware plugin system enables companies like Rebellions to support developers in deploying LLMs on custom hardware — delivering a near-seamless experience comparable to running on GPUs.

Thanks to vLLM, engineers can now run MoE (Mixture of Experts) models directly on Rebellions’ NPU, while also leveraging core optimizations like parallelism and continuous batching — all without complex integration steps. This opens the door to efficient, scalable AI serving on next-generation accelerators.
