Hacker Newsnew | past | comments | ask | show | jobs | submit | fromlogin
NVIDIA DGX Spark In-Depth Review: A New Standard for Local AI Inference (lmsys.org)
115 points by yvbbrjdr 18 days ago | past | 93 comments
Deploying DeepSeek on 96 H100 GPUs (lmsys.org)
285 points by GabrielBianconi 63 days ago | past | 80 comments
Deploying DeepSeek on GB200 NVL72 with PD and Large Scale EP: 2.7x Throughput (lmsys.org)
1 point by gmays 4 months ago | past
Match DeepSeek's inference system performance with SGLang (lmsys.org)
1 point by echaozh 5 months ago | past
Does style matter? Disentangling style and substance in Chatbot Arena (lmsys.org)
2 points by ZeljkoS 8 months ago | past
Faster JSON Decoding for LLMs (lmsys.org)
1 point by gaocegege 10 months ago | past
Does style matter? Disentangling style and substance in Chatbot Arena (lmsys.org)
1 point by scottfr on Aug 29, 2024 | past
LLM Lookahead Decoding (lmsys.org)
2 points by mr-ai on Aug 20, 2024 | past
From Live Data to High-Quality Benchmarks: The Arena-Hard Pipeline – Lmsys Org (lmsys.org)
2 points by swyx on Aug 3, 2024 | past
Faster Open-Source Llama3 Serving with SGLang Runtime (vs. TensorRT-LLM, VLLM) (lmsys.org)
4 points by yvbbrjdr on July 25, 2024 | past
RouteLLM: An Open-Source Framework for Cost-Effective LLM Routing (lmsys.org)
4 points by adr1an on July 5, 2024 | past
RouteLLM: An Open-Source Framework for Cost-Effective LLM Routing (lmsys.org)
4 points by not-chatgpt on July 1, 2024 | past | 1 comment
Introducing Hard Prompts Category in Chatbot Arena (lmsys.org)
1 point by JumpCrisscross on June 21, 2024 | past
Introducing Hard Prompts Category in Chatbot Arena (lmsys.org)
1 point by CharlesW on May 20, 2024 | past
Hard Prompts Category in Chatbot Arena (lmsys.org)
1 point by imjonse on May 17, 2024 | past
Whats up with Llama-3? LMSYS leaderboard analysis (lmsys.org)
2 points by aadillpickle on May 15, 2024 | past
What's Up with Llama 3? (lmsys.org)
4 points by tosh on May 9, 2024 | past
Gpt2-Chatbot Removed from Lmsys (lmsys.org)
39 points by synthwave on April 30, 2024 | past | 11 comments
Lmsys Chatbot Arena: Benchmarking LLMs in the Wild (lmsys.org)
2 points by EvgeniyZh on April 10, 2024 | past
Fast JSON Decoding for Local LLMs with Compressed Finite State Machine (lmsys.org)
1 point by yeesian on March 7, 2024 | past
Mistral AI launches Mixtral-Next (lmsys.org)
204 points by varunvummadi on Feb 17, 2024 | past | 49 comments
Mistral releases their latest prototype model, Next, to Chatbot Arena (lmsys.org)
2 points by vagabund on Feb 16, 2024 | past
Fastest JSON Decoding for Local LLMs with Compressed Finite State Machine (lmsys.org)
2 points by MMMercy2 on Feb 5, 2024 | past
5x LLM Throughput with SGLang and RadixAttention (lmsys.org)
2 points by DreamGen on Jan 19, 2024 | past
Fast and Expressive LLM Inference with RadixAttention and SGLang (lmsys.org)
11 points by MMMercy2 on Jan 17, 2024 | past
Chatbot Arena (lmsys.org)
1 point by pallas_athena on Dec 14, 2023 | past
Chatbot Arena: New models and Elo system update (lmsys.org)
3 points by joak on Dec 14, 2023 | past
Chatbot Arena: New models and Elo system update (lmsys.org)
1 point by tosh on Dec 8, 2023 | past
Chatbot Arena: New models and Elo system update (lmsys.org)
1 point by EvgeniyZh on Dec 7, 2023 | past
Talk anonymously to ChatGPT, Claude, Llama and vote for the better model (lmsys.org)
3 points by vincent_s on Dec 1, 2023 | past | 1 comment

Consider applying for YC's Winter 2026 batch! Applications are open till Nov 10

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: