BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//Allsikt//Article Deadline//EN
CALSCALE:GREGORIAN
METHOD:PUBLISH
BEGIN:VEVENT
UID:f7c9644b87781dfb58e37ed31d78ae7bfc83c7ed@allsikt.tech
DTSTAMP:20260604T010804Z
DTSTART;VALUE=DATE:20260604
DTEND;VALUE=DATE:20260605
SUMMARY:Benchmarking MTP on vLLM and llama.cpp for Gemma 4 and Qwen 3.6
DESCRIPTION:Benchmark MTP on vLLM and llama.cpp to find the optimal speculative token count per model and measure speedups.\n\nSource: Reddit r/LocalLLaMA\nOpen: https://allsikt.se/article/benchmarking-mtp-on-vllm-and-llama-cpp-for-gemma-4-and-qwen-3-6-2642b5b5
URL:https://allsikt.se/article/benchmarking-mtp-on-vllm-and-llama-cpp-for-gemma-4-and-qwen-3-6-2642b5b5
STATUS:CONFIRMED
TRANSP:TRANSPARENT
END:VEVENT
END:VCALENDAR
