Qwen3.x and LLAMA.CPP – How To Extend Context Window Past 260k

Normally, Qwen3.x (3.5 and 3.6) models have a context limit of about 260k tokens. There are many scenarios where it would be advantageous to increase this to around 300k or 400k. One primary use case is having the model ingest a large number of files (usually source code) before working on a problem. Here are the …

Qwen3.5 27B Q8 – KV Cache Benchmarks BF16 vs F16 vs Q8_0

If you're curious about how much KV cache quantization affects Qwen3.5 27B, take a look at the table below. The model used in all of these benchmarks is Unsloth's Q8_K_XL.

KV Cache BF16 vs F16 vs Q8_0
KV Cache Type | Mean PPL(Q) | ΔPPL (Q - base) | PPL Ratio | ln Ratio | Mean KLD | Max KLD | RMS Δp (%) | Same Top-p (%)
BF16 | 6.8653 ± 0.04470 | — | — | — | — | — | — | —
F16 | 6.866214 …

Rose Online Asset File Format Technical Specification Part 2

This is part 2 of the Rose Online Asset File Format Technical Specification document. Part 1 can be found here. In part 1 we stopped right before getting into the IFO file format. 3.6 IFO Zone Object Files (.ifo) Object Type Enumeration: enum BlockType { DeprecatedMapInfo = 0, DecoObject = 1, Npc = 2, CnstObject …

Rose Online Asset File Format Technical Specification Part 1

This document will serve as a technical reference for the very old MMORPG Rose Online. The goal is that this information is never lost and may help decades later. This is part 1 of 2. The document got very large so I had to split it up for loading purposes. Part 2 can be found …

Accurate BGP Finite State Machine (FSM) Diagram

I made the following diagram for the BGP FSM because the ones shown by Cisco in their Cisco Press books don't accurately capture the RFC's rules. I've marked the one-way transitions as red arrows, while the two-way transitions are blue. I've also added a noteworthy item that shows you can go directly from the Active …