Blog
Posts, notes, and articles.

The Cache Coherence Protocol Of Modern Cpus: Mesi, Moesi, And Mesif With Snooping Vs. Directory Based
2020-11-02A comprehensive technical exploration of the cache coherence protocol of modern cpus: mesi, moesi, and mesif with snooping vs. directory based, covering key concepts, practical implementations, and real-world applications.

Garbage Collection Advanced: Incremental, Concurrent, Snapshot-at-the-Beginning, and Pauseless Collectors from Azul C4 to ZGC Colored Pointers
2020-10-31A deep exploration of advanced garbage collection algorithms that eliminate stop-the-world pauses — incremental marking, concurrent collection, the snapshot-at-the-beginning barrier, and the pauseless collectors that make Java viable for low-latency applications.

Building A Custom Risc V Processor In Verilog: Pipelining, Hazard Detection, And Branch Prediction
2020-10-28A comprehensive technical exploration of building a custom risc v processor in verilog: pipelining, hazard detection, and branch prediction, covering key concepts, practical implementations, and real-world applications.

JIT Compilation: Tracing vs Method JITs, V8's Ignition+TurboFan Pipeline, HotSpot's C1/C2 Tiered Compilation, and Deoptimization
2020-10-28A deep exploration of just-in-time compilation — how V8 and HotSpot turn JavaScript and Java bytecode into native code through multi-tier compilation pipelines, and the art of deoptimization that makes speculative optimization safe.

Wasm Runtime Internals: V8's Liftoff and TurboFan, Wasmtime's Cranelift, Linear Memory Sandboxing, and the Stack Machine Model
2020-10-15A deep exploration of WebAssembly runtime internals — how V8 and Wasmtime compile and execute Wasm bytecode, the linear memory sandbox that enables secure execution, and the stack machine model at Wasm's core.

A Deep Dive Into The X86 64 Instruction Set: Microbenchmarking Instruction Latency And Throughput
2020-10-04A comprehensive technical exploration of a deep dive into the x86 64 instruction set: microbenchmarking instruction latency and throughput, covering key concepts, practical implementations, and real-world applications.

Implementing A Simd Accelerated Suffix Array Construction For Large Genomic Datasets
2020-10-01A comprehensive technical exploration of implementing a simd accelerated suffix array construction for large genomic datasets, covering key concepts, practical implementations, and real-world applications.

Container Internals: Linux Namespaces, cgroups v2, OverlayFS, and the OCI Runtime Spec Under the Hood
2020-09-28A deep exploration of the Linux kernel primitives that power container runtimes — the seven namespace types, cgroups v2 resource control, OverlayFS copy-on-write storage, and the OCI runtime specification that ties them together.

The Performance Of Bit Packed Data Structures Vs. Classic Arrays In High Performance Computing
2020-09-27A comprehensive technical exploration of the performance of bit packed data structures vs. classic arrays in high performance computing, covering key concepts, practical implementations, and real-world applications.

Designing A Linear Time Algorithm For Constructing A Prefix Automaton For Multiple Patterns: Aho Corasick
2020-09-25A comprehensive technical exploration of designing a linear time algorithm for constructing a prefix automaton for multiple patterns: aho corasick, covering key concepts, practical implementations, and real-world applications.

Compiler Optimizations: From Source Code to Fast Machine Code
2020-09-23A deep dive into how modern compilers transform your code into efficient machine code. Explore optimization passes from constant folding to loop vectorization, and learn how to write code that compilers can optimize effectively.

An Analysis Of Register Allocation Via Graph Coloring: Chaitin Briggs Algorithm With Live Range Splitting
2020-09-17A comprehensive technical exploration of an analysis of register allocation via graph coloring: chaitin briggs algorithm with live range splitting, covering key concepts, practical implementations, and real-world applications.

Building A Just In Time Compiler For Regular Expressions: Thompson’S Construction And Dfa Minimization
2020-09-17A comprehensive technical exploration of building a just in time compiler for regular expressions: thompson’s construction and dfa minimization, covering key concepts, practical implementations, and real-world applications.

The Implementation Of A Garbage Collector For A Functional Language: Cheney’S Algorithm And Generational Copying
2020-09-16A comprehensive technical exploration of the implementation of a garbage collector for a functional language: cheney’s algorithm and generational copying, covering key concepts, practical implementations, and real-world applications.

Designing A Type System For Dependently Typed Programming: Agda’S Universe Hierarchy And Pattern Matching
2020-09-03A comprehensive technical exploration of designing a type system for dependently typed programming: agda’s universe hierarchy and pattern matching, covering key concepts, practical implementations, and real-world applications.

A Proof Of Correctness For The Lamport’S Bakery Algorithm In Distributed Shared Memory Systems
2020-08-28A comprehensive technical exploration of a proof of correctness for the lamport’s bakery algorithm in distributed shared memory systems, covering key concepts, practical implementations, and real-world applications.

Merkle Trees and Content‑Addressable Storage
2020-08-17From Git to distributed object stores: how Merkle DAGs enable integrity, deduplication, and efficient sync.

The Formal Verification Of A Concurrent Work Queue Using The Tla+ Model Checker: Invariants And Liveness
2020-08-10A comprehensive technical exploration of the formal verification of a concurrent work queue using the tla+ model checker: invariants and liveness, covering key concepts, practical implementations, and real-world applications.

Implementing A Small Theorem Prover For Propositional Logic Using The Dpll Algorithm With Conflict Driven Clause Learning
2020-08-09A comprehensive technical exploration of implementing a small theorem prover for propositional logic using the dpll algorithm with conflict driven clause learning, covering key concepts, practical implementations, and real-world applications.

The Algorithmic Methodology Of Coq Proof Assistant: Inductive Types, Tactics, And Extraction To Ocaml
2020-08-08A comprehensive technical exploration of the algorithmic methodology of coq proof assistant: inductive types, tactics, and extraction to ocaml, covering key concepts, practical implementations, and real-world applications.

Building A Distributed Certificate Transparency Log Using Merkle Trees And Append Only Proofs
2020-07-31A comprehensive technical exploration of building a distributed certificate transparency log using merkle trees and append only proofs, covering key concepts, practical implementations, and real-world applications.

Hypervisor Internals: VT-x, AMD-V/SVM, Nested Paging, and the Mechanics of Trap-and-Emulate
2020-07-25A deep exploration of hardware virtualization support — how Intel VT-x and AMD-V enable efficient hypervisors through VM control structures, nested paging, and the clever elimination of slow trap-and-emulate paths.

A Deep Dive Into Merkle Damgård Vs. Sponge Construction For Hash Functions: Sha 256 And Sha 3
2020-07-24A comprehensive technical exploration of a deep dive into merkle damgård vs. sponge construction for hash functions: sha 256 and sha 3, covering key concepts, practical implementations, and real-world applications.

The Implementation Of A Secure Channel Protocol Using Noise Framework: Handshake Patterns And State Machine
2020-07-11A comprehensive technical exploration of the implementation of a secure channel protocol using noise framework: handshake patterns and state machine, covering key concepts, practical implementations, and real-world applications.