QuillCore LLC — Minimum-Write Integer Sort

The Problem

Nobody Changed
the Sort

Modern write-heavy databases — built on LSM-tree storage engines — perform continuous compaction: merging and re-sorting data files to maintain read performance. Compaction is fundamentally a sort, and it dominates the write I/O budget of every high-throughput storage system.

The sort algorithms used today were designed decades ago to minimize comparisons. But flash storage doesn't wear out from comparisons — it wears from writes. Every physical write permanently consumes a NAND cell's finite endurance budget. As the industry transitions from TLC to QLC NAND — which offers 3–10× lower endurance per cell — the cost of unnecessary writes is compounding faster than ever.

90–95%

of write I/O in write-heavy LSM systems is attributed to compaction operations — a widely documented production characteristic

10–30×

write amplification commonly measured in production LSM deployments — each logical byte triggers many physical writes

// Write count comparison · same dataset · same result

Traditional Sort

O(n log n) writes

C4 Sort

n writes (minimum)

↓ Structural write reduction — not tuning, not policy

Conventional sort: Write count scales with n log n
Flash endurance: Finite P/E cycle budget per cell, shrinking with density
C4 Sort: Each element written to its final position exactly once

Licensing Opportunities

Your Competitors Will
License This. Will You?

C4 Sort is available for commercial licensing today. If your system runs a sort during compaction, segment merge, ingest flush, or garbage collection — you are paying an unnecessary write tax every single time. That stops the day you license C4 Sort.

Proven Write Reduction

18–23× fewer writes per sort vs comparison-based algorithms — and exactly n writes minimum, formally proven. Not a benchmark claim, a mathematical guarantee.

Drop-In Integration

Replace your sort kernel at the call site. No schema changes, no format migration, no rewrites to surrounding logic. Works within your existing architecture.

Immediate Impact

Write reduction begins with the first sort operation after integration. NVMe lifespan, compaction cost, and I/O overhead all improve from day one — no warm-up period.

Engineering Support Included

Every license includes full integration documentation and direct engineering access through your first production deployment. You are not integrating alone.

Technical Briefing Under NDA

Full algorithm walkthrough with the inventor — proof structure, integration architecture, and performance characteristics on your specific workload profile.

Flexible Royalty Structure

Royalty terms structured to your deployment model — per-unit, per-deployment, or flat platform license. Upcoming architecture ports (x86-64, RISC-V, GPU) can be included at signing.

Your Industry

Sort

5 domains

SEGMENT 01 OF 05

LSM Database Engines

RocksDB · Cassandra · DynamoDB · Bigtable · TiKV

Compaction is the dominant source of write I/O in every write-heavy LSM database — accounting for 90–95% of all physical writes. At its core, compaction is a sort. Replacing the sort kernel is a targeted, low-risk intervention with no schema or format changes required.

Largest near-term opportunity — every high-write database deployment worldwide is affected

Why Now

01 / 05

QLC NAND Adoption

Enterprise fleets are transitioning to higher-density QLC NAND, which carries significantly lower endurance per cell. Every LSM write system on QLC is increasingly exposed to early wear-out. Write efficiency becomes a survival requirement.

02 / 05

Data Volume Growth Outpacing Endurance

Global data creation is growing at ~23% annually. Flash endurance improvements have stalled. The gap between write workloads and hardware capacity to absorb them is widening every year — and the write amplification tax compounds with it.

03 / 05

Infrastructure Cost Pressure

Cloud storage prices have reversed their decade-long decline. FinOps is now a board-level concern at enterprises with $10M+ cloud spend. Engineering teams are actively seeking algorithmic efficiency — not just hardware upgrades.

04 / 05

RocksDB Ecosystem Standardization

RocksDB has become the de facto storage engine for write-optimized databases — DynamoDB, TiKV, CockroachDB, Yugabyte, MyRocks, and Kafka all depend on it. A single compaction kernel integration reaches the entire ecosystem simultaneously.

05 / 05

AI Infrastructure Expansion

The AI buildout of 2023–2025 created a new class of write-heavy workload: vector databases, model checkpoint storage, and LLM serving log pipelines all generate high-frequency writes to the same NVMe hardware — compounding the endurance problem across every tier.

Proven Properties

Four Properties.
Simultaneously.

C4 Sort is formally proven to achieve four properties that no prior published algorithm holds at the same time. Each is independently verifiable — not a benchmark claim, but a mathematical guarantee.

PROVEN

PROPERTY 01

Minimum Element Writes

Each element written to its final position exactly once

Elements already in the correct position are never touched. Every other element moves exactly once — directly to its destination. This is the theoretical write minimum for any in-place sort, previously only achieved at quadratic time cost. C4 Sort reaches it in linear time.

PROVEN

PROPERTY 02

Zero Element Comparisons

No element-to-element comparisons, on any input

The algorithm never compares two elements against each other. Output positions are determined arithmetically before any element moves. This sidesteps the classical comparison lower bound entirely and produces perfectly predictable behavior across all input distributions.

PROVEN

PROPERTY 03

Fully Parallel, Lock-Free

Scales linearly with available cores

Output destinations are pre-assigned to threads with zero overlap before any element moves. Parallel phases run with no locks, no atomics, no contention. Proven: no two threads ever write to the same memory location, regardless of input data.

PROVEN

PROPERTY 04

In-Place & Linear Time

Sorts in-place with minimal auxiliary memory

Operates directly on the input array without merge buffers or output copies — essential for memory-constrained environments. Time complexity is linear in element count. An adaptive variant handles sparse value domains with comparable efficiency at no additional element-write cost.

Prior art: To our knowledge, no previously published algorithm simultaneously achieves minimum writes, zero comparisons, full parallelism, and in-place linear-time execution. Each property exists in isolation across the literature. C4 Sort is the first to hold all four.

How It Works

Every Element.
One Write. One Move.

Watch C4 Sort in action. Each bar represents a value. In Phase 1, the data is surveyed. In Phase 2, every destination is pre-computed. In Phase 3, each element lands in its final position — exactly once, directly.

C4 SORT Click to run Writes: 0 / —

Each element moves to its final position exactly once — no revisits, no comparisons

Phase One

Survey

The algorithm reads the entire dataset once to build a complete picture of the input distribution. Nothing moves. No writes occur.

Read-only · zero writes

Phase Two

Plan

Every element's exact final destination is computed before anything moves. Thread assignments are resolved. No two elements will ever compete for the same slot.

Zero conflicts · pre-resolved

Phase Three

Place

Each element travels directly to its pre-determined destination in a single write. Multiple threads execute simultaneously. Every write is final — no element is touched twice.

One write per element · fully parallel

The Race

Six Algorithms.
One Winner.

The same shuffled dataset. Six of the fastest and most widely used sorting algorithms. Run head-to-head on identical input — and watch the write counters.

Demonstrates write-count behavior on uniform random integer input. All algorithms receive the same data.

Input Size N = 48

C4 Sort: — · Timsort: — · drag to preview

C4 Sort

QuillCore LLC · Proprietary

Ready

writes: —

aux: —

The Field

Timsort

Python · Java · Android

Ready

writes: —

aux: —

Quicksort

C · C++ · Rust std

Ready

writes: —

aux: —

Merge Sort

Stable · classic

Ready

writes: —

aux: —

Heapsort

In-place O(n log n)

Ready

writes: —

aux: —

Radix Sort

Integer-only LSD

Ready

writes: —

aux: —

Comparison Sorts — Writes Since Page Load

~5.2M unnecessary writes/sec · Global LSM compaction estimate

C4 Sort — Writes Since Page Load

~260K writes/sec · 20× fewer at the same global scale

0 writes saved so far

Applications

Five Infrastructure Domains

C4 Sort's write-minimization advantage applies wherever integers are sorted under write-sensitive conditions — which spans the core I/O path of modern infrastructure.

LSM Database Compaction

RocksDB · Cassandra · CockroachDB · TiKV

Compaction is the dominant source of write I/O in every write-heavy LSM database. At its core, compaction is a sort — and replacing the sort kernel is a targeted, low-risk intervention that doesn't touch data structures, schemas, or storage formats. Every high-write database deployment worldwide is affected by this bottleneck.

Drop-in integration path with no schema or format changes required

Full-Text Search & Indexing

Elasticsearch · Solr · OpenSearch · Lucene

Lucene-based search engines perform segment merges — sorted integer merges of posting lists — as their primary background I/O workload. The operation is structurally identical to LSM compaction, making a single integration point reachable across thousands of enterprise search deployments.

One integration point reaches thousands of enterprise deployments globally

Time-Series Platforms

InfluxDB · TimescaleDB · Prometheus · VictoriaMetrics

Time-series systems sort ingest buffers before every flush to storage. The workload is inherently write-heavy with near-zero updates — and it's one of the fastest-growing infrastructure segments, driven by IoT, observability pipelines, and AI telemetry at scale.

Market growing at 22% CAGR — fastest-expanding segment in the stack

Analytical Column Stores

ClickHouse · DuckDB · Snowflake · BigQuery

OLAP systems sort integer columns at ingest before writing to storage. Better sort quality enables better run-length compression — producing smaller files that reduce I/O cost on every subsequent query, compounding the benefit across the full lifetime of stored data.

Compression gains compound across every downstream query

NVMe Firmware & Storage

ZNS NVMe · Open-Channel · Computational Storage · BlueStore

Flash Translation Layer garbage collection sorts logical block addresses to coalesce valid pages. Host-managed storage stacks expose this sort as a direct integration point — and block addresses are dense integers, exactly the domain C4 Sort is built for.

Strategic long-term position — sort kernel embedded at the drive level

Development Status

Where We Are

C4 Sort is complete, proven, and available for commercial licensing today. The algorithm's core properties are formally established, the ARM64 implementation is benchmarked, and integration support is included as part of every licensing agreement.

Algorithm formally provenTermination, sortedness, write minimality, and parallel disjointness each independently verified.

Native ARM64 implementationProduction-grade implementation in AArch64 assembly with C parallel runtime. Benchmarked on Apple M-series.

Write-count benchmarksReduction confirmed across multiple distributions. Honest exception: reversed input, where both algorithms achieve n writes.

Performance characterizationWall-time profiling underway across multiple configurations. Write-count advantage is distribution-independent; wall-time scaling depends on parallelism and hardware.

Additional architecture portsCurrent implementation targets one native architecture. Ports to additional platforms are the next development priority.

Production deploymentIntegration paths for leading database and search engines are designed and documented. Included as part of the commercial licensing agreement.

Licensing Tracks

AVAILABLE

Database engine licenseIntegrate C4 Sort as your compaction or merge sort kernel. Includes integration documentation, technical support, and a royalty structure based on deployment scale.

AVAILABLE

Storage firmware licenseEmbed C4 Sort at the FTL or block address sorting layer of your NVMe, ZNS, or computational storage product. Per-unit royalty structure available.

AVAILABLE

Platform / OEM licenseLicense C4 Sort for distribution within your platform, cloud product, or managed service. Covers all downstream customer deployments under a single agreement.

ROADMAP

x86-64 & RISC-V portsNative implementations for additional architectures are in active development. License agreements can be structured to include upcoming ports at signing.

ROADMAP

GPU implementationCUDA/ROCm implementation for analytical database offload and parallel sort acceleration. Available for early licensing discussions with analytical infrastructure teams.

Licensing

Three Ways to
License C4 Sort

C4 Sort is proprietary technology available under commercial license. All license discussions begin with an NDA and a technical briefing. Integration support is included.

🗄️

Software License

License C4 Sort for integration into your database engine, search platform, or time-series system. Use it as a drop-in compaction or merge sort kernel. Royalty structure scales with deployment size.

🔩

Firmware / OEM License

Embed C4 Sort in your NVMe controller, storage firmware, or computational storage product. Per-unit royalty structure available. Reduces write I/O at the hardware layer before software touches it.

🏢

Platform License

License C4 Sort for distribution within your cloud platform, managed database service, or infrastructure product. A single agreement covers all downstream customer deployments at scale.

Your Sort Is Wearing
Out Your Hardware

Nobody Changed
the Sort

Your Competitors Will
License This. Will You?

QLC NAND Adoption

Data Volume Growth Outpacing Endurance

Infrastructure Cost Pressure

RocksDB Ecosystem Standardization

AI Infrastructure Expansion

Four Properties.
Simultaneously.

Every Element.
One Write. One Move.

Six Algorithms.
One Winner.

What This Means

Five Infrastructure Domains

Where We Are

Three Ways to
License C4 Sort

Start a Licensing Conversation.

Your Sort Is WearingOut Your Hardware

Nobody Changedthe Sort

Your Competitors WillLicense This. Will You?

QLC NAND Adoption

Data Volume Growth Outpacing Endurance

Infrastructure Cost Pressure

RocksDB Ecosystem Standardization

AI Infrastructure Expansion

Four Properties.Simultaneously.

Every Element.One Write. One Move.

Six Algorithms.One Winner.

What This Means

Five Infrastructure Domains

Where We Are

Three Ways toLicense C4 Sort

Start a Licensing Conversation.

Your Sort Is Wearing
Out Your Hardware

Nobody Changed
the Sort

Your Competitors Will
License This. Will You?

Four Properties.
Simultaneously.

Every Element.
One Write. One Move.

Six Algorithms.
One Winner.

Three Ways to
License C4 Sort