Ant64 Sound Architecture
The Ant64 audio system is not designed merely to play sound — it is designed to invite musical creation.
The Lineage
Historically, several home computers unexpectedly became instruments:
| Machine | Hardware | Legacy |
|---|---|---|
| Commodore 64 | SID chip | Spawned a generation of musicians |
| Atari ST | Built-in MIDI + deterministic timing | Became the heart of countless studios |
| Amiga | Sample-based audio + trackers | Created a new culture of electronic composition |
In each case, the designers did not explicitly set out to build a "music workstation". Instead, they created systems where sound was immediate, programmable, constrained, and deeply tied to the machine itself.
Ant64 intentionally follows this lineage — not by emulation, but by evolution.
Sound on Ant64 is...
Hardware-first. Deterministic. Memory-mapped. Deeply programmable. Capable of misuse and experimentation.
These qualities are essential for a system to become a creative platform rather than a closed appliance.
Architectural Overview
Audio generation on Ant64 is primarily handled by the FireStorm FPGA subsystem, with coordination and control provided by:
- Pulse Sequencer (RP2350) – MIDI, audio input, live control, sequencing
- SG2000 RISC-V – composition, analysis, AI-assisted workflows
- Copper & Update Units – real-time, timeline-driven audio manipulation
The FPGA audio engine is not a fixed "sound chip" — it is a reconfigurable, multi-paradigm audio fabric.
Voice Architecture
Polyphony
The baseline Ant64 configuration supports 128 simultaneous hardware-mixed voices, with headroom for more depending on sample rate, effects usage, and synthesis mode.
Voices are not uniform; each voice can independently operate in one of several modes:
| Mode | Description |
|---|---|
| Sample playback | High-fidelity sample replay with pitch control |
| FM synthesis | Operator-based frequency modulation |
| Hybrid FM + sample | Sample waveforms as FM operators |
| Granular synthesis | Grain-based time and pitch manipulation |
| Procedural waveform | Algorithmically generated waveforms |
| Phase-locked playback | Time-stretched, phase-aligned replay |
Voices are mixed in hardware, with per-voice spatialisation and effects routing.
Voice Components
Each voice may include:
- Amplitude Envelope — ADSR or multi-segment
- Pitch Envelope
- Per-voice filter — LP / HP / BP, resonant
- Per-voice LFOs — multiple, routable
- Stereo positioning
- Velocity and aftertouch modulation
- Copper-controlled parameter overrides
This mirrors — and in many cases exceeds — the per-voice control found in modern synthesizers.
Synthesis Modes
Sample-Based Synthesis
DDR3 / Flash → Sample Engine → Pitch / Rate Control → Mixer
↓
Loop / Crossfade / Granular Slice
- High-quality samples stored in DDR3 or flash
- Variable playback rate
- Independent pitch and duration control
- Looping, cross-fades, and granular slicing
This enables Fairlight-style instruments, MOD/XM-style trackers, and modern sampler workflows.
FM Synthesis (Enhanced)
Ant64 supports FM synthesis with sample-based operators — not limited to sine waves.
Operator A (sine / sample / procedural)
| modulates
Operator B (sine / sample / procedural)
| modulates
Operator C --> Output
- Configurable operator graphs
- Feedback paths
- Envelope and LFO modulation per operator
This allows:
- Classic DX-style FM
- Acid-style basslines (303-like)
- Hybrid FM/sample textures impossible on vintage hardware
Granular Synthesis
Granular synthesis treats sound as a stream of tiny grains — each independently controllable.
Implemented using the existing audio update units:
- Samples are divided into grains
- Grain position, duration, pitch, and density are hardware-controlled
- Real-time modulation via Copper or Pulse
Enables:
- Time-stretching without pitch change
- Textural soundscapes
- Evolving pads and ambiences
Phase Vocoder / Time-Domain Processing
Ant64 supports pitch-independent time manipulation using hardware-assisted resampling and phase-aligned processing:
| Operation | Result |
|---|---|
| Change pitch | Length unchanged |
| Change length | Pitch unchanged |
| Spectral transform | Both pitch and length modified |
While full FFT-based vocoding is handled in software (SG2000 + TPU), the FPGA provides the real-time primitives required for live playback.
Effects Architecture
Available Effects
| Effect | Scope | Notes |
|---|---|---|
| Reverb | Per-voice / Global | Multi-tap delay network |
| Delay / Echo | Per-voice / Group | Tempo-syncable |
| Chorus / Flanger | Per-voice / Group | BBD-style implementation |
| Distortion / Saturation | Per-voice | Soft and hard clipping modes |
| Filters & EQ | Per-voice / Global | Resonant LP/HP/BP |
Routing is flexible and fully programmable.
Reverb
Implemented using multi-tap delay lines, feedback networks, and damping filters:
- Room, hall, and plate-style reverbs
- Low-latency — suitable for live play
- Minimal FPGA resource usage
Reverb parameters can be Copper-controlled, LFO-modulated, or MIDI-automated.
Copper-Controlled Audio
One of Ant64's most distinctive features.
Just as the Copper can alter graphics registers per scanline, it can manipulate audio at sample-accurate precision:
- Change voice parameters at precise times
- Modulate filters rhythmically
- Trigger events sample-accurately
- Synchronize sound tightly with visuals
This enables effects that are extremely precise, impossible in software alone, and characteristic of the best classic hardware systems.
Pulse Sequencer Integration
The Pulse Sequencer (RP2350) acts as the real-time musical interface:
Connectivity
- USB MIDI
- DIN / DOUT / DTHRU MIDI (Creative Edition)
- Analog control pots
- Mouse and external controllers
- Live audio input
Responsibilities
- MIDI parsing
- Sequencing & clocking
- Control data routing to FireStorm and SG2000
It communicates via SPI, presenting audio control as memory-mapped registers.
Analysis & AI-Assisted Features
Audio analysis is handled by the SG2000 CPU and its on-chip TPU ("AI chip"):
Audio In --> FPGA Capture --> SG2000 + TPU
|
Note detection / Transcription
Pattern recognition
Intelligent sequencing
The FPGA does not perform heavy analysis — it focuses on deterministic sound generation.
Comparison to Modern Synthesizers
| Feature | Modern Synth | Ant64 |
|---|---|---|
| Polyphony | 8–128 voices | 128+ voices |
| Plugin latency | Often present | None — hardware only |
| Timing | Variable | Deterministic |
| Programmability | Limited | Deep — full hardware access |
| Price | High | Significantly lower |
| Also a computer? | No | Yes |
| Graphics workstation? | No | Yes |
| Open hardware platform? | No | Yes |
Creative Intent
Ant64 is designed so that unexpected uses of sound are not only possible, but encouraged.
Just as the C64, Atari ST, and Amiga were adopted by musicians for reasons their designers did not fully anticipate, Ant64 aims to create the same conditions:
- Powerful but understandable hardware — you can reason about what it does
- Constraints that inspire creativity — limits breed innovation
- Direct access to sound generation — no abstraction layers between you and the hardware
- Tight coupling between audio, visuals, and interaction — sound is part of the machine, not an addon
The goal is not merely to reproduce classic sound — but to create the environment in which the next musical subculture can emerge.