Kord Development Guide

Development Setup

Testing

cargo make test

Platform-Specific Setup: libtorch on Windows

If you need GPU training support with the tch backend on Windows, you'll need to install libtorch manually:

Download libtorch from https://pytorch.org/get-started/locally/
Follow the setup instructions at https://github.com/LaurentMazare/tch-rs?tab=readme-ov-file#libtorch-manual-install
Extract to a directory (e.g., C:\libtorch)
Set the environment variable:

$Env:LIBTORCH = "C:\libtorch"

ML Training Configuration

The ML pipeline exposes toggles that change both the training inputs and labels without modifying source code. Loader features and the deterministic guess flag can be combined, while target encodings are mutually exclusive so that model layouts stay predictable.

Training Precision Features

The project supports multiple training precision levels. Choose exactly one:

Feature	Description	Notes
`ml_train_precision_fp32`	Full 32-bit floating point training	Default; required for NdArray backend and HPT
`ml_train_precision_fp16`	Half precision (16-bit) training with dynamic loss scaling	Requires compatible backend (e.g., tch with CUDA)
`ml_train_precision_bf16`	BFloat16 training with dynamic loss scaling	Requires compatible backend (e.g., tch with CUDA)

Note: NdArray-based training and hyperparameter tuning require ml_train_precision_fp32. Inference always runs on the NdArray backend and automatically converts stored values to f32.

Storage Precision Features

Choose exactly one:

Feature	Description
`ml_store_precision_full`	Store models as full precision
`ml_store_precision_half`	Store models as half precision (smaller files)

Sample Loader Features

Choose exactly one:

Feature	Description	Input width (before deterministic guess)
`ml_loader_note_binned_convolution`	Uses the existing note-binned harmonic convolution (128 bins)	128
`ml_loader_mel`	Applies mel filter banks to the full spectrum (512 bands)	512
`ml_loader_frequency`	Feeds the raw 8,192-bin frequency spectrum	8192
`ml_loader_frequency_pooled`	Averages the raw spectrum into 2,048 pooled bins (factor ×4)	2048

Optional add-on:

Feature	Description
`ml_loader_include_deterministic_guess`	Prepends the deterministic 128-note guess vector to whichever loader you selected above (doubling 128-bin inputs, adding 128 to the others)

Target Encoding Features

Choose exactly one:

Feature	Description	Output width contribution
`ml_target_full`	Emits the full 128-note mask (per MIDI note across octaves)	+128
`ml_target_folded`	Emits a folded 12-class pitch-class mask (one octave)	+12
`ml_target_folded_bass`	Emits two 12-class masks: a categorical bass pitch class (trained with softmax / cross-entropy) and a multi-hot mask of every pitch class present across all octaves	+24

When using ml_target_folded_bass, the bass pitch uses softmax + cross-entropy loss while other notes use binary cross-entropy. Inference decodes bass via argmax to emit a single pitch class.

Example ML Feature Configurations

# Default (note-binned + deterministic guess, 128-note target)
cargo check

# Mel features with deterministic guess and folded targets
cargo check --no-default-features \
   --features "cli ml_infer ml_loader_mel ml_loader_include_deterministic_guess ml_target_folded"

# Raw frequency spectrum without deterministic guess, folded targets only
cargo check --no-default-features \
   --features "cli ml_infer ml_loader_frequency ml_target_folded"

# Pooled raw spectrum with deterministic guess, folded targets only
cargo check --no-default-features \
   --features "cli ml_infer ml_loader_frequency_pooled ml_loader_include_deterministic_guess ml_target_folded"

# Pooled spectrum with deterministic guess and folded+bass targets
cargo check --no-default-features \
   --features "cli ml_infer ml_loader_frequency_pooled ml_loader_include_deterministic_guess ml_target_folded_bass"

Make sure exactly one loader feature is enabled at a time, and exactly one target feature is enabled overall. The deterministic guess flag can be toggled independently to suit experiments.

Training Details

Uses cosine-annealed learning rate schedule starting from TrainConfig.adam_learning_rate
Reduced-precision training uses dynamic loss scaling with skip-on-overflow for gradient stability
Scale growth/backoff happens automatically per training step

Release and Publishing

Prerequisites for Publishing

Before running the release process, ensure you have:

✅ crates.io authentication: cargo login with your API token
✅ npm authentication: npm login or npm adduser
✅ GitHub Container Registry authentication: docker login ghcr.io -u USERNAME
✅ wasm32-wasip2 target: rustup target add wasm32-wasip2

✅ Required tools (prefer cargo-binstall for speed):

# Install cargo-binstall first for faster subsequent installations
curl -L --proto '=https' --tlsv1.2 -sSf https://raw.githubusercontent.com/cargo-bins/cargo-binstall/main/install-from-binstall-release.sh | bash

# Then install tools via binstall
cargo binstall --no-confirm cargo-release  # For version bumping
cargo binstall --no-confirm cargo-make     # For task orchestration
cargo binstall --no-confirm wkg            # For OCI publishing
cargo binstall --no-confirm wasm-pack      # For npm WASM builds

Complete Release Process

Follow these steps to cut a new release:

1. Prepare and Tag the Release

# Bump versions and create git tags using cargo-release
cargo make release

This will:

Update version numbers in all Cargo.toml files
Create git commits for version bumps
Create git tags (e.g., v0.8.0)
Does NOT publish (controlled by --no-publish flag)

2. Build and Publish to All Registries

# Publish to crates.io, npm, and GitHub Container Registry
cargo make publish-all

This orchestrates:

✅ Format check and tests (check-all)
✅ Build CLI binary (build-cli)
✅ Build WASM package for npm (build-npm)
✅ Build Leptos web app (build-web)
✅ Build WASM binary for OCI (build-oci)
✅ Publish kord crate to crates.io (publish-crates)
✅ Rename and publish to npm as kordweb (publish-npm)
✅ Publish to GitHub Container Registry at ghcr.io/twitchax/kord:latest (publish-oci)

3. Push Tags to GitHub

# Push the version tags created by cargo-release
git push --follow-tags

4. Create GitHub Release

🎯 Manual step: Go to GitHub Releases and:

Click "Draft a new release"
Select the tag you just pushed (e.g., v0.8.0)
Generate release notes or write your own
Attach platform binaries from CI artifacts if desired
Publish the release

Note: CI automatically builds platform binaries (Linux, Windows, macOS) and the WASM binary on every push to main, but does not automatically publish them. All publishing is manual via the steps above.

Publish Without Version Changes

If you've already bumped versions manually or want to republish:

cargo make publish-all

Individual Tasks

# Build components individually
cargo make build-cli
cargo make build-npm
cargo make build-web

# Publish individually
cargo make publish-crates
cargo make publish-npm

Publish to OCI Registry (GitHub Container Registry)

# Build the WASI binary for wasip2
cargo make build-oci

# Publish to GitHub Container Registry
cargo make publish-oci

Prerequisites:

Install the wasm32-wasip2 target: rustup target add wasm32-wasip2

Install wkg tool: cargo install wkg

Authenticate with GitHub Container Registry: docker login ghcr.io

The package will be available at ghcr.io/twitchax/kord:latest and can be run with any WASI-compatible runtime like Wasmtime or wkg.

Web Deployment

Docker

Build:

cargo make docker-build

Run:

cargo make docker-run

Fly.io

Deploy:

cargo make fly-deploy

Training Examples

Example Training Command

cargo run --bin kord --no-default-features \
  --features "cli ml_train ml_tch ml_train_precision_fp32 ml_store_precision_full ml_loader_mel ml_target_folded" \
  --release -- -q ml train \
  --backend tch \
  --training-sources samples/captured \
  --training-sources samples/slakh \
  --training-sources sim \
  --noise-asset-root samples/noise \
  --destination model \
  --model-epochs 16

Type-Check Training Configuration

cargo check --bin kord --no-default-features \
  --features "cli ml_train ml_train_precision_fp32 ml_store_precision_full ml_tch ml_loader_mel ml_target_folded"

TODO

Add APIs (and likely docs like rtz) that allow people to explore chords (may be useful for LLM). + utoipa
Add synthesizer to frontend for more pleasant audio feedback.
Add synthesizer to the website on the "play" buttons.
Add a button to allow for playing the scales in the describe page (so you can play the stacked chord, or any on of the suggested scales).
"Cm7" yields a suggested first scale of Gbadd13! during inference since the note ordering is not known. Investigate how to improve this...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Kord Development Guide

Development Setup

Testing

Platform-Specific Setup: libtorch on Windows

ML Training Configuration

Training Precision Features

Storage Precision Features

Sample Loader Features

Target Encoding Features

Example ML Feature Configurations

Training Details

Release and Publishing

Prerequisites for Publishing

Complete Release Process

1. Prepare and Tag the Release

2. Build and Publish to All Registries

3. Push Tags to GitHub

4. Create GitHub Release

Publish Without Version Changes

Individual Tasks

Publish to OCI Registry (GitHub Container Registry)

Web Deployment

Docker

Fly.io

Training Examples

Example Training Command

Type-Check Training Configuration

TODO

Uh oh!

FilesExpand file tree

DEVELOPMENT.md

Latest commit

History

DEVELOPMENT.md

File metadata and controls

Kord Development Guide

Development Setup

Testing

Platform-Specific Setup: libtorch on Windows

ML Training Configuration

Training Precision Features

Storage Precision Features

Sample Loader Features

Target Encoding Features

Example ML Feature Configurations

Training Details

Release and Publishing

Prerequisites for Publishing

Complete Release Process

1. Prepare and Tag the Release

2. Build and Publish to All Registries

3. Push Tags to GitHub

4. Create GitHub Release

Publish Without Version Changes

Individual Tasks

Publish to OCI Registry (GitHub Container Registry)

Web Deployment

Docker

Fly.io

Training Examples

Example Training Command

Type-Check Training Configuration

TODO