uniwidth - Modern Unicode Width Calculation for Go

uniwidth is a modern, high-performance Unicode width calculation library for Go 1.25+. It provides 3.9-46x faster width calculation compared to existing solutions through tiered lookup optimization and Go 1.25+ compiler features.

🚀 Performance

Based on comprehensive benchmarks vs go-runewidth:

ASCII strings: 15-46x faster
CJK strings: 4-14x faster
Mixed/Emoji strings: 6-8x faster
Zero allocations: 0 B/op, 0 allocs/op

Run benchmarks yourself: cd bench && go test -bench=. -benchmem

✨ Features

🚀 3.9-46x faster than go-runewidth (proven in benchmarks)
💎 Zero allocations (no GC pressure)
🧵 Thread-safe (immutable design, no global state)
🎯 Unicode 16.0 support
🔧 Modern API (Go 1.25+, clean design)
📊 Tiered lookup (O(1) for 90-95% of cases)

📦 Installation

go get github.com/unilibs/uniwidth

Requirements: Go 1.25 or later

🔧 Usage

Basic Usage

package main

import (
    "fmt"
    "github.com/unilibs/uniwidth"
)

func main() {
    // Calculate width of a string
    width := uniwidth.StringWidth("Hello 世界")
    fmt.Println(width) // Output: 10 (Hello=5, space=1, 世界=4)

    // Calculate width of a single rune
    w := uniwidth.RuneWidth('世')
    fmt.Println(w) // Output: 2

    // ASCII-only strings are super fast!
    width = uniwidth.StringWidth("Hello, World!")
    fmt.Println(width) // Output: 13
}

Options API (NEW!)

Configure handling of ambiguous-width characters:

import "github.com/unilibs/uniwidth"

// East Asian locale (ambiguous characters are wide)
opts := []uniwidth.Option{
    uniwidth.WithEastAsianAmbiguous(uniwidth.EAWide),
}
width := uniwidth.StringWidthWithOptions("±½", opts...)
fmt.Println(width) // Output: 4 (each character is 2 columns)

// Neutral locale (ambiguous characters are narrow) - DEFAULT
opts = []uniwidth.Option{
    uniwidth.WithEastAsianAmbiguous(uniwidth.EANarrow),
}
width = uniwidth.StringWidthWithOptions("±½", opts...)
fmt.Println(width) // Output: 2 (each character is 1 column)
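
The option values above suggest the functional-options pattern. The sketch below shows how that pattern can work in general. It is not uniwidth's implementation: `config`, `option`, `withAmbiguous`, `isAmbiguous`, and the two-character ambiguous set are hypothetical stand-ins chosen for illustration.

```go
package main

import "fmt"

// ambiguousMode is a hypothetical stand-in for a setting such as EANarrow/EAWide.
type ambiguousMode int

const (
	narrow ambiguousMode = iota // ambiguous characters take 1 column (default)
	wide                        // ambiguous characters take 2 columns
)

// config collects every setting; each option mutates one field.
type config struct {
	ambiguous ambiguousMode
}

// option is a function that adjusts the config.
type option func(*config)

// withAmbiguous returns an option that selects how ambiguous characters are measured.
func withAmbiguous(m ambiguousMode) option {
	return func(c *config) { c.ambiguous = m }
}

// isAmbiguous is a deliberately tiny stand-in for a real East Asian Ambiguous table.
func isAmbiguous(r rune) bool {
	return r == '±' || r == '½'
}

// stringWidthWithOptions starts from the defaults, applies the options in
// order, then measures the string. Non-ambiguous runes count as 1 column
// here, which is a simplification for this sketch only.
func stringWidthWithOptions(s string, opts ...option) int {
	cfg := config{ambiguous: narrow}
	for _, apply := range opts {
		apply(&cfg)
	}
	total := 0
	for _, r := range s {
		if isAmbiguous(r) && cfg.ambiguous == wide {
			total += 2
		} else {
			total++
		}
	}
	return total
}

func main() {
	fmt.Println(stringWidthWithOptions("±½"))                      // 2
	fmt.Println(stringWidthWithOptions("±½", withAmbiguous(wide))) // 4
}
```

Because the defaults live in one place and each option touches one field, new settings can be added later without changing existing call sites.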

Real-World TUI Examples

// Terminal prompt
prompt := "❯ Enter command: "
width := uniwidth.StringWidth(prompt)
fmt.Printf("Prompt width: %d columns\n", width)

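// Table cell padding (needs the "strings" import)
// Clamp to zero: strings.Repeat panics on a negative count when text is wider than the cell
text := "Hello 世界"
padding := max(0, 20-uniwidth.StringWidth(text))
fmt.Printf("%s%s\n", text, strings.Repeat(" ", padding))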

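// Truncate to fit terminal width, reserving one column for the ellipsis.
// Assumes "…" renders one column wide (ambiguous characters are narrow by default).
func truncate(s string, maxWidth int) string {
    if uniwidth.StringWidth(s) <= maxWidth {
        return s // already fits, nothing to cut
    }
    if maxWidth < 1 {
        return "" // no room even for the ellipsis
    }
    width := 0
    for i, r := range s {
        w := uniwidth.RuneWidth(r)
        if width+w > maxWidth-1 {
            return s[:i] + "…"
        }
        width += w
    }
    return s
}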

Performance-Critical Code

// ASCII fast path (46x faster than go-runewidth!)
text := "Hello, World!"
width := uniwidth.StringWidth(text) // ~4.6 ns/op

// CJK fast path (14x faster!)
text = "你好世界"
width = uniwidth.StringWidth(text) // ~33.7 ns/op

// Mixed content (8x faster!)
text = "Hello 👋 World"
width = uniwidth.StringWidth(text) // ~65.9 ns/op

// All with zero allocations!

🏗️ Architecture

Tiered Lookup Strategy

uniwidth uses a multi-tier approach for optimal performance:

Tier 1: ASCII Fast Path (O(1))
- Covers ~95% of typical terminal content
- Uses simple len(s) for ASCII-only strings
- 15-46x faster than binary search

Tier 2: Common CJK & Emoji (O(1))
- Range checks for frequent characters
- CJK Unified Ideographs: 20,992 characters
- Common emoji ranges
- 4-14x faster than binary search

Tier 3: Binary Search Fallback (O(log n))
- For rare characters not in hot paths
- Minimal overhead (~5-10% of cases)
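
The three tiers can be sketched roughly as follows. This is a minimal illustration of the idea, not uniwidth's actual code: `runeWidth`, `stringWidth`, `interval`, and the four-entry `wideFallback` table are assumptions made for this example, and a real table would be generated from the full Unicode data. Emoji ranges and zero-width marks are left out to keep it short.

```go
package main

import (
	"fmt"
	"sort"
)

// interval is a closed range of code points [lo, hi].
type interval struct{ lo, hi rune }

// wideFallback is a tiny illustrative table of wide ranges, sorted by lo.
var wideFallback = []interval{
	{0x1100, 0x115F}, // Hangul Jamo initial consonants
	{0x3041, 0x3096}, // Hiragana
	{0xAC00, 0xD7A3}, // Hangul syllables
	{0xFF01, 0xFF60}, // Fullwidth forms
}

// runeWidth resolves a width by trying the cheapest check first.
func runeWidth(r rune) int {
	// Tier 1: ASCII. Control characters take no columns, the rest take one.
	if r < 0x80 {
		if r < 0x20 || r == 0x7F {
			return 0
		}
		return 1
	}
	// Tier 2: a single range check for CJK Unified Ideographs,
	// U+4E00..U+9FFF (20,992 code points).
	if r >= 0x4E00 && r <= 0x9FFF {
		return 2
	}
	// Tier 3: binary search over the sorted fallback table.
	i := sort.Search(len(wideFallback), func(i int) bool {
		return wideFallback[i].hi >= r
	})
	if i < len(wideFallback) && wideFallback[i].lo <= r {
		return 2
	}
	return 1
}

// stringWidth sums the per-rune widths.
func stringWidth(s string) int {
	total := 0
	for _, r := range s {
		total += runeWidth(r)
	}
	return total
}

func main() {
	fmt.Println(stringWidth("Hello 世界")) // 10
	fmt.Println(stringWidth("こんにちは"))    // 10
}
```

Ordering the checks by expected frequency means most runes exit after one or two comparisons, and only rare characters pay for the search.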

Go 1.25+ Optimizations

SIMD Auto-Vectorization: ASCII detection uses SSE2/AVX2
Aggressive Inlining: Hot paths compile to minimal instructions
Zero Allocations: No heap allocations, no GC pressure
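
As an illustration of the ASCII detection step, the sketch below uses a branch-free loop that ORs all bytes together and tests the high bit once. The names `isASCII` and `asciiWidth` are hypothetical, and whether a given compiler version turns this loop into SIMD instructions is not guaranteed; the sketch only shows the check itself. It also assumes the input holds printable characters, since len(s) counts control characters as one column each.

```go
package main

import "fmt"

// isASCII reports whether every byte of s is below 0x80.
// Accumulating with OR keeps the loop body free of branches.
func isASCII(s string) bool {
	var acc byte
	for i := 0; i < len(s); i++ {
		acc |= s[i]
	}
	return acc < 0x80
}

// asciiWidth returns len(s) and true when s is pure ASCII.
// Otherwise it returns false so the caller can fall back to
// a per-rune calculation.
func asciiWidth(s string) (int, bool) {
	if !isASCII(s) {
		return 0, false
	}
	return len(s), true
}

func main() {
	fmt.Println(asciiWidth("Hello, World!")) // 13 true
	fmt.Println(asciiWidth("Hello 世界"))      // 0 false
}
```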

📊 Benchmarks

BenchmarkStringWidth_ASCII_Short_Uniwidth-12     149590729   9.500 ns/op   0 B/op   0 allocs/op
BenchmarkStringWidth_ASCII_Short_GoRunewidth-12   10065044  150.1 ns/op   0 B/op   0 allocs/op
                                                             ^^^^^^^^^^
                                                             15.8x faster!

BenchmarkStringWidth_CJK_Short_Uniwidth-12        19064941   63.64 ns/op   0 B/op   0 allocs/op
BenchmarkStringWidth_CJK_Short_GoRunewidth-12      2771077  368.0 ns/op   0 B/op   0 allocs/op
                                                             ^^^^^^^^^^^
                                                             5.8x faster!

Run benchmarks yourself:

go test -bench=. -benchmem
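
To check the zero-allocation claim for a width function in your own code, the standard library's testing.AllocsPerRun and testing.Benchmark can be called from an ordinary program. The sketch below measures a stand-in function: `standInWidth` and `allocsPerOp` are hypothetical names for this example, and in practice you would call uniwidth.StringWidth instead. No timing output is shown because it depends on the machine.

```go
package main

import (
	"fmt"
	"testing"
)

// sink keeps the result alive so the compiler cannot discard the call.
var sink int

// standInWidth is a simplified stand-in for the function under test.
// It counts CJK Unified Ideographs as 2 columns and everything else as 1.
func standInWidth(s string) int {
	n := 0
	for _, r := range s {
		if r >= 0x4E00 && r <= 0x9FFF {
			n += 2
		} else {
			n++
		}
	}
	return n
}

// allocsPerOp reports the average number of heap allocations per call.
func allocsPerOp(s string) float64 {
	return testing.AllocsPerRun(1000, func() {
		sink = standInWidth(s)
	})
}

func main() {
	const input = "Hello 世界"

	fmt.Printf("allocs/op: %.0f\n", allocsPerOp(input))

	// testing.Benchmark runs the function with an adaptive b.N,
	// the same way "go test -bench" does.
	res := testing.Benchmark(func(b *testing.B) {
		b.ReportAllocs()
		for i := 0; i < b.N; i++ {
			sink = standInWidth(input)
		}
	})
	fmt.Printf("%d ns/op, %d allocs/op\n", res.NsPerOp(), res.AllocsPerOp())
}
```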

🎯 Use Cases

Perfect for:

TUI frameworks (terminal rendering hot paths)
Terminal emulators (text layout calculations)
CLI tools (table alignment, formatting)
Text editors (cursor positioning, column calculation)
Any high-performance text width calculation

🔄 Migration from go-runewidth

uniwidth provides a compatible API for easy migration:

// Before (go-runewidth)
import "github.com/mattn/go-runewidth"
width := runewidth.StringWidth(s)

// After (uniwidth) - drop-in replacement!
import "github.com/unilibs/uniwidth"
width := uniwidth.StringWidth(s)

Performance improvement: 3.9-46x faster, with no code changes beyond the import path and package name!

📚 Documentation

API Reference - Full godoc documentation
Benchmark Comparisons - Performance comparison vs go-runewidth
Architecture Design - Technical deep dive & design decisions
Changelog - Version history & upgrade guide

🧪 Testing

# Run tests
go test -v

# Run benchmarks
go test -bench=. -benchmem

# Run with coverage
go test -cover

Current test coverage: 90.3% (exceeds 90% target ✅)

🚀 Development Status

Current: v0.1.0 (Stable Release)

✅ Stable Release: This library has completed beta testing. The API is stable and ready for production use. Minor version updates (v0.2.x) will maintain backward compatibility.

What Beta Means:

✅ Feature-complete for core functionality
✅ Production-quality code and performance
⚠️ API may evolve based on community feedback
⚠️ Edge cases still being discovered and fixed
🎯 Goal: API freeze before v1.0.0-rc

Completed:

✅ PoC (3 days) - 3.9-46x speedup proven
✅ Complete Unicode 16.0 tables - Generated from official data
✅ Options API - East Asian Width & emoji configuration
✅ Comprehensive testing - 84.6% coverage, fuzzing, conformance tests
✅ Bug fixes - Variation selectors, regional indicator flags
✅ Documentation - README, ARCHITECTURE, CHANGELOG

Beta Goals (Before RC):

Future Roadmap (v1.0+):

Grapheme cluster support (for complex emoji ZWJ sequences)
Additional locale support
Extended SIMD optimizations
Profile-Guided Optimization (PGO)

🤝 Contributing

Contributions welcome! This is part of the unilibs organization - modern Unicode libraries for Go.

📄 License

MIT License - see LICENSE file

🌟 Related Projects

Built by the Phoenix TUI Framework team.

Part of the unilibs ecosystem:

uniwidth - Unicode width calculation (this project)
unigrapheme - Grapheme clustering (planned)
More Unicode utilities coming soon!

📞 Support

Issues: GitHub Issues
Discussions: GitHub Discussions

🙏 Special Thanks

Professor Ancha Baranova - This project would not have been possible without her invaluable help and support. Her assistance was crucial in bringing uniwidth to life.

Made with ❤️ by the Phoenix team | Powered by Go 1.25+
