Skip to content
Ali Osman Berk Şapcı edited this page Nov 25, 2025 · 10 revisions

krepp is a k-mer-based maximum likelihood tool for estimating distances of reads to genomes and phylogenetic placement.

Overview

krepp comprises two main stages: indexing and querying. You can index a set of reference genomes using the krepp index subcommand, and then use an index to perform queries: distance calculation from reads to genomes (krepp dist) or phylogenetic/taxonomic placement (krepp place).

You can start by installing the tool or compiling from the source as described in Installation. A toy dataset is available in here. You can make yourself familiar with the commands and test your installation by following this little tutorial.

If you would like to use a prebuilt microbial index, see Quickstart for a brief overview of commands. We offer a wide range of indices for various datasets, which are listed in this catalogue. You can pick one that fits your needs and install it. If you would like to index from scratch using your own dataset or phylogeny, refer to Indexing reference genomes.

Detailed descriptions of options for the distance estimation (krepp dist) and phylogenetic placement (krepp place) are available in Estimating distances and Placing on a tree. krepp also offers a simpler functionality to quickly analyze query sequences against a single reference using krepp sketch and krepp seek, described here.

We plan to post tutorials for specific tasks and various use cases available in Tutorials page.

Citation

See the preprint on BioRxiv. You can use the following BibTeX entry to cite krepp:

@misc{sapci_k-mer-based_2025,
	title = {A k-mer-based maximum likelihood method for estimating distances of reads to genomes enables genome-wide phylogenetic placement.},
	copyright = {2025, Posted by Cold Spring Harbor Laboratory. The copyright holder for this pre-print is the author. All rights reserved. The material may not be redistributed, re-used or adapted without the author's permission.},
	url = {https://www.biorxiv.org/content/10.1101/2025.01.20.633730v1},
	doi = {10.1101/2025.01.20.633730},
	language = {en},
	urldate = {2025-01-27},
	publisher = {bioRxiv},
	author = {Sapci, Ali Osman Berk and Mirarab, Siavash},
	month = jan,
	year = {2025},
	note = {Pages: 2025.01.20.633730
Section: New Results},
}

Clone this wiki locally