Skip to content

paulsengroup/hictk

Repository files navigation

hictk

License docs Ubuntu CI MacOS CI Windows CI Build Dockerfile Fuzzy testing Download from Bioconda

Zenodo DOI


hictk is a blazing fast toolkit to work with .hic and .cool files.

This repository hosts hictk: a set of CLI tools to work with Cooler, as well as libhictk: the C++ library underlying hictk.

Python bindings for libhictk are available at paulsengroup/hictkpy.

hictk is capable of reading files in .cool, .mcool, .scool and .hic format (including hic v9) as well as writing .hic, .cool and .mcool files.

Installing hictk

hictk is developed on Linux and tested on Linux, MacOS and Windows.

hictk can be installed using containers, bioconda or directly from source. Refer to Installation for more information.

Running hictk

hictk provides the following subcommands:

subcommand description
balance Balance HiC matrices using ICE, SCALE or VC.
convert Convert matrices to a different format.
dump Dump data from .hic and Cooler files to stdout.
fix-mcool Fix corrupted .mcool files.
load Build .cool and .hic files from interactions in various text formats.
merge Merge multiple Cooler or .hic files into a single file.
rename-chromosomes Rename chromosomes found in a Cooler file.
validate Validate .hic and Cooler files.
zoomify Convert single-resolution Cooler and .hic files to multi-resolution by coarsening.

Refer to Quickstart (CLI) and CLI Reference for more details.

Using libhictk

libhictk can be installed in various way, including with Conan and CMake FetchContent. Section Quickstart (API) of hictk documentation contains further details on how this can be accomplished.

Quickstart (API) also showcases the basic functionality offered by libhictk. For more complex examples refer to the sample programs under the examples/ folder as well as to the source code of hictk.

The public C++ API of hictk is documented in the C++ API Reference section of hictk documentation.

Citing

If you use hictk in you reaserch, please cite the following publication:

Roberto Rossini, Jonas Paulsen hictk: blazing fast toolkit to work with .hic and .cool files. bioRxiv 2023.11.26.568707. https://doi.org/10.1101/2023.11.26.568707

BibTex
@article {hictk,
	author = {Roberto Rossini and Jonas Paulsen},
	title = {hictk: blazing fast toolkit to work with .hic and .cool files},
	elocation-id = {2023.11.26.568707},
	year = {2023},
	doi = {10.1101/2023.11.26.568707},
	publisher = {Cold Spring Harbor Laboratory},
	URL = {https://www.biorxiv.org/content/early/2023/11/27/2023.11.26.568707},
	eprint = {https://www.biorxiv.org/content/early/2023/11/27/2023.11.26.568707.full.pdf},
	journal = {bioRxiv}
}