Section: Genomics
Topic: Cell biology, Ecology, Genetics/genomics

Chromosome-level reference genome assembly for the mountain hare (Lepus timidus)

Corresponding author(s): Pohjoismäki, Jaakko L. O. (jaakko.pohjoismaki@uef.fi)

10.24072/pcjournal.514 - Peer Community Journal, Volume 5 (2025), article no. e14.

Get full text PDF Peer reviewed and recommended by PCI
article image

Abstract

We present here a high-quality genome assembly of a male mountain hare (Lepus timidus Linnaeus), from Ilomantsi, Eastern Finland, utilizing an isolated fibroblast cell line as the source for high quality DNA and RNA. Following the previously published brown hare reference genome assembly, the mountain hare is the second Finnish pilot species for the European Reference Genome Atlas (ERGA) initiative, a collaborative effort to generate reference genomes for European biodiversity. The genome was assembled using 21× PacBio HiFi sequencing data and scaffolded using the Hi-C chromosome structure capture approach. After manual curation, the primary assembly length was 2,695,305,354 bp with N50 125,755,317 bp. The largest scaffold was 181 Mbp and the scaffold N50 127 Mbp, contributing to a primary assembly consisting of 85 scaffolds and an alternate assembly with 109 scaffolds. The scaffolds include 23 autosomes, numbered according to their size, as well as X and Y chromosomes, matching the known karyotype. Telomeric regions were present on at least one end of 19 of the chromosomes. The genome has a high degree of completeness based on the BUSCO score (mammalia_odb10 database), Complete: 95.1 % [Single copy: 92.3 %, Duplicated: 2.7 %], Fragmented 0.8 %, and Missing 4.1 %. The mitochondrial genome of the cell line was sequenced and assembled separately. The assembly meets the Earth BioGenome Project criteria for a reference-standard genome assembly. Compared to the previous pseudo-reference genome assembly of  L. timidus ssp. hibernicus Bell, assembled using the rabbit genome, this new reference genome represents the nominate subspecies and the species-specific chromosomal conformation. The published genome assembly will provide a solid foundation for future genomic research on Lagomorpha, including the insights into the genomic basis of adaptations to snowy and cold environments. Furthermore, it opens opportunities for experimental analysis of mountain hare gene functions.

Metadata
Published online:
DOI: 10.24072/pcjournal.514
Type: Research article
Mots-clés : Lagomorpha, reference genome, chromosome structure, phylogeny, fibroblast cell line

Fekete, Zsófia 1, 2; Absolon, Dominic E. 3; Michell, Craig 4; Wood, Jonathan M. D. 3; Goffart, Steffi 1; Pohjoismäki, Jaakko L. O. 1

1 University of Eastern Finland, Department of Environmental and Biological Sciences – Joensuu, Finland
2 Hungarian University of Agriculture and Life Sciences, Institute of Genetics and Biotechnology - Gödöllő, Hungary
3 Tree of Life, Wellcome Sanger Institute - Hinxton, Cambridge, UK
4 King Abdullah University of Science and Technology, Red Sea Research Center, Biological and Environmental Science and Engineering Division – Thuwal, Kingdom of Saudi Arabia
License: CC-BY 4.0
Copyrights: The authors retain unrestricted copyrights and publishing rights
@article{10_24072_pcjournal_514,
     author = {Fekete, Zs\'ofia and Absolon, Dominic E. and Michell, Craig and Wood, Jonathan M. D. and Goffart, Steffi and Pohjoism\"aki, Jaakko L. O.},
     title = {Chromosome-level reference genome assembly for the mountain hare {(\protect\emph{Lepus} timidus})},
     journal = {Peer Community Journal},
     eid = {e14},
     publisher = {Peer Community In},
     volume = {5},
     year = {2025},
     doi = {10.24072/pcjournal.514},
     language = {en},
     url = {https://peercommunityjournal.org/articles/10.24072/pcjournal.514/}
}
TY  - JOUR
AU  - Fekete, Zsófia
AU  - Absolon, Dominic E.
AU  - Michell, Craig
AU  - Wood, Jonathan M. D.
AU  - Goffart, Steffi
AU  - Pohjoismäki, Jaakko L. O.
TI  - Chromosome-level reference genome assembly for the mountain hare (Lepus timidus)
JO  - Peer Community Journal
PY  - 2025
VL  - 5
PB  - Peer Community In
UR  - https://peercommunityjournal.org/articles/10.24072/pcjournal.514/
DO  - 10.24072/pcjournal.514
LA  - en
ID  - 10_24072_pcjournal_514
ER  - 
%0 Journal Article
%A Fekete, Zsófia
%A Absolon, Dominic E.
%A Michell, Craig
%A Wood, Jonathan M. D.
%A Goffart, Steffi
%A Pohjoismäki, Jaakko L. O.
%T Chromosome-level reference genome assembly for the mountain hare (Lepus timidus)
%J Peer Community Journal
%D 2025
%V 5
%I Peer Community In
%U https://peercommunityjournal.org/articles/10.24072/pcjournal.514/
%R 10.24072/pcjournal.514
%G en
%F 10_24072_pcjournal_514
Fekete, Zsófia; Absolon, Dominic E.; Michell, Craig; Wood, Jonathan M. D.; Goffart, Steffi; Pohjoismäki, Jaakko L. O. Chromosome-level reference genome assembly for the mountain hare (Lepus timidus). Peer Community Journal, Volume 5 (2025), article  no. e14. doi : 10.24072/pcjournal.514. https://peercommunityjournal.org/articles/10.24072/pcjournal.514/

PCI peer reviews and recommendation, and links to data, scripts, code and supplementary information: 10.24072/pci.genomics.100377

Conflict of interest of the recommender and peer reviewers:
The recommender in charge of the evaluation of the article and the reviewers declared that they have no conflict of interest (as defined in the code of conduct of PCI) with the authors or with the content of the article.

Full text

The full text below may contain a few conversion errors compared to the version of record of the published article.

Introduction

The mountain hare (Lepus timidus Linnaeus, 1758), including several subspecies, is the most widespread hare species in the world, with its distribution ranging from Ireland to Japan (Angerbjörn & Schai-Braun, 2022). The species is adapted to the cold and snowy conditions of northern Eurasia by having wide snowshoe feet and white winter pelage. The ongoing climate change-induced shortening of the snow-covered season is detrimental for the mountain hare, especially due to camouflage mismatch (Zimova et al., 2016; Zimova et al., 2020), resulting in range contraction to the North or higher altitudes in mountains, while benefitting the expansion of its more southern relative, the brown hare (L. europaeus) (Thulin, 2003; Jansson & Pehrson, 2007; Reid, 2011; Levänen et al., 2018a; Rehnus et al., 2018).

The mountain hare has shown resilience during the past warm periods of the Holocene (Smith et al., 2017), diverging also to more temperate climate-adapted subspecies (Giska et al., 2022). Notable examples include the Irish hare (L. t. hibernicus Bell, 1837) and the heath hare (L. t. sylvaticus Nilsson, 1831), which are likely relatively old evolutionary lineages that are genetically distinct from the widespread nominate subspecies (Giska et al., 2022; Michell et al., 2022). In addition, post-glacial contact with other congeneric hare species has resulted in recurrent hybridization and genetic introgression (Melo-Ferreira et al., 2009; Levänen et al., 2018b), which appears to have adaptive significance (Giska et al., 2019; Pohjoismäki et al., 2021; Giska et al., 2022).

Besides the fact that geographically restricted, relict-like subspecies are threatened (Caravaggi et al., 2017; Thulin et al., 2021) and thus interesting in the context of conservation genetics, hares provide ample opportunities to study local adaptations (Giska et al., 2022; Michell et al., 2022), genomic introgression (Levänen et al., 2018b; Giska et al., 2019; Pohjoismäki et al., 2021) and genetic constituents of species boundaries (Gaertner et al., 2023). Such studies are greatly facilitated by the availability of reference-level genome assemblies (Michell et al., 2024).

Relatively recent advances in sequencing technologies, so-called third generation sequencing or high-throughput sequencing of long molecules, has provided affordable means to produce highly continuous genome assemblies. Combined with techniques for sequence contact maps for chromosomal scaffolding, these methods enable reference-level genome assemblies for almost any species (Lawniczak et al., 2022), a level previously achieved only by a few model organisms. This in turn has resulted in emergence of several sequencing initiatives to produce reference genome assemblies for various taxa, including the European Reference Genome Atlas (ERGA) and the Darwin Tree of Life (DToL) (Blaxter et al., 2022). In the presented work, we use these methods to provide a reference-level sequence assembly for the mountain hare. Together with our previously released brown hare genome assembly (Michell et al., 2024), the mountain hare genome represents another Finnish contribution to the ERGA pilot project, aimed to demonstrate the feasibility of a continent-wide collaboration to sequence and assemble reference genomes for local species. Through resource and knowledge sharing, this collaborative approach not only accelerates the creation of reference-grade genomes but also enhances biological discovery, benefiting the broader scientific community (Mc Cartney et al., 2023).

Since long-molecule sequencing relies on intact, high-molecular-weight DNA, requiring fresh or flash-frozen tissue samples, we used a living fibroblast cell line derived from a male mountain hare specimen from Ilomantsi, Eastern Finland (Gaertner et al., 2023), as the DNA source. The sex of the specimen was important to cover the Y-chromosome. The sampling locality was chosen outside of the brown hare range to minimize the probability for hybridization (Levänen et al., 2018a; Levänen et al., 2018b). Furthermore, the sampled population represents a continuum across the vast eastern Taiga zone, making our sample very representative geographically for the nominate mountain hare subspecies. It is worth noting that Linné did not formally designate a type specimen for the mountain hare, and the type locality has been later decided to be Uppsala in Sweden (Angerbjörn & Schai-Braun, 2022). The Swedish hares belong to the same larger, geographically connected Fennoscandian population as our specimen (Levänen et al., 2018b; Michell et al., 2022). This is important because sampling from or near the type locality is considered desirable for the representativeness of the reference genome assembly (Lawniczak et al., 2022).

Following the same approach as with our brown hare reference genome assembly (Michell et al., 2024), we assembled a highly complete genome of the nominate mountain hare subspecies (Lepus timidus timidus) using 21× genome coverage of PacBio HiFi read data. The 2.69 Gbp assembly was further scaffolded with Hi-C sequencing data to chromosome-level, including all of the expected 23 autosomes, and X and Y sex chromosomes (2n = 48) (Gustavsson, 1972). Telomeric regions were present on at least one end in 19 of the assembled chromosomes, further demonstrating the high continuity of the assembly.

While there is an existing genome assembly for the mountain hare (GCA_009760805), it represents the Irish subspecies L. t. hibernicus (hereafter the “Irish hare”) and is based on a female specimen, thus lacking the Y-chromosome. Also, the Irish hare genome assembly can be considered to be a “pseudoreference”, as it has been scaffolded using the rabbit (Oryctolagus cuniculus L.) reference genome assembly (Marques et al., 2020), and retains the rabbit 2n = 44 karyotype (Korstanje et al., 1999). Compared to the previous, our genome assembly represents a significant improvement both in continuity and scaffolding, providing a robust reference for future studies on adaptation, hybridization, and conservation of the mountain hare. Additionally, it can offer insights into the evolutionary history of hares, genetic basis of physiological adaptations, as well as expand the molecular toolkit for experimental investigation of genes and their functions.

Methods

Sampling and confirming of the species identity

A young male mountain hare was sampled during a routine hunt by the last author in October 2018 at Kelovaara, Ilomantsi, Finland (63.0609 N, 30.6248 E). The sampling had minimal impact on the local mountain hare population and no impact on the habitat. As mountain hares are legal game animals in Finland and the hunting followed the regional hunting seasons and legislation (Metsästyslaki [Hunting law] 1993/615/5§), the sampling adheres to the ARRIVE guidelines and no ethical assessment for the sampling was required. Sampling also did not involve activities governed by the Convention on International Trade in Endangered Species of Wild Fauna and Flora (CITES) or other export of specimens, as defined by the Convention on Biological Diversity (CBD).

The species identity was confirmed at site based on the morphological features distinguishing the mountain hare from the brown hare, the only other hare species in northern Europe. The collection location is remote wilderness, dominated by boreal Taiga Forest (Figure 1A) and devoid of brown hares, which in Finland are dependent on man-made environments (Levänen et al., 2018a). Further analysis of the coding part of the genome and mitochondrial DNA haplotyping showed no ancestral admixture with brown hares (Gaertner et al., 2023).

Generation and vouchering of the cell line

We generated the fibroblast cell line (LT1) from a skin biopsy obtained from the specimen as described earlier (Gaertner et al., 2023). The primary cells were subsequently immortalized through SV40 large T-antigen transformation to enable their long-term maintenance for experimental use. The resulting cell line has been deposited under voucher number ZFMK-TIS-69753 in the biobank of the Stiftung Leibniz-Institut zur Analyse des Biodiversitätswandels, Zoological Research Museum Alexander König (ZFMK), Bonn, Germany, ensuring accessibility for future research. For genome sequencing and assembly, DNA was extracted from cells at passage number 10, as counted from the initial isolation of the primary cells. Typically, cell lines with under 20 passages are considered to have a low passage number (ATCC, 2024) and thus also having low risk of genomic changes associated with prolonged cell culture.

High molecular weight DNA extraction and PacBio HiFi sequencing

High molecular weight DNA was purified using the classical phenol:chloroform:IAA extraction method (Sambrook & Russell, 2006). Briefly, the cells were grown to confluency on a 10 cm cell culture dish and detached using trypsin, followed by centrifugation and two washes with PBS. The cell pellet was resuspended in 2 ml of DNA extraction buffer (25 mM EDTA pH 8.0, 75 mM NaCl, 1 SDS and RNAseA), followed by Proteinase K digestion and phenol extraction, as detailed in the protocol (Sambrook & Russell, 2006). As done previously (Michell et al., 2024), the sequencing library preparations and sequencing on the PacBio Sequel II was performed in the DNA Sequencing and Genomics Laboratory, Institute of Biotechnology, University of Helsinki. Briefly, the DNA was quantified using Qubit and the fragment length was assessed with the Fragment Analyzer (HS large fragment kit) and then sheared to a length of 24 kb using the Megaruptor 3 (Diagenode) (45/200/30). Following this, the buffer was replaced with PacBio’s Elution buffer using AMPure beads. The DNA underwent repair and A-tailing, adapter ligation, nuclease treatment, and cleanup with SMRTbell cleanup beads, utilizing the SMRTbell prep kit 3.0. Fragments larger than 10,000 bp were purified with the BluePippin (Sage Science) (0.75% DF Marker S1 high pass 6-10kb v3), and the DNA was further purified using PacBio AMPure beads. It was then treated with DNA damage repair mix (PacBio) at 37°C for 60 minutes, followed by another purification with AMPure cleanup beads and elution into 11 µl of PacBio’s Elution buffer. Libraries were sequenced on two SMRT flow cells of the PacBio Sequel II at a concentration of 90 pM according to PacBio’s instructions, as provided by the SMRTlink software.

Mitochondrial DNA (mtDNA) sequencing

The mitochondrial genome of this hare cell line has been sequenced for the purposes of another study by PCR amplification of approximately 2 kb overlapping mtDNA fragments (Tapanainen et al., 2024) (GenBank accession # OR915850).

Hi-C library preparation

As with the brown hare genome assembly (Michell et al., 2024), Hi-C sequencing libraries were prepared following the protocol of (Belaghzal et al., 2017) with the following changes: 1.) The Hi-C protocol was performed in triplicates to produce a diverse sequencing library; 2.) NEBNext Ultra II FS DNA module was used for size fractionation; 3.) the NEBNext Ultra II library preparation kit for Illumina was utilized to obtain the sequencing libraries and 4.) triplicate PCR reactions with six cycles of PCR were used for library enrichment. The PCR reactions were purified using Ampure XP beads at a ratio of 0.9X. The final clean libraries were quantified using Qubit, followed by agarose gel electrophoresis to confirm the fragment size. The sequencing was performed on a single lane of the Illumina NovaSeq 6000 using the SP flowcell with paired-end chemistry 2 x 150bp.

Genome assembly

Quality of the sequencing data was assessed by FastQC v0.11.9 (Andrews, 2010), and cutadapt v4.6 (Martin, 2011) was used to remove any adapters or low-quality sequences. Prior to assembly, k-mer profiling was done using Meryl v1.3 (https://github.com/marbl/meryl) and GenomeScope v2.0 (Vurture et al., 2017; Ranallo-Benavidez et al., 2020), and assembly parameters were adjusted based on the expected genome size and coverage.

Figure 1 - A) The geographic location in Finland and a view of the sampling biotope in Ilomantsi. Photo by Jaakko Pohjoismäki. B) A male mountain hare caught in the same region. Although the cell line used in this study originates from a killed specimen, fibroblasts can be isolated from sources such as ear clippings taken during tagging of animals for behavioral studies. Photo by Mervi Kunnasranta. C) Phase contrast microscopic image of the LT1 cell line used for the genome sequencing and assembly. The cells exhibit a typical fibroblast morphology. Photo by Steffi Goffart.

HiFiasm version 0.16.1 (Cheng et al., 2021) was used to assemble the PacBio HiFi reads using the arguments -m 10000000 -p 100000 –hom-cov 20, and –hic1 –hic2 to integrate the Hi-C read data and produce two raw assemblies. Statistics and k-mer profiles of both assemblies were checked using gfastats v1.3.6 (Formenti et al., 2022) and Merqury v1.3 (Rhie et al., 2020), and the most complete assembly was chosen as primary, with the other kept as alternate. Duplicate contigs were removed using purge_dups v1.2.6 (Guan et al., 2020) from both assemblies. We then continued with the scaffolding of the assemblies. To process the HiC data, we followed the base pipeline by ArimaGenomics (https://github.com/ArimaGenomics/mapping_pipeline). Briefly, we mapped the HiC data to the genome assemblies using bwa-mem2 v2.2.1 (Vasimuddin et al., 2019). The mapped reads were then parsed and filtered using samtools v1.18 (Danecek et al., 2021) and the scripts provided on the ArimaGenomics github page above, and duplicate reads were removed using the MarkDuplicates tool of the GATK toolkit v4.3.0.0 (O'Connor & van der Auwera, 2020).

For scaffolding, YaHS v1.2 (Zhou et al., 2023) was run using parameters -e GATC -l 5000 –no-contig-ec for the assembly and the filtered Hi-C bam file. Contiguity and general genome statistics were calculated using QUAST v5.2.0 (Mikheenko et al., 2018). We assessed the completeness of the genome by calculating the number of complete single copy orthologs with BUSCO v5.5.0 (Manni et al., 2021), using both the glires_odb10 and mammalia_odb10 databases.

Telomere and repeat annotation

Repeat annotation of the genome was performed with RepeatModeler v2.0.5 (Flynn et al., 2020). Using the repeat library produced by RepeatModeler, we masked the scaffolded genome using RepeatMasker v4.1.6 (Smit et al., 2015).

Telomeric sequences were identified using the Telomere Identification ToolKit (tidk v0.2.0) by first running tidk explore to identify potential telomeric repeat sequences, and then tidk search with the sequence of “AACCCT”.

Manual curation

The assembled and annotated genome was manually curated to further improve its quality as described in (Howe et al., 2021) using PretextView https://github.com/sanger-tol/PretextView/ and the rapid curation workflow https://gitlab.com/wtsi-grit/rapid-curation. The manual curation allows the identification and fixing of erroneous scaffold assemblies and contig duplications.

Comparison with previous assembly

We performed a comparison of our scaffolded assembly to the current L. timidus genome assembly (NCBI Accession number: GCA_009760805) as well as the L. europaeus reference genome assembly (NCBI accession number: GCF_033115175.1). Mapping to the genome was performed using minimap2 version 2.21 (Li, 2018) with the arguments -asm5. A dot plot of the alignment was created using the R script pafCoordsDotPlotly.R (https://github.com/tpoorten/dotPlotly).

Results

Genome assembly

The expected haploid genome size of L. timidus is 3.1785 Gbp (Vinogradov, 1998), containing 23 autosomal chromosomes as well as X and Y sex chromosomes (Gustavsson, 1972). PacBio HiFi sequencing produced reads with sequence N50 – the length of the shortest read at 50% of the total sequence length - of 19.97 kb with 21× coverage. Based on the PacBio HiFi data, the expected genome size using k-mer length k =28 is 2.65 Gbp (Figure 2A). The data enabled haplotype-phasing of a primary and an alternative assembly (Figure 2B). The illumina sequencing of the Hi-C data produced 995,011,507 paired reads representing about 56 × coverage of the genome. The duplication rate of the Hi-C data was 18 %. Assembly with HiFiasm yielded a primary contig assembly of 2,838,807,612 bp made up of 2332 contigs and an alternate assembly of 2,654,293,704 bp of 1989 contigs (The longest contigs for the assemblies were 33 Mbp and 26.6 Mbp, and the contig N50s 4.7 Mbp and 4.8 Mbp, respectively. Using the uniquely mapped Hi-C data, we were able to scaffold the contigs and fix misassembled contigs. The primary Hi-C scaffolded assembly was 2,654,826,856 bp in size, the largest scaffold was 181 Mbp and the scaffold N50 127 Mbp.

Figure 2 - An overview of the mountain hare genome assembly. A) K-mer frequency and coverage distribution obtained from the GenomeScope analysis. The observed k-mers (blue lined area) include any and all k-mers observed. K-mers close to coverage 1 are putative errors (orange outline) due to technical errors in sequencing. Dotted lines show peaks for each ploidy: at the haploid, or heterozygous peak, unique k-mers were found enough times for haploid coverage (10.5), at the diploid, or homozygous peak, k-mers were found enough times for diploid coverage (21). The triploid and tetraploid coverage is also marked, although there are no obvious peaks there, as expected for a known diploid species. K-mers at this location in a diploid assembly represent duplicated heterozygous and duplicated homozygous regions. B) k-mer distribution profile after the first assembly step with Hifiasm showing also the amount of unique k-mers for the primary (01) and alternative (02) assemblies. Shared k-mers are found in both assemblies, while read-only k-mers are sequencing errors, and not found in either assembly. C) Contact map of Hi-C scaffolding of the primary assembly after manual curation, generated with PretextView.

To further improve the assembly, manual curation was performed at the Sanger Institute using Hi-C maps (Figure 2C). The manual curation changed the scaffold N50 of 127.6 Mb to 125.8 Mb and caused a reduction in scaffold count of 103 to 85 in the primary and 128 to 109 in the alternate assembly. Of the finalized assembly, 93.16 % could be assigned to 25 identified chromosomes (23 autosomes plus X and Y) (Table 1). Autosomes were named according to their size. After decontamination and manual curation, the scaffold counts are 85 for the primary, and 109 for the alternate assembly. Detailed final assembly statistics are shown in Table 2. The curated genome length was 2,695,305,354 bp. The mitochondrial genome assembly and the annotation of the functional loci have been presented elsewhere (Tapanainen et al., 2024).

Table 1 - Sequence assignment of the primary mountain hare genome assembly.

  Length % of total length Count
Autosomes 2,519,453,057 93.48 23
Unlocalized on autosomes 1,279,896 0.05 1
Other scaffolds 11,202,567 0.42 58
X 139,721,860 5.18 1
Y 24,927,870 0.92 1
MT 17,482   1
TOTAL 2,695,305,354   85

The BUSCO scores of the L. timidus assembly suggest that it is near-complete, with the following results: Complete: 95.1 % [Single copy: 92.3 %, Duplicated: 2.7 %] Fragmented 0.8 %, and Missing 4.1 % based on the mammalia_odb10 database of mammalian orthologs and Complete: 93.2% [Single copy: 90.6 %, Duplicated: 2.7 %], Fragmented 1.1 % and Missing 5.6 % based on the glires_odb10 database of lagomorph orthologs. The categories refer to if a gene’s ortholog, that was expected to be found in a single copy, was found completely, partially (fragmented), in a single copy or in multiple copies, or if it was completely missing. The number of total groups searched were 9226 and 13798, respectively (Table 3, Figure 3). T-antigen vector insertions were detected on the chromosomes 17 and 19. The insertions were in intergenic regions and have been replaced with equal length of Ns in the final assembly. The SV40 T-antigen-containing vector is a recombinant expression construct used to immortalize the fibroblast cell lines (Gaertner et al., 2023). Immortalization simplifies handling for subsequent experiments and minimizes phenotypic variation between cell lines but is not required for genomic work.

Repeat detection with RepeatModeler produced a curated custom repeat library of 28,386 unique repetitive elements classified into 809 repeat families. Interestingly, the proportion of the genome masked as repetitive elements was very high, at 42.35 % of the genome, similar to the situation observed with our brown hare genome assembly (46.8 %) (Michell et al., 2024). In contrast, in the Irish hare genome assembly (GCA_009760805), only 28 % of the genome was masked (Marques et al., 2020). The k-mer based estimate of the repetitive elements in our mountain hare was 23.0 %, which is slightly less than the 26.7 % observed for the brown hare (Michell et al., 2024).

Telomeric sequences were found at both ends of 9 chromosomes, and only on one end of 10 more chromosomes (Figure 4). More than half of the chromosomes (13) also contain telomeric elements not only at their ends but also at various internal locations.

The genome assemblies can be accessed via BioSample accession SAMN41430840 as well as BioProject accession PRJNA1112569 for the primary assembly and raw data of the PacBio and Hi-C sequencing, and PRJNA1112568 for the alternative assembly.

Table 2 - Assembly statistics for the Hi-C scaffolded, post-curation sequencing data for the mountain hare reference genome.

Assembly statistics Primary Alternate
# scaffolds 85 109
Total scaffold length 2,695,305,354 2,556,802,277
Average scaffold length 31,709,474.75 23,456,901.62
Scaffold N50 125,755,317 126,936,470
Scaffold auN 125,197,910.08 126,023,208.11
Scaffold L50 9 9
Largest scaffold 182,608,553 185,135,719
Smallest scaffold 10,771 21,369
# contigs 1,414 1,454
Total contig length 2,695,169,802 2,556,665,250
Average contig length 1,906,060.68 1,758,366.75
Contig N50 4,885,811 5,114,768
Contig auN 6,632,874.02 6,902,786.36
Contig L50 157 136
Largest contig 32,947,725 26,587,001
Smallest contig 967 21,369
# gaps in scaffolds 1,329 1,345
Total gap length in scaffolds 135,552 137,027
Average gap length in scaffolds 102.00 101.88
Gap N50 in scaffolds 100 100
Gap auN in scaffolds 104.10 103.78
Gap L50 in scaffolds 650 660
Largest gap in scaffolds 200 200
Smallest gap in scaffolds 26 27
Base composition (A:C:G:T)    
A 758,096,831 716,813,053
C 589,442,155 561,406,584
G 589,603,553 561,320,925
T 758,027,263 717,124,688
GC content % 43.75 43.91
# soft-masked bases 0 0
# segments 1,414 1,454
Total segment length 2,695,169,802 2,556,665,250
Average segment length 1,906,060.68 1,758,366.75
# gaps 1,329 1,345
# paths 85 109

Table 3 - BUSCO completeness statistics for the mountain hare reference genome assembly.

BUSCO Primary Alternate
Mammalia Count % Count %
Complete BUSCOs (C) 8,770 95.06 8,601 93.23
Complete and single-copy BUSCOs (S) 8,517 92.32 8,378 90.81
Complete and duplicated BUSCOs (D) 253 2.74 223 2.42
Fragmented BUSCOs (F) 76 0.82 84 0.91
Missing BUSCOs (M) 380 4.12 541 5.86
Total BUSCO groups searched 9,226   9,226  
Glires        
Complete BUSCOs (C) 12,863 93.22 12,553 90.98
Complete and single-copy BUSCOs (S) 12,495 90.56 12,226 88.61
Complete and duplicated BUSCOs (D) 368 2.67 327 2.37
Fragmented BUSCOs (F) 157 1.14 149 1.08
Missing BUSCOs (M) 778 5.64 1096 7.94
Total BUSCO groups searched 13,798   13,798  

Comparison to previous assemblies

Minimap2 was able to align 99.95 % (97.57 % primary mapped) of the contig sequences from the Irish hare genome assembly with the presented mountain hare genome assembly (Figure 5A). The two assemblies have a high level of sequence similarity as well as a high degree of synteny. As pointed out earlier, the Irish hare genome assembly retains the chromosomal arrangement of the rabbit genome (Beklemisheva et al., 2011) used for its scaffolding (Marques et al., 2020). As a result, a few notable differences are present in the chromosome assignments, which were also noted when comparing the Irish hare assembly with the brown hare assembly (Michell et al., 2024). Specifically, hare chromosomes Chr7 and Chr12 correspond to Chr1 of the rabbit genome, and chromosomes Chr13 and Chr16 correspond to rabbit Chr2 (Figure 5A), explaining the karyotype difference (2n = 44 vs. 2n = 48).

Consequently, the brown hare genome (Michell et al. 2024, NCBI RefSeq
GCF_033115175.1) shows an even higher degree of synteny with our mountain hare genome assembly (Figure 5B), with Minimap2 aligning 99.87 % (78.77 % primary mapped) of the sequences. However, while the visual representation of the two genomes does not show major rearrangements, the relatively lower percentage of primary mapped contigs with minimap2 also indicates a higher occurrence of chimeric alignments, suggesting minor differences in chromosome sequence organization. Despite the high levels of synteny, the sequence similarity between the brown hare and mountain hare assemblies is lower than that between the mountain hare subspecies. This is illustrated by the sequence identity colors in the alignments (Figure 5A vs. B). The mean sequence similarity, as calculated from the number of matches between each of the aligned scaffolds, is 96.77 % for the mountain hare vs. Irish hare and 93.09 % for the mountain hare vs. brown hare.

Figure 3 - A snail plot summary of assembly statistics for the final, manually curated 2,695,305,354 bp primary assembly. The circumference of the plot symbolizes the full genome length of 2.7 Gbp. The main plot is divided into 1,000 size-ordered bins around the circumference with each bin representing 0.1 % of the final assembly. The distribution of sequence lengths is shown in dark grey with the plot radius scaled to the longest sequence present in the assembly (182,608,553 bp, shown in red). Due to the data being divided into bins, the sequence lengths seem to change in a jagged pattern instead of a smooth one. Orange and pale-orange arcs show the N50 and N90 sequence lengths (125,755,317 and 68,953,458 bp), respectively, while the red arc signifies the length of only one record, the longest one. The scale starts at the outer side, and it is logarithmic. The pale grey spiral shows the cumulative sequence count on a log scale with white scale lines showing successive orders of magnitude. The blue and pale-blue area around the outside of the plot shows the distribution of GC, AT and N percentages in the same bins as the inner plot.A summary of complete, fragmented, duplicated and missing BUSCO genes based on the glires_odb10 set is shown in the top right.

Figure 4 - Telomeric repeat arrays on the mountain hare chromosome assemblies, calculated in windows of 200 kb. The location and relative length of the repeat is presented by the red bars. Short telomeric repeats are not visible in the graph.

Figure 5 - A comparison of the LT1 mountain hare (Lepus timidus timidus) genome assembly with A) the previous Irish hare (Lepus timidus hibernicus) pseudoreference assembly (GCA_009760805, examples of karyotype differences are highlighted in red) and B) our previously published European brown hare (Lepus europaeus) genome assembly (GCF_033115175). Note the higher synteny but lower sequence similarity compared to (A). See the Results section for further explanation.

Discussion

To complement our previously published brown hare reference genome assembly (Michell et al., 2024), we have sequenced and produced a reference-quality genome assembly for the mountain hare. Although large and complex mammalian genomes can pose problems for assembly, our final genome assembly is highly continuous (85 scaffolds in the primary assembly, with an N50 of 126 Mbp, and contig N50 4.9 Mbp) and complete (BUSCO complete 95.1 % mammalia_odb10) (Tables 1–3, Figures 1–3). The genome assembly is at chromosome scale, with all expected 23 autosomes as well as X and Y scaffolded. Our assembly meets the 6.C.Q40 quality criteria for a reference-standard genome assembly (>1 Mb N50 contig continuity and chromosomal scale scaffolding), as proposed by the Earth BioGenome Project (Lawniczak et al., 2022). With the 99.9 % accuracy of PacBio HiFi reads (Wenger et al., 2019) and 21× genome coverage, the assembly also satisfies the reference genome standard of a nucleotide error rate of no more than 1/10,000 (Lawniczak et al., 2022). Most of the chromosome assemblies represent telomere-to-telomere continuity (Figure 4). High copy number telomeric sequences were also present within interstitial positions of some chromosomes, confirming previous observations using FISH staining of mountain hare cells (Forsyth et al., 2005). These regions might be remnants of ancestral chromosome translocation events that gave rise to the current 2n = 46 karyotype in hares.

At approximately 2.7 Gbp the curated genome length is notably shorter than the 3.1785 Gbp estimated for the haploid genome size of Lepus timidus from flow cytometry data (Vinogradov, 1998). However, the size of our genome assembly is very similar to the previous pseudoreference assembly (also 2.7 Gbp). We consider it likely that genome assemblies derived from long-molecule sequencing data provide more accurate genome size estimates than older methods such as flow cytometry, which might be influenced by the choice of references and chromosomal organization of the analyzed species.

As in the case of our brown hare genome assembly (Michell et al., 2024), we utilized living cultured fibroblasts to overcome the technical challenges in obtaining large quantities of intact, high molecular weight DNA for HiFi sequencing, essential for high quality and highly continuous genome assemblies. Importantly, our mountain hare genome assembly did not show any genomic rearrangements or aneuploidies that would have been obtained during cell culture. Although genomic changes can occur during cell culture (Didion et al., 2014), these are more likely in cancer cell lines with compromised DNA repair and cell proliferation pathways (He et al., 2023). Genetic changes need time as they appear by chance and increase their prevalence through drift. Therefore, long-term culturing of the cells should be avoided, and low-passage number cells should be preferred for analyses. Ultimately, the genomic stability of any cell line can be only assessed through karyotyping. Here too, modern sequencing approaches offer far greater insight than traditional chromosome-spreading methods. In this context, our hare genome assemblies can serve as benchmarks for using fibroblast cell lines for such work.

However, despite being devoid of karyotypic changes, we noted the insertion of the T-antigen vector used for immortalization of the cells (Gaertner et al., 2023) on chromosome 17 and 19. Interestingly, such insertions are not present in our brown hare genome assembly (Michell et al., 2024), indicating that the immortalization of this brown hare cell line has not worked. Although the insertions were in the intergenic region and we have removed the vector sequences from the final genome assembly, we recognize that this is not an ideal situation as we cannot control for possible insertional effects on the surrounding chromosomal region. Therefore, we recommend that primary cell lines should be used for reference genome work. For other types of studies it needs to be taken into consideration that non-immortalized cell lines have limited lifetime and can show signs of senescence, which can influence analyses.

The presented mountain hare genome assembly shares a high degree of similarity with the existing Irish hare pseudo-reference genome assembly. The observed differences in synteny (Figure 5A) can be attributed to use of the karyotypically different European rabbit (Oryctolagus cuniculus) for the chromosomal scaffolding of the Irish hare genome assembly (Marques et al., 2020). Compared to the recently published brown hare genome assembly (Michell et al., 2024), the presented mountain hare genome assembly shows almost identical synteny (Figure 5B), reflecting their close evolutionary relationship. However, when compared to the Irish hare, mountain hare – brown hare comparison yielded a higher number of chimeric alignments, likely due to minor differences in chromosome structure. These structural differences are unlikely to have a major biological significance, given that the two species can hybridize and produce fertile offspring.

The 93.09 % sequence similarity of the mountain hare and brown hare genomes is interesting to contrast with the 96.77 % similarity between the mountain hare subspecies. It should be noted that these similarities are based on matches between scaffold alignments, and comparing specific elements, such as coding sequences alone, would likely yield higher similarity percentages. Nonetheless, these figures highlight the relative differences between species and subspecies genomes, particularly as the Irish hare is known to be genetically distinct from Fennoscandian mountain hares (Giska et al., 2022). More detailed population-level comparisons might allow to pinpoint functional loci contributing to local adaptations (Giska et al., 2019) as well as shed light onto speciation processes (Gaertner et al., 2023).

The mountain hare is an iconic boreoalpine mammalian species with unique physiological and morphological adaptations to cold, snowy winter conditions, it is also an important part of the local food webs. Consequently, the population genetics, evolution and genomics of mountain hares has been intensively studied (e.g. Melo-Ferreira et al., 2009; Smith et al., 2017; Levänen et al., 2018b; Marques et al., 2020; Ferreira et al., 2021; Pohjoismäki et al., 2021; Giska et al., 2022; Michell et al., 2022; Gaertner et al., 2023). Together, this genome and the previously published brown hare genome assembly will provide a solid base for future studies on hares. Annotated genome assemblies also enable effective use of reverse genetics, allowing experimental manipulation of any gene or functional feature to study its phenotypic effects. This capability opens opportunities to experimentally demonstrate the functional significance of genetic variants controlling interesting traits, which otherwise could only be indirectly inferred from population genetic analyses. For population genetics, chromosomally scaffolded genome assemblies enable fast and reliable identification of informative SNPs and their linkage. Specifically, the mountain hare genome assembly may offer crucial insights for the conservation of threatened subspecies such as the heath hare (Lepus timidus sylvaticus) (Thulin et al., 2021; Michell et al., 2022) and Irish hare (Lepus timidus hibernicus) (Reid, 2011; Caravaggi et al., 2017).

In conclusion, the successful sequencing and assembly of the mountain hare genome, alongside the previously published brown hare genome, mark significant advancements in our understanding of these species' genomic landscapes. These high-quality genome assemblies not only provide invaluable resources for exploring the genetic basis of adaptation and resilience in hares, but also pave the way for future research in evolutionary biology, conservation genetics, and ecological genomics.

Acknowledgements

We acknowledge the DNA Sequencing and Genomics Laboratory, Institute of Biotechnology, University of Helsinki for the PacBio and Hi-C sequencing. We are grateful to the European Reference Genome Atlas (ERGA) for the opportunity to include our mountain hare genome into their pilot project. Mr Jussi Mähönen, MSc Manu Soininmäki and Finnish hound Hupi are thanked for their contribution in obtaining the mountain hare specimen. We are indebted also to Ms Anita Kervinen for her valuable laboratory assistance and Dr. Mervi Kunnasranta for kindly allowing to use her mountain hare photo in the publication. Preprint version 2 of this article has been peer-reviewed and recommended by Peer Community In Genomics (https://doi.org/10.24072/pci.genomics.100377; Narayan, 2025).

Funding

This study belongs to the xHARES consortium funded by the R'Life initiative of the Academy of Finland, grant no. 329264.

Conflict of interest disclosure

The authors declare that they comply with the PCI rule of having no financial conflicts of interest in relation to the content of the article.

Data, scripts, code, and supplementary information availability

The genome assembly and sequencing data is available from the NCBI database under BioProject ID PRJNA1112569 and PRJNA1112568 and accession numbers JBEDNU000000000 and JBEDNT000000000. The cell line is available through the corresponding author as well as from the ZFMK biobank (https://bonn.leibniz-lib.de/en/biobank). The hare cell lines can be also found through the Global Genome Biodiversity Network (GGBN) portal (https://www.ggbn.org/ggbn_portal/search/index).


References

[1] Andrews, S. FastQC: a quality control tool for high throughput sequence data, 2010 (http://www.bioinformatics.babraham.ac.uk/projects/fastqc)

[2] Angerbjörn, A.; Schai-Braun, S. C. Mountain Hare Lepus timidus Linnaeus, 1758, Handbook of the Mammals of Europe, Springer International Publishing, Cham, 2022, pp. 1-29 | DOI

[3] ATCC Passage number effects in cell lines, ATCC Techincal Documents , 2024 (https://www.atcc.org/-/media/resources/technical-documents/passage-number-effects-in-cell-lines.pdf)

[4] Beklemisheva, V. R.; Romanenko, S. A.; Biltueva, L. S.; Trifonov, V. A.; Vorobieva, N. V.; Serdukova, N. A.; Rubtsova, N. V.; Brandler, O. V.; O’Brien, P. C. M.; Yang, F.; Stanyon, R.; Ferguson-Smith, M. A.; Graphodatsky, A. S. Reconstruction of karyotype evolution in core Glires. I. The genome homology revealed by comparative chromosome painting, Chromosome Research, Volume 19 (2011) no. 4, pp. 549-565 | DOI

[5] Belaghzal, H.; Dekker, J.; Gibcus, J. H. Hi-C 2.0: An optimized Hi-C procedure for high-resolution genome-wide mapping of chromosome conformation, Methods, Volume 123 (2017), pp. 56-65 | DOI

[6] Bell, T. Irish Hare. In: A History of British Quadrupeds, Including the Cetacea, John van Voorst, London, 1837, 341 pages

[7] Blaxter, M.; Archibald, J. M.; Childers, A. K.; Coddington, J. A.; Crandall, K. A.; Di Palma, F.; Durbin, R.; Edwards, S. V.; Graves, J. A. M.; Hackett, K. J.; Hall, N.; Jarvis, E. D.; Johnson, R. N.; Karlsson, E. K.; Kress, W. J.; Kuraku, S.; Lawniczak, M. K. N.; Lindblad-Toh, K.; Lopez, J. V.; Moran, N. A.; Robinson, G. E.; Ryder, O. A.; Shapiro, B.; Soltis, P. S.; Warnow, T.; Zhang, G.; Lewin, H. A. Why sequence all eukaryotes?, Proceedings of the National Academy of Sciences, Volume 119 (2022) no. 4 | DOI

[8] Caravaggi, A.; Leach, K.; Santilli, F.; Rintala, J.; Helle, P.; Tiainen, J.; Bisi, F.; Martinoli, A.; Montgomery, W. I.; Reid, N. Niche overlap of mountain hare subspecies and the vulnerability of their ranges to invasion by the European hare; the (bad) luck of the Irish, Biological Invasions, Volume 19 (2017) no. 2, pp. 655-674 | DOI

[9] Cheng, H.; Concepcion, G. T.; Feng, X.; Zhang, H.; Li, H. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm, Nature Methods, Volume 18 (2021) no. 2, pp. 170-175 | DOI

[10] Danecek, P.; Bonfield, J. K.; Liddle, J.; Marshall, J.; Ohan, V.; Pollard, M. O.; Whitwham, A.; Keane, T.; McCarthy, S. A.; Davies, R. M.; Li, H. Twelve years of SAMtools and BCFtools, GigaScience, Volume 10 (2021) no. 2 | DOI

[11] Didion, J. P.; Buus, R. J.; Naghashfar, Z.; Threadgill, D. W.; Morse, H. C.; de Villena, F. P.-M. SNP array profiling of mouse cell lines identifies their strains of origin and reveals cross-contamination and widespread aneuploidy, BMC Genomics, Volume 15 (2014) no. 1 | DOI

[12] Ferreira, M. S.; Jones, M. R.; Callahan, C. M.; Farelo, L.; Tolesa, Z.; Suchentrunk, F.; Boursot, P.; Mills, L. S.; Alves, P. C.; Good, J. M.; Melo-Ferreira, J. The Legacy of Recurrent Introgression during the Radiation of Hares, Systematic Biology, Volume 70 (2021) no. 3, pp. 593-607 | DOI

[13] Flynn, J. M.; Hubley, R.; Goubert, C.; Rosen, J.; Clark, A. G.; Feschotte, C.; Smit, A. F. RepeatModeler2 for automated genomic discovery of transposable element families, Proceedings of the National Academy of Sciences, Volume 117 (2020) no. 17, pp. 9451-9457 | DOI

[14] Formenti, G.; Abueg, L.; Brajuka, A.; Brajuka, N.; Gallardo-Alba, C.; Giani, A.; Fedrigo, O.; Jarvis, E. D. Gfastats: conversion, evaluation and manipulation of genome sequences using assembly graphs, Bioinformatics, Volume 38 (2022) no. 17, pp. 4214-4216 | DOI

[15] Forsyth, N. R.; Elder, F. F.; Shay, J. W.; Wright, W. E. Lagomorphs (rabbits, pikas and hares) do not use telomere-directed replicative aging in vitro, Mechanisms of Ageing and Development, Volume 126 (2005) no. 6-7, pp. 685-691 | DOI

[16] Gaertner, K.; Michell, C.; Tapanainen, R.; Goffart, S.; Saari, S.; Soininmäki, M.; Dufour, E.; Pohjoismäki, J. L. O. Molecular phenotyping uncovers differences in basic housekeeping functions among closely related species of hares ( Lepus spp., Lagomorpha: Leporidae), Molecular Ecology, Volume 32 (2023) no. 15, pp. 4097-4117 | DOI

[17] Giska, I.; Farelo, L.; Pimenta, J.; Seixas, F. A.; Ferreira, M. S.; Marques, J. P.; Miranda, I.; Letty, J.; Jenny, H.; Hackländer, K.; Magnussen, E.; Melo-Ferreira, J. Introgression drives repeated evolution of winter coat color polymorphism in hares, Proceedings of the National Academy of Sciences, Volume 116 (2019) no. 48, pp. 24150-24156 | DOI

[18] Giska, I.; Pimenta, J.; Farelo, L.; Boursot, P.; Hackländer, K.; Jenny, H.; Reid, N.; Montgomery, W. I.; Prodöhl, P. A.; Alves, P. C.; Melo‐Ferreira, J. The evolutionary pathways for local adaptation in mountain hares, Molecular Ecology, Volume 31 (2022) no. 5, pp. 1487-1503 | DOI

[19] Guan, D.; McCarthy, S. A.; Wood, J.; Howe, K.; Wang, Y.; Durbin, R. Identifying and removing haplotypic duplication in primary genome assemblies, Bioinformatics, Volume 36 (2020) no. 9, pp. 2896-2898 | DOI

[20] Gustavsson, I. Mitotic and meiotic chromosomes of the variable hare (Lepus timidus L.), the common hare (Lepus europaeus Pall.) and their hybrids, Hereditas, Volume 67 (1972) no. 1, pp. 27-34 | DOI

[21] He, Z.; Wilson, A.; Rich, F.; Kenwright, D.; Stevens, A.; Low, Y. S.; Thunders, M. Chromosomal instability and its effect on cell lines, Cancer Reports, Volume 6 (2023) no. 6 | DOI

[22] Howe, K.; Chow, W.; Collins, J.; Pelan, S.; Pointon, D.-L.; Sims, Y.; Torrance, J.; Tracey, A.; Wood, J. Significantly improving the quality of genome assemblies through curation, GigaScience, Volume 10 (2021) no. 1 | DOI

[23] Jansson, G.; Pehrson, Å. The recent expansion of the brown hare (Lepus europaeus) in Sweden with possible implications to the mountain hare (L. timidus), European Journal of Wildlife Research, Volume 53 (2007) no. 2, pp. 125-130 | DOI

[24] Korstanje, R.; O’Brien, P.; Yang, F.; Rens, W.; Bosma, A.; van Lith, H.; van Zutphen, L.; Ferguson-Smith, M. Complete homology maps of the rabbit (Oryctolagus cuniculus) and human by reciprocal chromosome painting, Cytogenetic and Genome Research, Volume 86 (1999) no. 3-4, pp. 317-322 | DOI

[25] Lawniczak, M. K. N.; Durbin, R.; Flicek, P.; Lindblad-Toh, K.; Wei, X.; Archibald, J. M.; Baker, W. J.; Belov, K.; Blaxter, M. L.; Marques Bonet, T.; Childers, A. K.; Coddington, J. A.; Crandall, K. A.; Crawford, A. J.; Davey, R. P.; Di Palma, F.; Fang, Q.; Haerty, W.; Hall, N.; Hoff, K. J.; Howe, K.; Jarvis, E. D.; Johnson, W. E.; Johnson, R. N.; Kersey, P. J.; Liu, X.; Lopez, J. V.; Myers, E. W.; Pettersson, O. V.; Phillippy, A. M.; Poelchau, M. F.; Pruitt, K. D.; Rhie, A.; Castilla-Rubio, J. C.; Sahu, S. K.; Salmon, N. A.; Soltis, P. S.; Swarbreck, D.; Thibaud-Nissen, F.; Wang, S.; Wegrzyn, J. L.; Zhang, G.; Zhang, H.; Lewin, H. A.; Richards, S. Standards recommendations for the Earth BioGenome Project, Proceedings of the National Academy of Sciences, Volume 119 (2022) no. 4 | DOI

[26] Levänen, R.; Thulin, C.-G.; Spong, G.; Pohjoismäki, J. L. O. Widespread introgression of mountain hare genes into Fennoscandian brown hare populations, PLOS ONE, Volume 13 (2018) no. 1 | DOI

[27] Li, H. Minimap2: pairwise alignment for nucleotide sequences, Bioinformatics, Volume 34 (2018) no. 18, pp. 3094-3100 | DOI

[28] Linnæus, C. Systema naturæ per regna tria naturæ, secundum classes, ordines, genera, species, cum characteribus, differentiis, synonymis, locis (Vol. Tomus I. Editio decima, reformata.). p. 57. Holmiæ. (Salvius) (1758)

[29] Manni, M.; Berkeley, M. R.; Seppey, M.; Zdobnov, E. M. BUSCO: Assessing Genomic Data Quality and Beyond, Current Protocols, Volume 1 (2021) no. 12 | DOI

[30] Marques, J. P.; Seixas, F. A.; Farelo, L.; Callahan, C. M.; Good, J. M.; Montgomery, W. I.; Reid, N.; Alves, P. C.; Boursot, P.; Melo-Ferreira, J. An Annotated Draft Genome of the Mountain Hare (Lepus timidus), Genome Biology and Evolution, Volume 12 (2020) no. 1, pp. 3656-3662 | DOI

[31] Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads, EMBnet.journal, Volume 17 (2011) no. 1 | DOI

[32] Mc Cartney, A. M.; Formenti, G.; Mouton, A.; De Panis, D.; Marins, L. S.; Leitao, H. G.; Diedericks, G.; Kirangwa, J.; Morselli, M.; Salces, J.; Escudero, N.; Iannucci, A.; Natali, C.; Svardal, H.; Fernandez, R.; De Pooter, T.; Joris, G.; Strazisar, M.; Wood, J.; Herron, K. E.; Seehausen, O.; Watts, P. C.; Shaw, F.; Davey, R. P.; Minotto, A.; Fernandez Gonzalez, J. M.; Bohne, A.; Alegria, C.; Alioto, T.; Alves, P. C.; Amorim, I. R.; Aury, J.-M.; Backstrom, N.; Baldrian, P.; Ballarin, L.; Baltrunaite, L.; Barta, E.; BedHom, B.; Belser, C.; Bergsten, J.; Bertrand, L.; Bilandija, H.; Binzer-Panchal, M.; Bista, I.; Blaxter, M.; Borges, P. A.; Borges Dias, G.; Bosse, M.; Brown, T.; Bruggmann, R.; Buena-Atienza, E.; Burgin, J.; Buzan, E.; Cariani, A.; Casadei, N.; Chiara, M.; Chozas, S.; Ciampor, F.; Crottini, A.; Cruaud, C.; Cruz, F.; Dalen, L.; De Biase, A.; del Campo, J.; Delic, T.; Dennis, A. B.; Derks, M. F.; Diroma, M. A.; Djan, M.; Duprat, S.; Eleftheriadi, K.; Feulner, P. G.; Flot, J.-F.; Forni, G.; Fosso, B.; Fournier, P.; Fournier-Chambrillon, C.; Gabaldon, T.; Garg, S.; Gissi, C.; Giupponi, L.; Gomez-Garrido, J.; Gonzalez, J.; Grilo, M. L.; Gruening, B.; Guerin, T.; Guiglielmoni, N.; Gut, M.; Haesler, M. P.; Hahn, C.; Halpern, B.; Harrison, P.; Heintz, J.; Hindrikson, M.; Hoglund, J.; Howe, K.; Hughes, G.; Istace, B.; Cock, M. J.; Jancekovic, F.; Jonsson, Z. O.; Joye-Dind, S.; Koskimaki, J. J.; Krystufek, B.; Kubacka, J.; Kuhl, H.; Kusza, S.; Labadie, K.; Lahteenaro, M.; Lantz, H.; Lavrinienko, A.; Leclere, L.; Lopes, R. J.; Madsen, O.; Magdelenat, G.; Magoga, G.; Manousaki, T.; Mappes, T.; Marques, J. P.; Martinez Redondo, G. I.; Maumus, F.; McCarthy, S. A.; Megens, H.-J.; Melo-Ferreira, J.; Mendes, S. L.; Montagna, M.; Moreno, J.; Mosbech, M.-B.; Moura, M.; Musilova, Z.; Myers, E.; Nash, W. J.; Nater, A.; Nicholson, P.; Niell, M.; Nijland, R.; Noel, B.; Noren, K.; Oliveira, P. H.; Olsen, R.-A.; Ometto, L.; Oomen, R. A.; Ossowski, S.; Palinauskas, V.; Palsson, S.; Panibe, J. P.; Pauperio, J.; Pavlek, M.; Payen, E.; Pawlowska, J.; Pellicer, J.; Pesole, G.; Pimenta, J.; Pippel, M.; Pirttila, A. M.; Poulakakis, N.; Rajan, J.; Rego, R. M.; Resendes, R.; Resl, P.; Riesgo, A.; Rodin-Morch, P.; Soares, A. E.; Rodriguez Fernandes, C.; Romeiras, M. M.; Roxo, G.; Ruber, L.; Ruiz-Lopez, M. J.; Saarma, U.; Silva, L. P.; Sim-Sim, M.; Soler, L.; Sousa, V. C.; Sousa Santos, C.; Spada, A.; Stefanovic, M.; Steger, V.; Stiller, J.; Stock, M.; Struck, T. H. H.; Sudasinghe, H.; Tapanainen, R.; Tellgren-Roth, C.; Trindade, H.; Tukalenko, Y.; Urso, I.; Vacherie, B.; Van Belleghem, S. M.; van Oers, K.; Vargas-Chavez, C.; Velickovic, N.; Vella, N.; Vella, A.; Vernesi, C.; Vicente, S.; Villa, S.; Vinnere Pettersson, O.; Volckaert, F. A.; Voros, J.; Wincker, P.; Winkler, S.; Ciofi, C.; Waterhouse, R. M.; Mazzoni, C. J. The European Reference Genome Atlas: piloting a decentralised approach to equitable biodiversity genomics, bioRxiv, 2023 | DOI

[33] Melo‐Ferreira, J.; Alves, P. C.; Freitas, H.; Ferrand, N.; Boursot, P. The genomic legacy from the extinct Lepus timidus to the three hare species of Iberia: contrast between mtDNA, sex chromosomes and autosomes, Molecular Ecology, Volume 18 (2009) no. 12, pp. 2643-2658 | DOI

[34] Michell, C.; Collins, J.; Laine, P. K.; Fekete, Z.; Tapanainen, R.; Wood, J. M. D.; Goffart, S.; Pohjoismäki, J. L. O. High quality genome assembly of the brown hare (Lepus europaeus) with chromosome-level scaffolding, Peer Community Journal, Volume 4 (2024) | DOI

[35] Michell, C. T.; Pohjoismäki, J. L. O.; Spong, G.; Thulin, C.-G. Mountain- and brown hare genetic polymorphisms to survey local adaptations and conservation status of the heath hare (Lepus timidus sylvaticus, Nilsson 1831), Scientific Data, Volume 9 (2022) no. 1 | DOI

[36] Mikheenko, A.; Prjibelski, A.; Saveliev, V.; Antipov, D.; Gurevich, A. Versatile genome assembly evaluation with QUAST-LG, Bioinformatics, Volume 34 (2018) no. 13 | DOI

[37] Narayan, J. The genomic foundations of adaptation: evaluating the mountain hare, Peer Community in Genomics (2025) | DOI

[38] Nilsson, S. Mo-Hare In: In: Illuminerade figurer till Skandinavens fauna, Plate 22. Academie-boktryckeriet hos C. F. Berling (1831)

[39] O'Connor, B.; van der Auwera, G. Genomics in the Cloud: Using Docker, GATK, and WDL in Terra, O'Reilly Media, Incorporated, 2020

[40] Pohjoismäki, J. L. O.; Michell, C.; Levänen, R.; Smith, S. Hybridization with mountain hares increases the functional allelic repertoire in brown hares, Scientific Reports, Volume 11 (2021) no. 1 | DOI

[41] Ranallo-Benavidez, T. R.; Jaron, K. S.; Schatz, M. C. GenomeScope 2.0 and Smudgeplot for reference-free profiling of polyploid genomes, Nature Communications, Volume 11 (2020) no. 1 | DOI

[42] Rehnus, M.; Bollmann, K.; Schmatz, D. R.; Hackländer, K.; Braunisch, V. Alpine glacial relict species losing out to climate change: The case of the fragmented mountain hare population (Lepus timidus) in the Alps, Global Change Biology, Volume 24 (2018) no. 7, pp. 3236-3253 | DOI

[43] Reid, N. European hare (Lepus europaeus) invasion ecology: implication for the conservation of the endemic Irish hare (Lepus timidus hibernicus), Biological Invasions, Volume 13 (2011) no. 3, pp. 559-569 | DOI

[44] Rhie, A.; Walenz, B. P.; Koren, S.; Phillippy, A. M. Merqury: reference-free quality, completeness, and phasing assessment for genome assemblies, Genome Biology, Volume 21 (2020) no. 1 | DOI

[45] Sambrook, J.; Russell, D. W. Purification of Nucleic Acids by Extraction with Phenol:Chloroform, Cold Spring Harbor Protocols, Volume 2006 (2006) no. 1 | DOI

[46] Smit, A.; Hubley, R.; Green, P. RepeatMasker Open-4.0, 2015 (http://www.repeatmasker.org)

[47] Smith, S.; Sandoval-Castellanos, E.; Lagerholm, V. K.; Napierala, H.; Sablin, M.; Von Seth, J.; Fladerer, F. A.; Germonpré, M.; Wojtal, P.; Miller, R.; Stewart, J. R.; Dalén, L. Nonreceding hare lines: genetic continuity since the Late Pleistocene in European mountain hares (Lepus timidus), Biological Journal of the Linnean Society, Volume 120 (2017) no. 4, pp. 891-908 | DOI

[48] Tapanainen, R.; Aasumets, K.; Fekete, Z.; Goffart, S.; Dufour, E.; L. O. Pohjoismäki, J. Species-specific variation in mitochondrial genome tandem repeat polymorphisms in hares (Lepus spp., Lagomorpha, Leporidae) provides insight into their evolution, Gene, Volume 926 (2024) | DOI

[49] Thulin, C.-G. The distribution of mountain hares Lepus timidus in Europe: a challenge from brown hares L. europaeus?, Mammal Review, Volume 33 (2003) no. 1, pp. 29-42 | DOI

[50] Thulin, C.-G.; Winiger, A.; Tallian, A.; Kindberg, J. Hunting harvest data in Sweden indicate precipitous decline in the native mountain hare subspecies Lepus timidus sylvaticus (heath hare), Journal for Nature Conservation, Volume 64 (2021) | DOI

[51] Vasimuddin, M.; Misra, S.; Li, H.; Aluru, S. Efficient Architecture-Aware Acceleration of BWA-MEM for Multicore Systems, 2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS) (2019) | DOI

[52] Vinogradov, A. E. Genome size and GC-percent in vertebrates as determined by flow cytometry: The triangular relationship, Cytometry, Volume 31 (1998) no. 2, pp. 100-109 | DOI

[53] Vurture, G. W.; Sedlazeck, F. J.; Nattestad, M.; Underwood, C. J.; Fang, H.; Gurtowski, J.; Schatz, M. C. GenomeScope: fast reference-free genome profiling from short reads, Bioinformatics, Volume 33 (2017) no. 14, pp. 2202-2204 | DOI

[54] Wenger, A. M.; Peluso, P.; Rowell, W. J.; Chang, P.-C.; Hall, R. J.; Concepcion, G. T.; Ebler, J.; Fungtammasan, A.; Kolesnikov, A.; Olson, N. D.; Töpfer, A.; Alonge, M.; Mahmoud, M.; Qian, Y.; Chin, C.-S.; Phillippy, A. M.; Schatz, M. C.; Myers, G.; DePristo, M. A.; Ruan, J.; Marschall, T.; Sedlazeck, F. J.; Zook, J. M.; Li, H.; Koren, S.; Carroll, A.; Rank, D. R.; Hunkapiller, M. W. Accurate circular consensus long-read sequencing improves variant detection and assembly of a human genome, Nature Biotechnology, Volume 37 (2019) no. 10, pp. 1155-1162 | DOI

[55] Zhou, C.; McCarthy, S. A.; Durbin, R. YaHS: yet another Hi-C scaffolding tool, Bioinformatics, Volume 39 (2023) no. 1 | DOI

[56] Zimova, M.; Giery, S. T.; Newey, S.; Nowak, J. J.; Spencer, M.; Mills, L. S. Lack of phenological shift leads to increased camouflage mismatch in mountain hares, Proceedings of the Royal Society B: Biological Sciences, Volume 287 (2020) no. 1941 | DOI

[57] Zimova, M.; Mills, L. S.; Nowak, J. J. High fitness costs of climate change‐induced camouflage mismatch, Ecology Letters, Volume 19 (2016) no. 3, pp. 299-307 | DOI


block.super