Mitochondrial Control Region Database of Hungarian Fallow Deer (Dama dama) Populations for Forensic Use
Orsolya K. Zorkóczy, Zsombor Wagenhoffer, Pál Lehotzky, Zsolt Pádár, Petra Zenke

TL;DR
This study creates a mitochondrial DNA database for Hungarian fallow deer to help in forensic investigations, but finds limited genetic diversity and limited ability to determine population origin.
Contribution
The study provides a forensic mitochondrial DNA haplotype database for Hungarian fallow deer and identifies a new haplotype.
Findings
Four haplotypes were identified, including one previously undescribed.
Low mtDNA diversity was observed (Hd = 0.565 and π = 0.002), similar to other countries.
A differentiation pattern among regions was detected, which may be useful in forensic contexts.
Abstract
Hungary is world-famous for its fallow deer (Dama dama) population and hunting, with approximately 60% of the best trophies originating from this country. Unfortunately, the species also falls victim to poaching. Several studies have already assessed the genetic relationship between fallow deer in certain areas of Europe. The identification of biological materials through mitochondrial DNA analysis has become increasingly important in forensic cases, as it can provide associative evidence connecting victims and suspects. In this study, we determined the extent to which fallow deer mitochondrial control region haplotypes occurring in a given country can be used in legal cases for the preliminary selection of evidence or to link the incriminated animals and the degraded biological remains to the given area. Additionally, we determined which segment of the control region, with its adequate…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8- —University of Veterinary Medicine Budapest
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWildlife Ecology and Conservation · Animal Ecology and Behavior Studies · Isotope Analysis in Ecology
1. Introduction
The fallow deer (Dama dama) has a cosmopolitan distribution across nearly every continent worldwide, facilitated by human intervention, and boasts a significant population of more than 40,000 individuals in Hungary [1,2]. Almost 45% of this population is harvested annually, underscoring the species’ substantial game management value. This is due to the infrastructure established to support their hunting, as well as the meat (venison) and trophies obtained from hunting, which also hold cultural and nature conservation significance [3]. Although the hunting of fallow deer in Hungary operates within the legal framework (e.g., Act LV of 1996) [4], instances of poaching still occur. Hunting is only permitted for authorized individuals using approved hunting tools (such as bullet firearms exceeding the energy content established by law) and strictly during the designated hunting season, which differs according to sex and age (Supplementary Material Table S1) [4]. Failure to meet any of these conditions constitutes illegal hunting, a trend highlighted by several studies conducted in Hungary [5,6].
In cases with significant legal implications, proving the suspect’s guilt poses a challenge. However, genetic identification can be employed to compare biological remains (e.g., hairs, blood contamination) found at the scene with those on the suspect’s belongings, as well as identification and selection of the out-portioned, vast number of uncontrolled or illegal meats from private or restaurant freezers. If the test results reveal differences between evidential and reference samples, the genetic evidence cannot support the suspect’s involvement in the crime. Conversely, if there is a match, it strengthens this assumption. The representativeness of the reference samples in the database is crucial in assessing the rarity of DNA profiles associated with evidence from a crime scene when both reference and evidentiary samples originate from the same geographic populations. Hence, a more precise understanding of the genetic structure of local and global deer populations is necessary to interpret matching DNA haplotypes or genotypes accurately. While a genetic method capable of identifying individual fallow deer is available, it is not universally applicable [7]. Many crime scenes are outdoors, leading to the environmental breakdown of evidential samples and significant DNA degradation over time due to factors such as UV exposure, moisture, and bacterial activity.
The advantage of mitochondrial DNA (mtDNA), with its circular structure, is that it is surrounded by a double phospholipid membrane and can be found in multiple copies per cell; thus, it can usually be successfully detected even in small amounts of degraded samples [8]. Therefore, mitochondrial DNA is useful to assist in the identification of the source of a biological sample (such as species and subspecies determination) or to confirm matrilineal relatedness in phylogenetic studies [9,10,11,12,13]. The examination of mtDNA can be important for differentiating populations [14,15] and for determining their geographical origin, as a haplotype may become fixed in a particular region, and the place of origin of an individual can be determined based on the haplotype of a biological sample of unknown origin [9]. The mutation rate of the mitochondrial non-coding control region (CR or D-loop) is five to ten times higher than the average rate of synonymous substitutions of nuclear genes, and therefore more polymorphic, making it widely researched in various Cervidae species [14,15,16,17,18,19,20,21]. Although mitochondrial DNA is not a tool for individualization purposes, owing to its matrilineal inheritance, mtDNA directly links maternal relatives, which can be used as match references where two or more nucleotide discrepancies are needed for a mismatch or exclusion [22], and it is capable of excluding many potential sources despite lower discriminating power than nuclear DNA. For this marker type, it is also necessary to create databases to assess the frequency of alleles or haplotypes within a relevant population [23]. For haploid markers such as mtDNA, where profiles are expected to be shared by many matrilineally related individuals, the strength of the evidence is determined not only by the variability of the sequence but also by the size of the geographically relevant genetic database, which should be large enough to accurately reflect the local diversity [24]. It is well known from previous studies that, especially in fallow deer, the founder effect and the relatively low mutation rate of the mitochondrial genome (compared to microsatellites) indicate that there can be large sets of matrilineally related individuals sharing a common mitogenome [9,10,11,12]. While the CR is widely used in various fields, the primers, and consequently the length of the amplicon, are not standardized. Although shorter sequences are easier to amplify, information loss can occur; therefore, we planned to assess the most informative region. As many articles support the fact that polymorphic sites may occur outside of the conventionally examined, shorter, ‘mutation hotspot’ section, we also aimed to examine the sequence of the entire control region in the samples.
Assessment of the diversity of fallow deer populations based on the mitochondrial CR has already begun in the peripheral regions of Hungary. However, these studies examined only shorter sections of the control region, 708 bp in Southern Hungary (n = 13) and 450 bp in Northeastern Hungary (n = 41) [9,10]. No such data are available from other parts of the country. It is important to determine to what extent the control region can be both sufficient and efficient for the regional differentiation of the domestic fallow deer populations, and thus, to what extent it can assist in cases with legal consequences. For this reason, based on the aggregated (newly defined plus existing) haplotype data, we assessed the probability of matching and whether this extended section of the CR is suitable for the regional separation of domestic fallow deer herds.
2. Materials and Methods
2.1. Sampling and DNA Extraction
Muscle or hide samples (n = 138) were collected from registered shootings by hunters with a license between 2019 and 2024 from five regions in Hungary, focusing on those regions in which investigations had not been carried out so far and where the occurrence and hunting of fallow deer are common (i.e., north-west (NW), n = 34; south-west (SW), n = 30; north-middle (NM), n = 42; south-middle (SM), n = 31; and south-east (SE), n = 1). Genomic DNA was isolated using a FavorPrep^TM^ Tissue Genomic DNA Extraction Mini Kit (Favorgen Biotech, Ping-Tung, Taiwan) following the procedural guidelines. The quality of the extracted DNA was tested using 1% agarose gel stained with GelRed^TM^ Nucleic Acid Gel Stain (Biotium, Fremont, CA, USA), and the concentration was measured using a Qubit 2.0 Fluorometer (Life Technologies Corporation, Carlsbad, CA, USA). Isolated DNA from the tissue samples was stored at −20 °C until subsequent analysis.
2.2. Mitochondrial Control Region Amplification, Sequencing, and Haplotype Determination
The most informative length and variable sites for the fallow deer CR sequences were determined by an analysis of previous studies [9,10,11,12]. Based on the mitogenome sequence found in GenBank (Accession Number: NC_020700), the entire mitochondrial control region (from 15,400 to 16,146 base pairs) was amplified using newly designed primers (Primer Designer 4 software [http://www.scied.com, accessed on 14 February 2023]) positioned outside the CR. A primer naming convention was used, where the primer name indicates the position of the 5′ base. The forward primer (5′-ACCCCACTATCAACACCC-3′) was defined as F15,386, and the reverse primer (5′-TATGCATAATTAGAGAAAAATTGG-3′) was defined as R16,330.
Amplification was performed in a 25 μL reaction volume containing 12.5 μL DreamTaq™ Green DNA Polymerase (Thermo Fisher Scientific, Waltham, MA, USA), 0.5 μmol forward and reverse primer, 1–10 ng DNA template, and PCR grade H_2_O to volume. PCR was carried out on 2720 Thermal Cyclers (Applied Biosystems, Waltham, MA, USA) using the following conditions: initial denaturation for 10 s at 94 °C, 36 cycles of 40 s at 94 °C, 40 s at 56 °C, 60 s at 72 °C, and a final extension for 2 m at 60 °C. Sequencing reactions of the purified DNA fragments (GenElute™ PCR Clean-Up Kit, Sigma-Aldrich, St. Louis, MO, USA) were carried out with the BigDye^®^ Terminator v3.1 Cycle Sequencing Kit (Thermo Fisher Scientific, Waltham, MA, USA) and on an ABI3500 genetic analyzer (ThermoFisher Scientific, Waltham, MA, USA). Sequence data were analyzed by Sequence Analysis 3.4.1 (Applied Biosystems) and aligned against a reference sequence (GenBank Acc. No: NC_020700) by Sequencher^TM^ 5.4.6 software (Gene Codes Corp, Ann Arbor, MI, USA) for unique haplotype identification.
Our results were supplemented with the previous control region sequences of 54 Hungarian fallow deer [9,10], downloaded from the NCBI (National Center for Biotechnology Information, Bethesda, MD USA) GenBank database. The downloaded sequences (n = 54) and our new sequences (n = 138) were aligned using MEGA11 software with ClustalW default settings [25]. Statistical tests were performed with a combined examination of the total of these 192 sequences. These sequences were also analyzed separately as six populations mainly divided based on natural barriers such as rivers and lakes.
Wright’s F-statistic was calculated using the DNA Sequence Polymorphism (DnaSP) software [26], and the Fst value was calculated per population pair. Additionally, we determined the number of polymorphic sites and haplotypes found in the sequence for each sampling site and all samples, as well as the haplotype and nucleotide diversity. To estimate the overall haplotype match probability (or random match probability, RMP) for fallow deer sampled at random within the Hungarian population, we used RMP = ∑pi^2^, where p is the frequency of the observed haplotype. The RMP, or probability of matching (PM), is defined as the probability of observing a random match between two unrelated individuals [27] and conveying the significance of a statistical match between reference and evidential sequences in forensic cases.
3. Results
Based on previous research and sequences downloaded from GenBank, we determined the investigated section of the control region and the frequency of polymorphic sites in fallow deer introduced to different countries. Sequences from original populations, such as Rhodes or Turkey, were not included in this examination, as they have many more variable sites. This indicates that they still possess some of the original genetic variety of fallow deer not found in the current introduced populations, thus excluding them from the study’s main target. Figure 1 shows that the entire length of the mitochondrial control region, as well as the section that follows, contains variable SNP (Single Nucleotide Polymorphism) sites that contribute to the formation of different haplotypes (see Supplementary Material Tables S2 and S3 for details). Based on this information, a 945-bp-long section containing the whole CR was amplified with the designed primer pair, of which a sequence with a length of approx. 900 base pairs could be reliably evaluated. Due to our selected CR section, four haplotypes were detected for a total of 138 fallow deer samples from the five sampling sites, three of which were already described in previous research [9,10]. A new haplotype sequence was detected from the NM region (Figure 1 and Figure 2), which has been uploaded to GenBank (Accession Number: PP558272). Based on the sequence alignments, we standardized the names of the haplotypes for the current and previous results obtained from Hungarian fallow deer (Figure 1), and thenceforth, we used these names.
We analyzed the observed haplotypes and their frequencies, supplemented with domestic fallow deer sequences (n = 54) from previous research [9,10]. Figure 2 represents the six populations created, within each of which three haplotypes were detected, except for the north-west (NW) region, where only two haplotypes occurred. Altogether, six haplotypes were identified, including six variable sites among the 192 individuals. All polymorphic positions were the result of substitutions, with no indels observed (see Supplementary Material Figure S1 for more details). Regarding haplotype frequencies among the 192 Hungarian samples examined so far (Figure 2), 66 individuals had Hun1 (34%, previously named H1 or Hap17 [9,10]), and 108 had Hun2 (56%, previously named H2 [9]). One individual each had Hun3 (previously named H3 [9]) and Hun4 (0.5% each). In comparison, nine individuals had Hun5 (4.7%, previously named Hap5), and seven individuals had Hun6 (3.6%, previously named Hap9). Indices of molecular diversity for each group are presented in Table 1. Based on our calculation in the combined dataset, F statistical tests showed that only two southern regions (SM and SE) differ significantly (Fst values ranged between 0.177 and 0.359) from all other sampling locations in Hungary. However, the corrected Gst value, which also considers the population sizes, showed a significant difference in the comparison of only four population pairs (SE-NW, SE-NE, SE-SM, and SM-SW). No significant differences (0.15–0.25) were observed either in Fst or in Gst values between the other sampling sites, and the negative value (effectively seen as zero) calculated between NM and NE indicates that there is no genetic subdivision among these populations.
The aggregated frequencies from recent and previous studies provided a random match probability between the six areas ranging from 0.429 (SE) to 0.677 (SM).
4. Discussion
The importance or legal status of wild animal representatives can vary by country. Fallow deer from Hungary have some of the finest known antlers of the species, and this is one of the reasons why the species frequently falls victim to poaching. In forensic cases, there is a need to obtain the most accurate and informative data possible. Based on the comparative analysis of fallow deer sequence data from several countries, differences were found in the locations of the polymorphic positions of the mitochondrial control region in introduced populations with very low diversity, due to the founder effect. The number and location of these variations strongly depends on the region (country) of origin of the sequences. This means that sequence regions showing mutational hotspots in one country show no diversity at all in another geographic area. At the same time, it can also be seen that the pattern of sections frequented by mutations is similar in sequences with different origins. The length and position of the corresponding genetic section to be examined must therefore be assessed based on the population study of the given country.
Mitochondrial control region databases are highly valuable for the analysis of minimal amounts of degraded samples (such as hair, feces, and processed samples). These databases have been developed for several species, besides humans, for forensic use [28,29,30,31,32]. For this reason, we aimed to assess the accuracy of the CR sequences of different lengths and whether the implementation of longer sections would be beneficial for forensic investigations. With the help of the 127 fallow deer sequences worldwide, downloaded from GenBank, we showed that mutation hot spots can occur over the entire length of the CR. The examination of the Hungarian fallow deer population revealed that mutations within the CR are concentrated in a shorter section, resulting in all six SNP positions being found within the shorter (450 bp) region previously examined in Hungary [10]. However, further surveys in herds from other countries can result in different patterns.
Because fallow deer, except for the native populations in Turkey and Rhodes, have a very limited genetic diversity [9,10,11,12], every opportunity should be explored to examine potential polymorphic sites, thus detecting possible deviations. The variation reported herein was compared with the published fallow deer genetic data to determine whether our database was typical of fallow deer populations in other countries. Haplotype diversity (Hd) is the probability that the haplotypes of two randomly selected individuals differ, while nucleotide diversity (π) is the average number of nucleotide differences between sequences per base position [33]. The investigated Hungarian fallow deer populations show similarly low average mtDNA diversity (Hd = 0.565) and nucleotide diversity (π = 0.002), compared with data from other countries (Hd = 0–0.902; π = 0–0.01029) [9,11] and previous data from Hungary [10].
While previous studies rarely examined more than 60 individuals per country [9,10], the current study includes broader sampling within the country to uncover additional variation. In our study, a new haplotype was registered; therefore, six haplotypes have altogether been detected so far in Hungary from 192 samples. A similar degree of polymorphism was detected in Germany, where 10 haplotypes were detected by surveying a similarly large number of samples (n = 365) [11]. In other European countries, the number of haplotypes ranged between three and fifteen, investigated from a few dozen samples. The determination of when sufficient samples have been ascertained to adequately represent a population depends on the population size and the mitotype diversity observed within a population. An ideal sample set would be considered saturated when sampling additional individuals from the population no longer increases the absolute number of observed types [34]. Generally, the number of observed haplotypes increases with sample size, while the proportion of rare haplotypes (i.e., encountered only once or twice) decreases [35]. Our results in most of the populations investigated support these statements; as a result, the exclusion probability largely remained the same with sample size expansion [36]. In one of the regions investigated in this study, only a limited sample size was available (SW, n = 7), thus causing a tendency to overestimate haplotype frequencies, which decreases the evidential value of an mtDNA match. Because overestimations do not increase the risk of incriminating a false suspect, under-sampling can be considered a conservative error [37].
In general, genetic subdivisions and inbreeding must be considered in wildlife forensic DNA analysis, particularly in the case of fallow deer. DNA databases that reflect the genetic composition and geographic structure are important for accurately calculating the rarity of allele frequencies and mtDNA haplotypes to determine exclusion probabilities [22]. Therefore, we examined the possibilities of the mitochondrial DNA control region to determine whether there is geographical separation of the haplotypes, and this can also help solve cases with legal consequences. To evaluate the significance of a haplotype match between a biological trace and its suspected donor, a population sample should reliably represent the population to which the donor of the trace is supposed to belong. Regarding the distribution of domestic haplotypes, Hun3 and Hun4 have, so far, only occurred in one individual each in the north-eastern (NE) and north-middle (NM) regions of Hungary, respectively; Hun5 and Hun6 only occur in Southern Hungary, in nine and seven individuals, respectively. The other two haplotypes (Hun1 and Hun2) were far more common at 90.6%, with these occurring almost throughout the entire country. Although earlier genetic research conducted in European fallow deer showed a high degree of nuclear and mitochondrial DNA diversity, only a small degree of variation is present per country due to the fixation of local allelic variants [9,10,14,19,38,39,40,41,42,43,44]. The Fst (fixation index) and Gst values provide information on the segmentation of the populations, as it shows how much the proportion of haplotypes decreases because the metapopulation consists of two or more subpopulations with different haplotype frequencies. The higher the fixation index, the more suitable the genetic marker is for separating populations. Fst ≥ 0.15 already indicates a significant genetic difference between subpopulations [45]. Our results indicate that despite the overall low genetic mtDNA diversity within the Hungarian fallow deer samples, a pattern of differentiation among the regions is present, which can have relevance from a forensic point of view.
We are aware of the limitations of our dataset in identifying individuals derived from the study of autosomal microsatellites [7]; however, these tests can be conducted faster and more easily. Furthermore, in case a nuclear DNA test is unsuccessful, this type of extranuclear genome can likely be primarily used as an exclusionary tool. Nonetheless, using these techniques without reference data for comparison may lead to incompatible case reporting; therefore, comprehensive domestic genetic databases are needed. These can help investigators trace samples to their sources of origin (population, species, geographic region) and aid in the arrest, conviction, and subsequent sentencing of perpetrators for smuggling, poaching, or possession [46]. Despite the fact that our calculated random match probability (RMP) of 0.547 shows a high probability of coincidence and, therefore, a limited capacity of exclusionary potential, this work will help to interpret the strength of evidence in forensic cases. As the power of genetic evidence is directly correlated to the exclusionary power generated by the local haplotype frequencies available [34], this dataset provides an additional forensic asset for a fallow deer mtDNA CR database.
Beyond forensic considerations, mtDNA data are indispensable tools in population management and conservation. They provide insights into genetic diversity, population structure, phylogeography, and lineage tracking. These insights help conservationists make informed decisions to restore healthy, genetically diverse populations, thereby enhancing the long-term viability of species.
5. Conclusions
Country-by-country differences in the location of polymorphic positions of the mtDNA control region necessitate the assessment of local populations in connection with the examination of the most informative sections. By increasing the number of tested sample elements, although it is possible to detect new haplotypes, the probability of exclusion remains largely the same. Based on the detected uneven haplotype frequencies, there can be a partial geographical separation between the surveyed stocks. However, the probability of random matching within the populations is still very high in Hungary. Consequently, it may be necessary to search for polymorphic sites on other mitochondrial gene sections or to sequence the entire mitogenome of fallow deer.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Chapman N.G. Chapman D.I. The distribution of fallow deer: A worldwide review Mammal Rev.1980106113810.1111/j.1365-2907.1980.tb 00234.x · doi ↗
- 2Pemberton J. Smith R. Lack of biochemical polymorphism in British fallow deer Heredity 19855519920710.1038/hdy.1985.924055416 · doi ↗ · pubmed ↗
- 3FaragóS. Köller J. Zoltán A. Természeti-Vadászati Örökségünk. A Legkiválóbb Magyar Vadásztrófeák Nimród Vadászújság Budapest, Hungary 2009
- 4Évi L.V. Törvény—Nemzeti Jogszabálytár 1996 Available online: https://njt.hu/jogszabaly/1996-55-00-00.49(accessed on 23 September 2023)
- 5Elek B.S. The criminalization of poaching in Hungary Zb. Rad.20195363964810.5937/zrpfns 53-20330 · doi ↗
- 6Szabolcsi Z. Egyed B. Zenke P. Padar Z. Borsy A. Steger V. Pasztor E. Csanyi S. Buzas Z. Orosz L. Constructing STR multiplexes for individual identification of Hungarian red deer J. Forensic Sci.2014591090109910.1111/1556-4029.1240324512288 · doi ↗ · pubmed ↗
- 7Zorkóczy O.K. Turi O. Wagenhoffer Z. Ózsvári L. Lehotzky P. Pádár Z. Zenke P. A Selection of 14 Tetrameric Microsatellite Markers for Genetic Investigations in Fallow Deer (Dama dama)Animals 202313208310.3390/ani 1313208337443886 PMC 10339914 · doi ↗ · pubmed ↗
- 8Alacs E.A. Georges A. Fitz Simmons N.N. Robertson J. DNA detective: A review of molecular approaches to wildlife forensics Forensic Sci. Med. Pathol.2010618019410.1007/s 12024-009-9131-720013321 · doi ↗ · pubmed ↗
