Association of the Transmembrane Protease Serine 6 rs855791 Variant and Nongenetic Factors With Iron Deficiency Among Female Medical Students in Yogyakarta: Protocol for a Case-Control Study
Fenty Fenty, Erna Kristin, Tri Ratnaningsih, Supanji Supanji, Jajah Fachiroh, Dwi Aris Agung Nugrahaningsih

TL;DR
This study explores how a genetic variant and non-genetic factors contribute to iron deficiency in female medical students in Indonesia.
Contribution
The study investigates the interaction between the TMPRSS6 rs855791 variant and nongenetic factors in iron deficiency among female medical students.
Findings
The study will determine the association of the TMPRSS6 rs855791 variant with iron deficiency.
It will assess the impact of sociodemographics, nutrition, and menstrual patterns on iron status.
Findings may inform strategies to reduce iron deficiency in this population.
Abstract
Iron deficiency is the most widespread nutritional deficiency worldwide, and it is the primary cause of anemia, particularly in low- and middle-income countries such as Indonesia. Iron deficiency has a multifactorial etiology involving complex interactions between genetic factors, especially the transmembrane protease serine 6 (TMPRSS6) rs855791 variant, which encodes matriptase-2, a protein involved in regulating hepcidin expression, and nongenetic factors, including sociodemographics, nutritional status, iron intake, and menstrual patterns. Women of reproductive age, including medical students, are susceptible to iron deficiency owing to unhealthy dietary habits, growth requirements, and menstruation. Iron deficiency among medical students may lead to decreased academic performance and productivity. Additionally, as future mothers, women may experience a heightened risk of delivering…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIron Metabolism and Disorders · Hemoglobinopathies and Related Disorders · Blood properties and coagulation
Introduction
Background
Iron deficiency is the most prevalent nutritional deficiency globally. It accounts for more than 60% of anemia cases [1,2]. This condition generally affects children younger than 5 years, pregnant women, and women of reproductive age, especially those in low- and middle-income countries (LMICs) [3,4]. The World Health Organization reports that anemia is most widespread in Africa and Southeast Asia [4]. A study conducted across 7 South and Southeast Asian countries found that more than half of women of reproductive age were affected by anemia [5]. Another study, from the Demographic and Health Survey of 46 LMICs, showed that the prevalence of anemia in pregnant (45.2%) and nonpregnant women (39.5%) was higher than the global prevalence [6]. According to the 2018 Indonesian Basic Health Research survey, the overall prevalence of anemia in Indonesia was 23.7% (women: 27.2%; men: 20.3%) [7].
Iron plays a crucial role in several essential biological processes in the body, including electron transfer, oxygen transport, cell growth, aerobic metabolism, DNA synthesis, and enzymatic processes, and functions in muscle and neurological development, nerve fiber myelination, and neurotransmitter production [8-10]. Iron deficiency can affect cognitive development, school achievements, work performance, and productivity [11-13]. Iron deficiency that occurs during essential phases of brain development can cause permanent structural damage [9]. Iron deficiency during pregnancy can result in detrimental effects on infants and children. Many women of reproductive age enter pregnancy with low iron levels and may have no iron reserves whatsoever [2,14].
Several factors contribute to iron deficiency, including environmental factors, nutritional deficiencies, iron intake, sociodemographics, social habits, diet, lifestyle, infections, and blood loss (eg, menstrual bleeding or childbirth among women) [6,15,16]. Studies have also reported the role of genetic variability in an individual’s iron status, accounting for approximately 20% to 30% of the variation [17-21]. Genome-wide association studies (GWASs) have demonstrated that single-nucleotide polymorphisms (SNPs) in certain genes are involved in hepcidin hormone regulation pathways that control iron homeostasis. Among these genes, loci on the transmembrane protease serine 6 (TMPRSS6) gene are frequently linked to low levels of iron and hematological indices. The most common SNP in the TMPRSS6 gene that is strongly associated with iron deficiency anemia is rs855791. This gene encodes matriptase-2, which interacts with and promotes the cleavage of hemojuvelin on the cell surface, leading to the release of soluble hemojuvelin and inhibiting hepcidin expression [17,18,21,22]. This gene polymorphism is linked to a reduced ability to decrease hepcidin production, thereby causing increased levels in the circulation despite iron deficiency. Furthermore, matriptase-2 dysfunction is associated with iron-refractory iron deficiency anemia [17,21,23].
Iron deficiency has a multifactorial etiology involving complex interactions between genetic and nongenetic factors. Although GWASs have identified risk alleles for complex genetic characteristics related to iron status, the detected risk alleles did not fully explain heritability. Moreover, the impact of allele frequency and biological adaptation varies across ethnicities owing to environmental and physiological factors. This complexity presents a challenge in effectively addressing the contributing factors of iron deficiency in the population [17,21].
In Indonesia, research on the association of genetic variations and nongenetic factors with iron deficiency remains limited, particularly in vulnerable populations, including female medical students, who are part of the representative population of women of reproductive age. They are susceptible to the risk of developing iron deficiency. Moreover, this group is particularly relevant to study owing to its more homogeneous characteristics. Despite having good knowledge of preventing iron deficiency anemia, they frequently practice unhealthy eating habits, including skipping breakfast, snacking, consuming fast food, and lacking fruit and vegetable intake, and unhealthy lifestyle habits, such as not engaging in physical activity due to heavy study loads and extracurricular activity schedules in college. Furthermore, they experience physiological menstrual blood loss and remain in a growth period that requires adequate nutritional intake [24,25].
This Study
Several studies in India, Pakistan, Iran, and Saudi Arabia have reported a significant rate of iron deficiency and anemia among female medical students, ranging from 16% to 63%, especially among those who live in dormitories and have a low or high BMI [24-29]. However, in Indonesia, only 1 study on female medical students with a sufficient socioeconomic background revealed a relatively low anemia prevalence of 13.6% [30]. The approach to managing iron deficiency has yet to fully consider genetic and nongenetic factors. Consequently, despite efforts to promote the distribution of iron supplements, the prevalence of iron deficiency remains high. Identifying the primary determinants of iron deficiency is crucial for more efficient management strategies. By recognizing these determinants, efforts can be made to increase the body’s iron reserves and provide appropriate prevention and treatment. Identifying these factors is also valuable in preparing adolescent girls for pregnancy to prevent and reduce the risk of maternal mortality, preterm delivery, low birth weight, and stunting. Therefore, our study aims to analyze the association of the TMPRSS6 rs855791 variant and nongenetic factors with iron deficiency in apparently healthy female medical students.
Methods
Study Design and Setting
This study will use a case-control design. Cases and controls will consist of medical students at the Faculty of Medicine, Public Health, and Nursing, Universitas Gadjah Mada in Yogyakarta, Indonesia. Participants will be selected using purposive sampling according to predefined inclusion and exclusion criteria. The study will specifically target final-year female undergraduate medical students (aged 18-24 years) before entering the clinical clerkship phase; who have provided written informed consent; are not pregnant; and have no history of chronic and inflammatory diseases, congenital diseases, hematological disorders, or blood transfusions within the previous 3 months. Participants with elevated levels of the inflammatory biomarker C-reactive protein (CRP; elevated level defined as >5 mg/L) will be excluded. Study participants will be classified into the control and case groups based on their serum ferritin (SF) and hemoglobin levels. A case-control ratio of 1:1 will be applied. The matching condition will be an age difference of less than 3 years. The control group will include participants with normal hemoglobin (≥12 g/dL) and SF (≥30 μg/L) levels. The case group will include participants with iron deficiency. Iron deficiency will be determined based on SF levels of less than 30 μg/L with or without anemia [10,31].
Sample Size
This study will be conducted from February 2025 to January 2026. The sample size will be determined using the unpaired categorical analytical research sample size formula with a confidence level of 95%, α of 5%, and research power of 80% (with β of 20%). The sample size calculation will use the proportion of exposed controls and the odds ratio of each independent variable for the risk of iron deficiency, yielding a minimum required sample size of 116 participants (58 cases and 58 controls). This sample size will ensure adequate statistical power to examine the influence of sociodemographic characteristics (type of residence and parental income), BMI, menstrual pattern, iron intake, and the TMPRSS6 rs855791 genetic variant on the risk of iron deficiency.
Data Collection and Analysis
We will use questionnaires for collecting data on the demographics, medical history, and socioeconomic information of the participants. The dietary iron intake of the participants will be assessed using the IRON Intake Calculation-Food Frequency Questionnaire, which has been modified and evaluated for validity and reliability [32]. This questionnaire contains information about the primary sources of iron, which are divided into 12 food groups, including meat, processed meat, eggs, fish, dairy products, cereal products, fruits, vegetables, tubers, oils, nuts, and chocolate products. For each food group included in the questionnaire, participants will be asked to report portion sizes and consumption frequency over the preceding week. The total weekly iron intake calculated from these responses will then be divided by 7 to estimate the mean daily iron intake. The NutriSurvey software will be used for analyzing iron intake [32,33]. Participants’ nutritional status will be measured using BMI. Menstrual patterns will be evaluated using a menstrual pictogram for 2 menstrual cycles. Menorrhagia is defined as menstrual bleeding exceeding 80 mL for each cycle [34,35].
Laboratory investigations will be performed as mentioned subsequently. In total, 7 mL of venous blood will be collected under aseptic conditions between 8 AM and 11 AM using 2 different tubes. A total of 3 mL of blood will be collected from each participant using an ethylenediaminetetraacetic acid tripotassium salt tube, with 2 mL of blood allocated for hemogram and reticulocyte profile analysis and 1 mL for genotyping. All laboratory analyses will be performed in accredited laboratories, strictly following the manufacturer’s instructions and under Good Clinical and Laboratory Practice conditions with internal quality controls to ensure accuracy and reliability. The hemogram and reticulocyte profile will be analyzed within 1 hour following sample collection using an automated hematology analyzer, Mindray BC-760 (Mindray Corporation). The remaining 4 mL will be collected into plain gel separator tubes and subsequently centrifuged. Serum samples will be stored at –20 °C for CRP and SF examinations and at –80 °C for serum hepcidin examinations. A cobas c 111 analyzer (particle-enhanced immunoturbidimetric assay; Roche Diagnostics), a VIDAS analyzer (enzyme-linked fluorescent assay; bioMérieux), and a sandwich human enzyme-linked immunosorbent assay kit (catalog number EH3221; Wuhan Fine Biotech Co, Ltd) will be used for CRP, SF, and serum hepcidin measurement.
Genotyping Procedure
Genomic DNA will be isolated from 1 mL of ethylenediaminetetraacetic acid–anticoagulated whole blood using a commercially available DNA extraction kit (gSYNC DNA Blood Mini kit; catalog number 51104; Geneaid Biotech Ltd), following the manufacturer’s protocol. The concentration of the extracted DNA will be assessed using a NanoDrop spectrophotometer (MaestroGen) to assess the A260/A280 and A260/230 absorbance ratios. All extracted DNA samples will be preserved at –80 °C. TMPRSS6 genotyping for the rs855791 polymorphism will be conducted using the TaqMan genotyping assay protocol (Applied Biosystems). PCR will be performed using a Taq DNA polymerase kit following the manufacturer’s instructions.
Data Analysis Plan
The collected data will be categorized into quantitative and qualitative data. SPSS Statistics (version 22; IBM Corp) will be used for the analysis. The Kolmogorov-Smirnov test will be used to assess whether the data follow a normal distribution. If the data are normally distributed, quantitative data will be presented as means and SDs; if not, the data will be presented as medians (minimum and maximum). Qualitative data will be presented as frequencies and percentages.
Parameters measuring hemogram profiles and iron status will be compared between the case and control groups using the independent samples 2-tailed t test or Mann**-**Whitney U test. Differences in genotype and allele frequencies between the case and control groups will be evaluated using the chi-square test. To determine the effects of individual SNPs on iron markers, comparison of iron status across the genotyping classes (AA, AG, and GG) will be performed using 1-way ANOVA or the Kruskal-Wallis test. Assessment of bivariate associations between each independent variable and the outcome (dependent variable) will be conducted using the chi-square or Fisher test. Risk factors with a P value of <.25 in the bivariate analysis will be further analyzed through multivariate logistic regression analysis.
All risk factors meeting the aforementioned selection criteria will be entered one by one in the multivariate analysis, starting with the most significant factors from the bivariate analysis. The selection of the final model equation will be based on parsimony, biological interpretability, and statistical significance (P≤.05). This final model equation will be assessed using the Hosmer-Lemeshow test and the area under the curve value obtained through receiver operating characteristic analysis. Subsequently, the results of this final equation will be used for predicting the probability of iron deficiency.
Ethical Considerations
The study protocol has been formally approved by the medical and health research ethics committee of the Faculty of Medicine, Public Health and Nursing, Universitas Gajah Mada in Yogyakarta, Indonesia (KE/FK/0114/EC/2025). Prior to enrollment, written informed consent will be obtained from all participants after they have been fully informed about the study objectives, procedures, potential risks and benefits, and the voluntary nature of their participation. To ensure confidentiality, all collected data will be securely stored with access restricted to authorized members of the research team. As a token of appreciation for their time, participants will receive a snack following the blood draw and reimbursement of Rp 50,000 (US $3.02) for transportation costs.
Results
Data collection commenced in February 2025 and is expected to conclude by October 2025. Data analysis is scheduled for completion by the end of 2025, with the publication of study findings anticipated in 2026.
Discussion
Anticipated Findings
Anemia remains a prevalent health issue in LMICs, with iron deficiency identified as its primary cause. Most anemia cases are preventable and manageable, provided that the underlying causes are identified. Consequently, recognizing the multifactorial etiology of anemia is crucial for developing effective and tailored therapeutic strategies [36]. We anticipate that this study will identify both genetic variations and nongenetic factors impacting the risk of iron deficiency.
Several GWASs have consistently demonstrated a strong correlation between iron status and variants in the TMPRSS6 gene, particularly the SNP rs855791, across various populations, predominantly in Asia. Despite the significant burden of iron deficiency in Asia, particularly in Southeast Asian nations such as Indonesia, studies focusing on unique gene variants associated with iron homeostasis in humans remain limited [18,21].
Studies by Hamed et al [37] and Elmahdy et al [38] revealed that individuals with iron deficiency or iron deficiency anemia exhibited a significantly increased frequency of the rs855791 variant (both heterozygous and homozygous) compared with the control group. Conversely, Jallow et al [20] and Varikuti et al [39] did not observe any association between this variant and the iron status. Shinta et al [19] investigated children aged 12 to 17 months in the Sasak community of East Lombok regency, West Nusa Tenggara province, Indonesia, and reported that iron intake had a stronger influence on SF levels than the rs855791 polymorphism. Building upon these findings, our study is anticipated to provide further evidence regarding the prevalence and potential role of the rs855791 gene variant in the Indonesian population, particularly among female medical students.
Research results from LMICs have indicated that nongenetic factors, including poor socioeconomic status, illiteracy, lack of formal education, an interpregnancy interval of less than 2 years, excessive menstrual bleeding, living in rural areas, the lack of physical activity, poor nutritional intake, low BMI, obesity, and insufficient knowledge regarding anemia, can all contribute to anemia, particularly iron deficiency–induced anemia [16,25,27,28,36,37]. Most existing research has focused on isolated risk factors, either genetic or nongenetic, with limited investigation into their combined effects. Only a few studies have explored the interaction between genetic factors and a single nongenetic factor. Shinta et al [19] investigated the relationship between TMPRSS6 gene polymorphism and iron intake on the risk of iron deficiency, whereas Pei et al [22] reported that certain genetic variations have a protective role against iron deficiency anemia among women with menorrhagia. The occurrence of iron deficiency involves a multifactorial interaction between genetic and nongenetic determinants. We aim to identify the interplay between these factors and susceptibility to iron deficiency in the specific context of female medical students. A comprehensive understanding of both genetic and nongenetic contributors will support the development of targeted and evidence-based therapeutic approaches informed by the identified causal factors.
Strengths and Limitations
A notable strength of this study lies in its comprehensive methodology for investigating both genetic and nongenetic factors associated with iron deficiency among female medical students. Iron intake, nutritional status, menstrual patterns, and sociodemographic characteristics will be the nongenetic variables assessed, whereas the genetic aspect will focus on the rs855791 variant. In addition, serum hepcidin levels, a key regulator of iron homeostasis associated with the rs855791 variant and other iron-related biomarkers, will be measured. Despite these strengths, several limitations should be acknowledged. First, hemoglobin electrophoresis will not be performed in the case group, thereby limiting the ability to detect coexisting conditions, such as thalassemia trait, in participants with iron deficiency. Second, this study will not include measurements of vitamin D levels. Given vitamin D’s role in regulating hepcidin expression, its deficiency may lead to elevated hepcidin levels, thereby reducing iron bioavailability and contributing to iron deficiency [40,41]. Third, the recruitment of participants from a single academic institution may limit the external validity of the findings, potentially constraining their generalizability to the broader population of women in Indonesia. Nevertheless, the conceptual framework and integrated methodological approach addressing both genetic and nongenetic factors may meaningfully contribute to community-based iron deficiency screening strategies, particularly for young, educated women in urban areas.
Further research is recommended to address this study’s limitations, including the assessment of hemoglobin electrophoresis and vitamin D levels, which may affect the accuracy of iron deficiency evaluation, by expanding recruitment to a larger, multicenter scale. Moreover, subsequent studies should investigate the efficacy of treatment approaches for iron deficiency based on the causal determinants identified in this study.
Conclusions
Our study aims to elucidate how genetic and nongenetic factors contribute to the risk of iron deficiency in women of reproductive age, especially female medical students in Indonesia. The results are expected to provide valuable insights into the primary determinants contributing to iron deficiency in this population.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Anaemia in women and children World Health Organization 2024-07-24 https://www.who.int/data/gho/data/themes/topics/anaemia_in_women_and_children
- 2Derman RJ Patted A Overview of iron deficiency and iron deficiency anemia in women and girls of reproductive age Int J Gynaecol Obstet 20230804162 Suppl 2S 2788210.1002/ijgo.149503753801337538013 · doi ↗ · pubmed ↗
- 3Kassebaum NJ Jasrasaria R Naghavi M Wulf SK Johns N Lozano R Regan M Weatherall D Chou DP Eisele TP Flaxman SR Pullan RL Brooker SJ Murray CJA systematic analysis of global anemia burden from 1990 to 2010 Blood 20140130123561524 10.1182/blood-2013-06-50832524297872 S 0006-4971(20)36066-324297872 PMC 3907750 · doi ↗ · pubmed ↗
- 4Accelerating anaemia reduction: a comprehensive framework for action World Health Organization 20235122024-07-24 https://www.who.int/publications/i/item/9789240074033
- 5Sunuwar DR Singh DR Chaudhary NK Pradhan PM Rai P Tiwari K Prevalence and factors associated with anemia among women of reproductive age in seven South and Southeast Asian countries: evidence from nationally representative surveys P Lo S One 2020813158 e 0236449 10.1371/journal.pone.023644932790764 PONE-D-20-1036532790764 PMC 7425935 · doi ↗ · pubmed ↗
- 6Alem AZ Efendi F Mc Kenna L Felipe-Dimog EB Chilot D Tonapa SI Susanti IA Zainuri A Prevalence and factors associated with anemia in women of reproductive age across low- and middle-income countries based on national data Sci Rep 2023112013120335 10.1038/s 41598-023-46739-z 3799006910.1038/s 41598-023-46739-z 37990069 PMC 10663544 · doi ↗ · pubmed ↗
- 7Kemenkes RI Laporan Nasional Riskesdas 2018 Health Research and Development Agency, Government of Indonesia 20192025-05-29Lembaga Penerbit Badan Penelitian dan Pengembangan Kesehatan https://repository.badankebijakan.kemkes.go.id/id/eprint/3514/
- 8Yehuda S Mostofsky DI Iron Deficiency and Overload: From Basic Biology to Clinical Medicine 2010 Cham, Switzerland Springer
