Predicting breast cancer risk and its association to biopsychosocial factors among Taiwanese women with a family history of breast cancer: an investigation based on the Gail model
Sabiah Khairi, Nur Aini, Lalu Muhammad Harmain Siswanto, Min-Huey Chung

TL;DR
This study uses the Gail model to assess breast cancer risk in Taiwanese women with a family history and finds that biopsychosocial factors are linked to higher risk, especially in older women.
Contribution
The study is one of the first to investigate biopsychosocial factors associated with breast cancer risk in Taiwanese women with a family history using the Gail model.
Findings
Women with a family history of breast cancer had significantly higher risk scores (p < 0.001).
Six biopsychosocial factors were significant predictors for women under 50, and seven for those 50 and older.
Older women showed more consistent and significant associations between biopsychosocial factors and breast cancer risk.
Abstract
First-degree relatives with breast cancer have a two-fold higher risk than women without a family history. The Gail model approach has been employed in numerous studies to investigate the risk of breast cancer among women in a variety of countries. Nevertheless, the studies investigating the correlation between the level of breast cancer risk and biopsychosocial factors among Taiwanese women with a family history of breast cancer (FHBC) are limited. By using the Gail model, we explored the breast cancer risk score and its relationship to biopsychosocial factors among Taiwanese women with FHBC. The present study was a cross-sectional study from secondary data of the Taiwan Biobank from 2008 to 2018. Self-reports were conducted to determine biopsychosocial factors. A total of 3,060 women aged 35–70 years with and without FHBC were considered eligible for enrollment. The Gail model, which…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1- —Institution of Biomedical Sciences, Academia Sinica
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCancer Risks and Factors · Global Cancer Incidence and Screening · BRCA gene mutations in cancer
Background
Breast cancer remains one of the most prevalent cancers worldwide, with first-degree relatives having a two-fold higher risk than those without a family history. According to the World Health Organization (WHO), 2.3 million (11.7% of total cases) women were diagnosed with breast cancer in 2020, and 685,000 (6.9%) deaths globally [1]. In 2022, The global cancer burden using the GLOBOCAN reported lung cancer was the most commonly detected cancer, accounting for about 2.5 million new cases, which is equivalent to one out of every eight cancers worldwide (12.4% of all global cancers). It was followed by breast cancer (11.6%), colorectal cancer (9.6%), prostate cancer (7.3%), and stomach cancer (4.9%). Among women, breast cancer was the most prevalent form of cancer, accounting for both the highest number of cases and deaths. It was responsible for around 6.9% of all breast cancer-related fatalities [2]. In Taiwan, the Top 10 cancer incidence rate per 100,000 for carcinoma in situ and invasive cancers in 2021 was female breast with a total of 18,182 cases. resulting in approximately 2,913 fatalities [3]. In addition, breast cancer is frequently found among younger Taiwanese women aged 40–65, with stages 0 to II being the most prevalent at diagnosis [4, 5].
It is important to note that, each individual is a combination of biological, psychological, and social components that are more than the sum of their parts. Biopsychosocial factors which influence internal micro-environmental cues also contribute to the development of these alterations that are responsible for tumorigenesis [6]. Several biopsychosocial factors are associated with the development of breast cancer, including sociodemographic [7, 8], reproductive characteristics [9], lifestyle [10, 11], psychological stress, coping ability, and family history of breast cancer (FHBC) [7, 8]. According to age-standardized incidence and mortality data from the United Kingdom, where the incidence is highest in the world, women aged 50 suffer from the disease at a rate of approximately two per 1,000 per year, and it is the leading cause of death for women in the 40–50 age bracket [12]. On the other hand, breast cancer is uncommon in young women, occurring in only 4–6% of those under 40. However, it is still the most frequently diagnosed malignancy in this age group. In recent years, there has been a noticeable rise in BC cases among pre-menopausal individuals [13]. Young women are more likely to develop tumors that exhibit a higher prevalence of unfavorable clinicopathologic characteristics, such as higher histological grade, increased lymph node involvement, lower estrogen receptor (ER) positivity, and elevated Her2/neu overexpression. Additionally, they are often diagnosed at more advanced stages of the disease [13].
In relation to the family history, this factor is a well-established biological risk factor for breast cancer that contributes significantly to the risk of the disease (odds ratio: 1.71, 95% CI: 1.59–1.84) [9]. A different investigation, employing a substantial group of patients, revealed that women with two or more relatives who have previously had breast cancer have a 2.5-fold higher risk (95% CI: 1.83–3.47) of developing the disease [14]. These findings collectively indicate that older age and a family history of breast cancer as a crucial factor in breast cancer prevention screening [12, 15].
As previously pointed out, breast cancer is a malignant disease that primarily affects women and can be effectively prevented with early detection strategies. Early detection techniques can include screening tests, and breast self-examination to identify life-threatening diseases at their initial stages [16]. Over the last thirty years, numerous statistical models have emerged to evaluate breast cancer risk in both individuals and populations [17]. The Gail model discovered by Gail et al. [18] stands out as the most widely used for overall risk evaluation, capable of assessing a range of potential risk factors for both five-year and lifetime invasive breast cancer risk [18, 19]. Present models have the capability to provide a reasonably precise prediction of the likelihood of developing breast cancer, and they can adequately advise women if they possess an elevated risk over their lifetime [20]. The National Cancer Institute has created the Breast Cancer Risk Assessment Tool (BCRAT), an online calculator that has been validated to predict the risk of invasive breast cancer for women in the United States. This tool is based on the Gail Model, which has been validated in large samples specifically for white women [16]. The inclusion of family history is a crucial component of this instrument, as it is in most other breast cancer screening techniques. Women who receive a positive screening result might benefit from being referred to a specialized breast center for genetic counseling, additional testing, and treatment [16].
Numerous studies have investigated breast cancer risk based on the Gail model approach in women of reproductive age in Nigeria [21], among secondary school teachers in Saudi women [17], patients diagnosed with non-proliferative lesions who have had a breast biopsy in Iraq [22], and patients with invasive breast carcinoma in USA, UK, Italy, and Sweden [23]. A study by Zhang et al. [24], evaluated the accuracy of the Gail model and the Tyrer-Cuzick model in predicting breast cancer risk among Chinese women. The study demonstrated that both models exhibited reasonable predictive performance, supporting their applicability in Chinese populations. Given the genetic, epidemiological, and lifestyle similarities between Chinese and Taiwanese populations, these findings suggest that the Gail model may also be a valuable tool for breast cancer risk assessment in Taiwan. However, further empirical validation studies within the Taiwanese population are necessary to confirm its predictive accuracy and clinical utility. Furthermore, recent data indicate an alarming increase in early-onset breast cancers globally, including significant trends observed in Taiwan. This study aims to address these critical gaps by exploring the breast cancer risk score using the Gail model and its relationship to biopsychosocial factors particularly among different age groups in this high-risk population.
Methods
Study population
The present study was a cross-sectional study from secondary data of the Taiwan Biobank in 2008 to 2018. The Taiwan Biobank is a government-funded research project that gathers extensive phenotypic measures and genetic data from the Taiwanese population in order to create a prospective cohort study [25]. The recruitment process for Taiwan Biobank involves two components: a community-based arm and a hospital-based collection. The community-based branch has been enrolling individuals between the ages of 30 and 70, who have not been previously diagnosed with cancer, from over 30 recruitment locations around Taiwan [25]. The distribution of these sites is based on the population density of various counties and cities. During the recruitment process, participants were required to give written informed consent and had their first data collected by questionnaires, physical examination, and tests on their blood and urine samples [25].
This study included Taiwanese women who were cancer-free at the time of enrollment from the local population in a community setting that enrolls participants from more than 30 locations in Taiwan. Women between the ages of 35 and 70, both with and without a family history of breast cancer from first-degree relatives, were eligible for enrollment. In total, 149 individuals were excluded from the current investigation because they were under the age of 35.; hence the data for 3,060 healthy women were included in this study.
Ethical consideration
The investigation was conducted using the Taiwan Biobank data. Within the Taiwan Biobank, the application will undergo rigorous scientific and ethical evaluations conducted by external experts in the corresponding scientific disciplines, as well as the Ethics and Governance Committee (EGC) of Taiwan Biobank [25]. The current study gained ethical approval from the Taipei Medical University Joint Institutional Review Board (No. N201804027). All procedures performed in this study involving human participants were conducted in accordance with Declaration of Helsinki.
Data collection
In the Taiwan Biobank, phenotypic data primarily relies on self-reporting methods. At recruitment, participants provided written informed consent and had their baseline data collected through questionnaires [25]. A structured questionnaire administered during interviews gathered data on demographic traits, lifestyle choices, environmental exposures, family medical history, and health status [25]. In the current study, we focused on exploring the questionnaire encompassed three main categories related to breast cancer risk factors. Firstly, biological factors such as age [7], BMI, reproductive characteristics (age at menarche, having pregnancy, age at first pregnancy, birth experience, age at first life birth, breastfeeding practice and menopause) [9], and first-degree relatives with breast cancer (mother and/or sister) [8, 26]. Secondly, psychological and behavioral factors like depression status (yes/no) alcohol consumption (never or occasional/ not recently/ recently), smoking habits (yes/no), exposure to secondhand smoke (yes/no), and engagement in physical exercise (yes/no) [10, 11]. Lastly, social factors include marital status (unmarried/married), educational level (primary school/ high school above), and dependency status (living alone/not living alone) [7, 8].
The Gail Model employed to assess risk in our research is a software designed by the National Cancer Institute (NCI) in the United States and has been tested in large populations which provide an accurate estimate of breast cancer risk [22, 27]. This study employed the Breast Cancer Risk Assessment Tool (BCART) created by NCI. The online risk calculator can be accessible through the website https://bcrisktool.cancer.gov/calculator.html. This calculator collects data on various factors including the individual’s age (35–85 years old), age at the onset of menstruation (7–11, 12–13, and ≥ 14 years), age at the first live birth of a child (unknown, no children, < 20, 20–24, 25–29, and ≥ 30 years), number of first-degree relatives (mother, sisters, daughters) with breast cancer, number of previous breast biopsies (whether positive or negative), and presence of atypical hyperplasia in a biopsy (unknown, yes, no) [27]. Females with a Gail Model score exceeding 1.66% were classified as high-risk, while those below this threshold were labeled as low-risk of breast cancer over the next 5 years [22, 28].
Statistical analysis
We utilized the Statistical Package for the Social Sciences (SPSS), version 27.0 (SPSS Inc., Chicago, IL, USA) to conduct our statistical analysis. Intergroup significance was assessed using Chi-square and Fisher test for categorical. In the current study, a Logistic regression test was performed using the entry approach to evaluate the magnitude of the association between the statistically significant biopsychosocial factors and the level of breast cancer risk. The association was measured using odds ratios (OR) along with a 95% confidence interval (CI). P-values below 0.05 are commonly used to indicate statistical significance and were designed as potential risk factors for breast cancer. All analyses were adjusted for potential factors and stratified for age. Furthermore, we utilized the logistic regression model to present the Area Under the Curve (AUC). The curve will provide a clearer understanding of how the biopsychosocial factors enhance predictive accuracy along with relevant statistical details and visual representations of the Receiver Operating Characteristic (ROC) curves. The classification of AUC values were interpreted as excellent (0.9 ≤ AUC), considerable (0.8 ≤ AUC < 0.9), fair (0.7 ≤ AUC < 0.8), poor (0.6 ≤ AUC < 0.7), and fail (0.6 ≤ AUV < 0.5) respectively [29].
Results
Table 1 presents the distribution of breast cancer risk levels among the participants, as determined by the Gail model. Based on the statistics, 34.2% of the female population in the FHBC group (574 out of 1,676) were found to have a high probability of developing breast cancer, and 495 of these individuals belong to women with FHBC ≥ 50 years old. The determination was made based on a score exceeding 1.66%, which is the standard threshold for identifying high risk in the Gail model. Meanwhile, all of the women in the non-FHBC group (1,384 individuals) were categorized as having a low probability of developing breast cancer. Furthermore, there was a statistically significant difference in breast cancer risk scores between the FHBC and non-FHBC groups (p < 0.001).
Table 1. The distribution of participants according to the level of the breast cancer risk score based on the Gail model stratified by age (n = 3060)Level of BC Risk All women (n = 3060)
Women < 50 years old (n = 1237)
Women ≥ 50 years old (n = 1823) FHBCn (%)Non-FHBCn (%)p-valueFHBCn (%)Non-FHBCn (%)p-valueFHBCn (%)Non-FHBCn (%)p-valueGail model score: High-risk (≥ 1.66%)574 (34.2)0 (0.0) < 0.001 79 (10.8)0 (0.0) < 0.001 495 (52.5)0 (0.0) < 0.001 Low-risk (< 1.66%)1,102 (65.8)1,384 (100)654 (89.2)504 (100)448 (47.5)880 (100) Total1,676 (100)1,384 (100)733 (100)504 (100)943 (100)880 (100)BC: breast cancer; FHBC: family history of breast cancer
The study examined personal information factors to assess the risk of developing invasive breast cancer. The results showed that there were significant associations between age at first live birth, breast cancer in mother and sister, and the presence of FHBC stratified with age. The p-values for these associations were < 0.001, < 0.001, and < 0.001, respectively (Table 2). However, two out of the six variables related to personal information (prior breast biopsy and presence of atypical hyperplasia in biopsy) were unknown due to the certainty that all participants were healthy women without breast cancer. Therefore, Taiwan Biobank is unable to offer the data.
Table 2. The distribution of participants according to the personal information components of the Gail model, stratified by age (n = 3060)Variables All women (n = 3060)
Women < 50 years old (n = 1237)
Women ≥ 50 years old (n = 1823) FHBCn (%)Non-FHBCn (%)p-valueFHBCn (%)Non-FHBCn (%)p-valueFHBCn (%)Non-FHBCn (%)p-valueAge at menarche 7–11177 (10.6)156 (11.3)0.28987 (11.9)78 (15.5)0.05590 (9.5)78 (8.9)0.881 12–13768 (45.8)595 (43.0)377 (51.4)228 (45.2)391 (41.5)367 (41.7) ≥ 14731 (43.6)633 (45.7)269 (36.7)198 (39.3)462 (49.0)435 (49.4)Age at 1st live birth No birth134 (8.0)215 (15.5) < 0.001 87 (11.9)114 (22.6) < 0.001 47 (5.0)101 (11.5) < 0.001 < 20221 (13.2)50 (3.6)112 (15.3)10 (2.0)109 (11.6)40 (4.5) 20–351,273 (76.0)1,089 (78.7)502 (68.5)366 (72.6)771 (81.8)723 (82.2) > 3548 (2.8)30 (2.2)32 (4.3)14 (2.8)16 (1.7)16 (1.8)First-degree relatives: Breast cancer mother No846 (50.5)1,384 (100.0) < 0.001 300 (40.9)504 (100.0) < 0.001 546 (57.9)880 (100.0) < 0.001 Yes830 (49.5)0 (0.0)433 (59.1)0 (0.0)397 (42.1)0 (0.0) Breast cancer sisters No801 (47.8)1,384 (100.0) < 0.001 421 (57.4)504 (100.0) < 0.001 380 (40.3)880 (100.0) < 0.001 Yes875 (52.2)0 (0.0)312 (42.6)0 (0.0)563 (59.7)0 (0.0)Previous breast biopsy Yes0 (0.0)0 (0.0)N/A0 (0.0)0 (0.0)N/A0 (0.0)0 (0.0)N/A No0 (0.0)0 (0.0)0 (0.0)0 (0.0)0 (0.0)0 (0.0) Unknown1,676 (100.0)1,384 (100.0)807 (100.0)540 (100.0)869 (100.0)844 (100.0)Presence of Atypical hyperplasia in biopsy Yes0 (0.0)0 (0.0)N/A0 (0.0)0 (0.0)N/A0 (0.0)0 (0.0)N/A No0 (0.0)0 (0.0)0 (0.0)0 (0.0)0 (0.0)0 (0.0) Unknown1,676 (54.8)1,384 (45.2)807 (100.0)540 (100.0)869 (100.0)844 (100.0)
Table 3. Association between biopsychosocial factors and level of breast cancer risk, stratified by age (n = 3060)Variables All women (n = 3060)
Women < 50 years old (n = 1237)
Women ≥ 50 years old (n = 1823) Low-riskn (%)High-riskn (%)p-valueLow-riskn (%)High-riskn (%)p-valueLow-riskn (%)High-riskn (%)p-value Biological factors BMI < 302,339 (94.1)541 (94.3)0.8801,088 (94.0)75 (94.9)0.7221,251 (94.2)466 (94.1)0.961 ≥ 30147 (5.9)33 (5.7)70 (6.0)4 (5.1)77 (5.8)29 (5.9)Having pregnancy No335 (13.5)54 (9.4) 0.008 228 (19.7)4 (5.1) 0.001 107 (8.1)50 (10.1)0.167 Yes2,151 (86.5)520 (90.6)930 (80.3)75 (94.9)1,221 (91.9)445 (89.9)Age at 1st pregnancy No pregnant335 (13.5)54 (9.4) < 0.001 228 (19.7)4 (5.1) < 0.001 107 (8.1)50 (10.1) < 0.001 < 20327 (13.1)20 (3.5)138 (11.9)4 (5.1)189 (14.2)16 (3.3) 20–351,787 (71.9)485 (84.5)767 (66.2)67 (84.8)1,020 (76.8)418 (84.4) > 3537 (1.5)15 (2.6)25 (2.2)4 (5.1)12 (0.9)11 (2.2)Birth experience No408 (16.4)67 (11.7) 0.005 278(24.0)4 (5.1) < 0.001 130 (9.8)63 (12.7) 0.070 Yes2,078 (83.6)507 (88.3)880 (76.0)75 (94.9)1,198 (90.2)432 (87.3)Breastfeeding practice Never194 (7.8)66 (11.5) 0.014 164 (14.2)4 (5.1) 0.009 30 (2.3)62 (12.5) < 0.001 No886 (35.6)203 (35.4)322 (27.8)16 (20.3)564 (42.5)187 (37.8) Yes1,406 (56.6)305 (53.1)672 (58.0)59 (74.7)734 (55.2)246 (49.7)Menopause No1,343 (54.0)172 (30.0) < 0.001 1,072 (92.6)71 (89.9)0.381271 (20.4)101 (20.4)0.999 Yes1,143 (46.0)402 (70.0)86 (7.4)8 (10.1)1,057 (79.6)394 (79.6) Psychological and behavioral factors Depression status No2,356 (94.8)548(95.5)0.4921,106 (95.5)76 (96.2)0.7721,250 (94.1)472 (95.4)0.308 Yes130 (5.2)26 (4.5)52 (4.5)3 (3.8)78 (5.9)23 (4.6)Alcohol consumption Never / occasionally2,427 (97.6)561 (97.7)0.0911,129 (97.5)77 (97.5)0.7721,298 (97.7)484 (97.8) 0.065 Not recently17 (0.7)0 (0.0)6 (0.5)0 (0.0)11 (0.8)0 (0.0) Recently42 (1.7)13 (2.3)23 (2.0)2 (2.5)19 (1.5)11 (2.2)Smoking No2,244 (90.3)516 (89.9)0.7881,016 (87.7)58 (86.1)0.6641,128 (92.5)448 (90.5)0.171 Yes242 (9.7)58 (10.1)142 (12.3)11 (13.9)100 (7.5)47 (9.5)Secondhand smoke No2,290 (92.1)553 (96.3) < 0.001 1,048 (90.5)77 (97.5) 0.037 1,242 (93.5)476 (96.2) 0.032 Yes196 (7.9)21 (3.7)110 (9.5)2 (2.5)86 (6.5)19 (3.8)Exercise No1,483 (59.7)301 (52.4) 0.002 851 (73.5)60 (75.9)0.631632 (47.6)241 (48.7)0.677 Yes1,003 (40.3)273 (47.6)307 (26.5)19 (24.1)696 (52.4)254 (51.3) Social factors Marital Status Unmarried300 (12.1)44 (7.7) 0.003 220 (19.0)5 (6.3) 0.005 80 (6.0)39 (7.9)0.154 Married2,186 (87.9)530 (92.3)938 (81.0)74 (93.7)1,248 (94.0)456 (92.1)Educational level Primary school294 (11.8)43 (7.5) 0.003 45 (3.9)0 (0.0)0.074249 (18.8)43 (8.7) < 0.001 High school or above2,191 (88.2)531 (92.5)1,113 (96.1)79 (100.0)1,079 (81.3)452 (91.3)Job experience No1,720 (69.2)440 (76.7) < 0.001 813 (70.2)56 (70.9)0.898907 (68.3)384 (77.6) < 0.001 Yes766 (30.8)134 (23.3)345 (29.8)23 (29.1)421 (31.7)111 (22.4)Dependency Not live alone2,250 (90.5)528 (92.0)0.2691,083 (93.5)77 (6.5)0.1601,167 (87.9)451 (91.1)0.052 Live alone236 (9.5)46 (8.0)75 (6.5)2 (2.5)161 (12.1)44 (8.9)
Table 4. Evaluation the risk of breast cancer according to the biopsychosocial factors stratified by ageVariablesOutcome: High-RiskWomen < 50 years oldWomen ≥ 50 years oldOR (95% CI)p-valueOR (95% CI)p-value Biological factors Having pregnancy YesRef 0.001 Ref0.122 No0.19 (0.07 to 0.53)1.33 (0.93 to 1.89)Age at 1st pregnancy < 35Ref0.055Ref 0.021 ≥ 353.11 (0.97 to 9.89)2.69 (1.16 to 6.27)Birth experience YesRef < 0.001 Ref 0.044 No0.15 (0.06 to 0.41)1.39 (1.01 to 1.93)Breastfeeding practice YesRefRef No0.70 (0.42 to 1.18)0.1821.04 (0.83 to 1.30)0.729 Never0.62 (0.17 to 2.18)0.4527.65 (4.67 to 12.54) < 0.001 Menopause NoRef 0.040 Ref Yes1.89 (1.03 to 3.49)0.91 (0.69 to 1.19)0.504 Psychological and behavioral factors Secondhand smoke NoRef 0.047 Ref0.052 Yes0.23 (1.16 to 6.27)0.59 (0.34 to 1.00)Exercise YesRef0.599Ref0.904 No1.15 (0.68 to 1.95)1.01 (0.82 to 1.26) Social factors Marital Status MarriedRef0.820Ref0.135 Unmarried0.78 (0.09 to 6.37)1.39 (0.90 to 2.14)Educational level Primary schoolRef0.122Ref < 0.001 High school or above4.79 (0.66 to 34.6)2.43 (1.72 to 3.43)Job experience YesRef0.540Ref < 0.001 No1.18 (0.70 to 1.97)1.58 (1.23 to 2.03)
The biopsychosocial characteristics of the study population are presented individually in Table 3. Among the six biological factors analyzed, five demonstrated significant variations in breast cancer risk among women in both groups: pregnancy status (p = 0.008), age at first pregnancy (p < 0.001), birth experience (p = 0.005), breastfeeding practices (p = 0.014), and menopausal status (p < 0.001). Additionally, psychological and behavioral factors, including exposure to secondhand smoke (p < 0.001), and exercise habits (p = 0.002) exhibited significant differences across all groups rather than within individual groups. Social factors, such as marital status (p = 0.003), educational level (p = 0.003), and job experience (p < 0.001), showed significant differences across all group.
Logistic regression analysis was conducted to examine the substantial predictor of biopsychosocial factors that showed a p-value of less than 0.05 on the bivariate analysis. 10 out of 15 variables were selected and taken into consideration in the logistic regression model. Based on Table 4 indicates that several factors serve as potential predictors of breast cancer risk in the older group, including age at first pregnancy (OR: 2.69; 95% CI: 1.16–6.27; p = 0.021), birth experience (OR: 1.39; 95% CI: 1.01–1.93; p = 0.044), breastfeeding practice (OR: 7.65; 95% CI: 4.67–12.54; p < 0.001), educational level (OR: 2.43; 95% CI: 1.72 to 3.43; p < 0.001), and job experience (OR: 1.58; 95% CI: 1.23–2.03; p < 0.001).
At the same time, significant predictors of breast cancer risk in the younger group include pregnancy status (OR: 0.19; 95% CI: 0.07–0.53; p = 0.001), birth experience (OR: 0.15; 95% CI: 0.06–0.41; p < 0.001), menopausal status (OR: 1.89; 95% CI: 1.03–3.49; p = 0.040), and exposure to secondhand smoke (OR: 0.23; 95% CI: 1.16–6.27; p = 0.047). However, there appears to be an inconsistency in the odds ratio (OR) values for significant factors in younger women, particularly pregnancy status, birth experience, and exposure to secondhand smoke, which seem to function as protective factors rather than predictive risk factors.
Fig. 1. Receiver operating characteristic (ROC) curve analysis for the testing data. (A) ROC curve for women aged ≥ 50 years, with an AUC score of 0.633. (B) ROC curve for women aged < 50 years, with an AUC score of 0.648. AUC: Area Under the Curve
Our study conducted sensitivity and specificity analyses using logistic regression, represented by the Receiver Operating Characteristic (ROC) curve, to assess the accuracy of biopsychosocial factors in predicting breast cancer risk across different age groups. As shown in Fig. 1, the biopsychosocial factors demonstrated poor discriminatory ability in predicting breast cancer risk in both age groups, with AUC scores of 0.633 and 0.648, respectively.
Discussion
Breast cancer in Taiwan is the most common form of cancer in Taiwanese women and the most frequent cause of cancer-related death [5]. The result of the current study indicates that 34.2% of Taiwanese women with a family history of breast cancer have a high risk of developing breast cancer over the next five years. Moreover, the most important personal information and biopsychosocial factors associated with breast cancer risk, particularly in older women as a high-risk population include age at first live birth, history of breast cancer mother, breast cancer sister, birth experience, breastfeeding practice, educational level, and job experience. Therefore, it’s crucial for both patients and healthcare providers to accurately evaluate an individual’s risk of breast cancer in order to decide on the potential use of prevention strategies for high-risk women. Understanding that women at elevated risk of breast cancer require significant assistance in healthcare decision-making and embracing the impact of various prevention strategies is essential.
Different methods have been created to assess risk, with the Gail model emerging as a commonly utilized tool. This research aimed to assess the reliability of the Gail model in predicting breast cancer development among Taiwanese individuals with a family history of the disease, incorporating individual and additional risk factors. This approach enables prompt risk evaluation for breast cancer during patient examinations, facilitating timely follow-up and treatment planning, contingent upon meeting specific criteria. The primary focus lies in identifying individuals at heightened risk of breast cancer [27].
Breast cancer is a prevalent concern among women aged 45 and older. According to research conducted by Rojas et al. [19] on breast cancer epidemiology and risk factors, the majority of female breast cancer cases are diagnosed in the age range of 55 to 64, with a median diagnosis age of 61 years. Less than 5% of breast cancer diagnoses occur in women under 40 years old, and similar to many other types of cancer, the risk rises with increasing age [30]. In addition, our study established a family history of breast cancer as a significant risk factor for breast cancer. This discovery aligns with previous research, which found that having one first-degree relative with breast cancer is linked to a 1.8-fold increase in risk while having two first-degree relatives with breast cancer is associated with a 2.9-fold higher risk [31, 32].
Aging is accompanied by a considerable reduction of several mechanisms in women’s bodies including estrogen hypersensitivity and mammary epithelial cell changes [33], . Elderly women exhibit markedly decreased levels of circulating estrogens, despite a greatly elevated risk of hormone-dependent cancer. The seeming a contradiction can be elucidated by profound cellular and molecular transformations that take place in the mammary gland following menopause [33]. The process of aging also results in alterations to the breast tissue, specifically the atrophy of ductal-lobular tissue, which occurs at the expense of the connective and adipose stroma. Russo et al. [34] discovered that following menopause, differentiated lobules experiences regress and become undifferentiated lobules similar to those observed before puberty. Additionally, it induces changes in the system responsible for repairing double-stranded DNA by suppressing the ATM protein (ataxia telangiectasia mutated) pathway, potentially enhancing its ability to transform cells [35].
Considering reproductive characteristics, we discovered that, age at first live birth, birth experience, and breastfeeding practice, as significant predictors associated with breast cancer development. Research indicates that giving birth at a later age, specifically > 30 years old, is associated with an increased risk of breast cancer [36, 37]. Albreksten et al. [37] conducted a study revealing that the risk of breast cancer rises with advancing age at first childbirth, beyond what could be attributed to the delayed protective effects of pregnancy, such as the decreasing risk over time since giving birth. Additionally, they observed that the temporary surge in risk shortly after childbirth was most pronounced following a late first childbirth [37]. A prior study shown that a prolonged period between the onset of menstruation and the first occurrence of a live birth also raises the risk of developing breast cancer [38]. The breast tissue is mostly undifferentiated and is sensitive to the mitogenic actions of estrogen receptor (ER) and progesterone receptor (PR) at this time. As a result, it is particularly vulnerable to carcinogens [38, 39]. The period from the onset of menstruation until the first life of birth has been referred to as a “period of high vulnerability” [40]. A reduced time period between the onset of menstruation and the first life of giving birth is associated with elevated quantities of hormones produced within the body [41].
Prior research has demonstrated that mothers who do not breastfeed are at a high risk of reproductive cancers. Specifically, breast, ovarian, and uterine cancers are more prevalent among women who opt not to breastfeed [42]. Conversely, breastfeeding has been linked to a decreased risk of breast cancer. This protective effect appears to be more pronounced among women who breastfeed for extended durations throughout their lives [43]. For instance, women who breastfed for a minimum of 24 months over their lifetimes exhibited a significantly lower risk of developing breast cancer compared to those who breastfed for fewer than 24 months. When compared to women who breastfed for a total of 0–11 months during their lifetimes, there was a 66.3% reduction in breast cancer risk among those who breastfed for 12–23 months, an 87.4% reduction among those who breastfed for 24–35 months, and a 94% reduction among those who breastfed for 36–47 months [42]. A study conducted in the Taiwanese population showed that breastfeeding for more than 3 years displayed significant protective effects against breast cancer [44].
The periodic impact of estrogen and progesterone on breast tissue can be delayed by prolonged breastfeeding or an increased number of pregnancies [45]. Prolonged breastfeeding decreases the likelihood of developing triple-negative malignancies [46]. The term can be described as a strong correlation between slow-paced weaning and mitigation of inflammatory response throughout the involution. Two distinct types of lobules are observed in the breast tissue of mothers undergoing progressive weaning, indicating a gradual replacement of the terminal duct lobular units (TDLUs). One type of cell was actively producing milk, while the other was involution, as indicated by infiltration of CD45-positive acute reactive immune cells and CD68-positive macrophages [45, 47].
Our research demonstrated a correlation between menopause and an increased chance of developing breast cancer in the younger age group. A study conducted by Chow et al. (1997) in Taiwan examined the demographic features and medical aspects of menopausal women. The study found that there were 14,298 newly reported instances of malignant neoplasms in women. Around 60% of these incidents were observed in women who were 50 years of age or older. The median age at which cervix uteri, breast, and ovarian cancers typically arise is approximately 48 to 49 years, which is in close proximity to the age at which menopause usually begins [48]. Throughout women’s reproductive years, which broadly span from the onset of menstruation to menopause, the ovary generates steroid hormones that directly influence the growth and operation of the breast. It is recognized that experiencing late menopause elevates the risk of breast cancer in women [49].
Educational level and job experience are key social determinants that can influence breast cancer risk through multiple pathways. Our study found that higher education was significantly associated with increased breast cancer risk, which may seem counterintuitive at first. However, education itself is not a direct biological predictor of breast cancer; rather, it is a proxy for various lifestyle, reproductive, and healthcare-seeking behaviors that contribute to disease development.
Higher educational attainment is often linked to lifestyle factors such as increased alcohol consumption, delayed childbearing, lower parity (fewer children), and greater use of hormone replacement therapy (HRT)—all of which have been identified as potential risk factors for breast cancer. Previous research suggests that women with higher education levels may have greater health awareness and access to healthcare, leading to earlier and more frequent breast cancer screenings, which could contribute to the observed higher incidence rates in this group [50]. In contrast, lower educational attainment and socioeconomic disadvantage have been associated with poorer breast cancer outcomes rather than lower risk. Vona-Davis and Rose [51] highlighted that social deprivation is linked to a higher incidence of aggressive breast cancer subtypes, such as estrogen receptor (ER)-negative and triple-negative breast cancer, which are often diagnosed at later stages with worse prognoses. Additionally, socioeconomic disparities—characterized by low family income, limited education, and lack of health insurance—can lead to reduced participation in mammography screening programs and delays in seeking medical care, ultimately contributing to poorer survival rates [51, 52].
Employment status is another crucial factor influencing breast cancer risk and outcomes. Stable employment is often associated with better healthcare access, including employer-sponsored insurance, routine medical check-ups, and preventive screenings. Conversely, job insecurity or unemployment may increase stress levels, limit healthcare access, and reduce the likelihood of engaging in health-promoting behaviors, all of which can negatively impact breast cancer detection and prognosis [53]. Therefore, our findings highlight the complex relationship between education, employment, and breast cancer risk, emphasizing the need for public health strategies that address both ends of the socioeconomic spectrum. While higher education is associated with lifestyle-related risk factors, lower education and economic disadvantage contribute to delayed diagnosis and worse outcomes, necessitating targeted interventions to improve both prevention and early detection efforts across diverse populations.
Strength and limitation
This study utilized a substantial sample obtained from the Taiwan Biobank, which adequately captured the Taiwanese population. In addition, the Gail Model has been tested in large populations worldwide and has been shown to provide accurate estimates of breast cancer risk. Moreover, this study focuses on investigating women who have a family history of breast cancer in their first relative as a vulnerable group for developing breast cancer stratified by age. Therefore, the study results could be applied as recommendations for the prevention programs targeting this particular population.
However, there are numerous limitations to consider when adopting these investigations. First, the study method was cross-sectional because the variables included in the study was obtained only from the health questionnaire of the of Taiwan Biobank at a single point in time. Consequently, making it difficult to establish cause-and-effect relationships. Therefore, in future studies, the inclusion of an appropriate dataset would enable a longitudinal study method to validate breast cancer-related factors among women who have been diagnosed with breast cancer. Second, all of individuals in this study are healthy women who do not have the personal information related a history of biopsy and presence of atypical hyperplasia in biopsy relevant to the data. It may impact the accuracy of our findings. Future studies should aim to collect comprehensive personal data, including biopsy history and the presence of atypical hyperplasia, to enhance the accuracy and applicability of breast cancer risk assessment models.
Third, an inconsistency was observed in the odds ratio (OR) values for significant factors in younger women, particularly pregnancy status, birth experience, and exposure to secondhand smoke, which appear to act as protective factors rather than predictive risk factors. Variations in physiological responses to reproductive and environmental exposures across age groups may contribute to these findings. Future research with larger, more diverse cohorts and comprehensive adjustments for confounding factors is necessary to validate these results and clarify the underlying mechanisms. Fourth, the poor discriminatory ability of the biopsychosocial factors in predicting breast cancer risk, as reflected in the relatively low AUC scores both age groups. This suggests that while these factors may contribute to breast cancer risk, they are insufficient as standalone predictors. The limited predictive performance may be due to the complex and multifactorial nature of breast cancer and may be influenced by biases stemming from the use of self-reported data. It can result in certain weaknesses, such as a lack of objectivity and the inaccuracy of data. Lastly, although previous studies have assessed the predictive performance of the Gail model in Chinese populations, the absence of validation studies conducted exclusively in Taiwan raises concerns regarding potential differences in genetic, environmental, and lifestyle factors that could influence model accuracy. Large-scale, population-based cohort studies should be carried out to evaluate how well the model performs in predicting breast cancer risk among Taiwanese women, considering potential variations in genetic predisposition, reproductive factors, lifestyle influences, and environmental exposures unique to Taiwan. Ultimately, such research would provide critical insights to refine risk prediction tools and optimize breast cancer screening and prevention strategies tailored to the Taiwanese demographic.
Conclusion
The present research indicates that the Gail model assesses individual breast cancer risk among Taiwanese women. 34.2% of women with a family history of breast cancer have a high-risk score of developing breast cancer over the next five years. Various biopsychosocial factors serve as predictors of breast cancer development, particularly in the older group, including age at first pregnancy, birth experience, breastfeeding practice, educational level, and job experience. These findings suggest that targeted interventions (i.e. secondary prevention) such as screening, mammography, counseling, educational interventions, and breast self-examination based on high-risk level and biopsychosocial factors could be implemented to prevent breast cancer, particularly among Taiwanese individuals with a family history of the disease.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Çorbacıoğlu ŞK, Aksel G. Receiver operating characteristic curve analysis in diagnostic accuracy studies: a guide to interpreting the area under the curve value. Turkish J Emerg Med. 2023;23(4).10.4103/tjem.tjem_182_23PMC 1066419538024184 · doi ↗ · pubmed ↗
- 2Collaborative Group on Hormonal Factors in Breast Cancer. Familial breast cancer: collaborative reanalysis of individual data from 52 epidemiological studies including 58 209 women with breast cancer and 101 986 women without the disease. Lancet. 2001;358(9291):1389–99. https://pubmed.ncbi.nlm.nih.gov/11705483/.10.1016/S 0140-6736(01)06524-211705483 · doi ↗ · pubmed ↗
- 3Collaborative Group on Hormonal Factors in Breast Cancer. Menarche, menopause, and breast cancer risk: individual participant meta-analysis, including 118 964 women with breast cancer from 117 epidemiological studies. Lancet Oncol. 2012;13(11):1141–51. https://pubmed.ncbi.nlm.nih.gov/23084519/.10.1016/S 1470-2045(12)70425-4PMC 348818623084519 · doi ↗ · pubmed ↗
- 4Dong J-Y, Qin L-Q. Education level and breast cancer incidence: a meta-analysis of cohort studies. Menopause. 2020;27(1). https://pubmed.ncbi.nlm.nih.gov/31479033/.10.1097/GME.000000000000142531479033 · doi ↗ · pubmed ↗
