Int J Med Sci 2026; 23(1):325-333. doi:10.7150/ijms.124006 This issue Cite

Research Paper

Comparative analysis of two analytical methodologies for deriving dietary patterns associated with vitamin D insufficiency and anemia among expectant mothers

Arpita Das1, Chien-Yeh Hsu2,3, Chyi-Huey Bai4,5,6, Jung-Su Chang1,6,7,8, Ya-Li Huang4,5, Fan-Fen Wang9, Nhi Thi Hong Nguyen1,10, Yi-Chun Chen1 Corresponding address, Jane C.-J. Chao1,3,6,8 Corresponding address

1. School of Nutrition and Health Sciences, Taipei Medical University, Taipei, Taiwan.
2. Department of Information Management, National Taipei University of Nursing and Health Sciences, Taipei, Taiwan.
3. Master Program in Global Health and Health Security, Taipei Medical University, Taipei, Taiwan.
4. Department of Public Health, School of Medicine, Taipei Medical University, Taipei, Taiwan.
5. School of Public Health, Taipei Medical University, Taipei, Taiwan.
6. Nutrition Research Center, Taipei Medical University Hospital, Taipei, Taiwan.
7. Graduate Institute of Metabolism and Obesity Sciences, Taipei Medical University, Taipei, Taiwan.
8. TMU Research Center for Digestive Medicine, Taipei Medical University, Taipei, Taiwan.
9. Department of Internal Medicine, Yangming Branch, Taipei City Hospital, Taipei, Taiwan.
10. Health Personnel Training Institute, University of Medicine and Pharmacy, Hue University, Hue, Vietnam.

Received 2025-8-20; Accepted 2025-11-21; Published 2026-1-1

Citation:
Das A, Hsu CY, Bai CH, Chang JS, Huang YL, Wang FF, Nguyen NTH, Chen YC, Chao JCJ. Comparative analysis of two analytical methodologies for deriving dietary patterns associated with vitamin D insufficiency and anemia among expectant mothers. Int J Med Sci 2026; 23(1):325-333. doi:10.7150/ijms.124006. https://www.medsci.org/v23p0325.htm
Other styles

File import instruction

Abstract

Graphic abstract

Objectives: Dietary patterns play a role associated with acute or chronic diseases. This study compared the correlation between dietary patterns and vitamin D status, using two methods for dietary pattern identification, related to gestational anemia among expectant mothers.

Methods: In this cross-sectional study a total of 1502 expectant mothers aged > 15 years were recruited from the Nutrition and Health Survey data. Dietary patterns were discerned through principal component analysis (PCA) and reduced rank regression (RRR). Associations between dietary pattern scores, vitamin D, and anemia-related biomarkers were validated using linear and binomial logistic regression adjusted for sociodemographic and economic factors.

Results: Two dietary patterns were identified: a plant and marine-based dietary pattern (PMDP) via PCA and a convenience food dietary pattern (CFDP) via RRR. PMDP was characterized by high intakes of plant and marine foods, while CFDP showed high intakes of canned and pickled foods. Risk estimation indicated that high PMDP intake was associated with 30% reduction in odds of vitamin D insufficiency, while moderate CFDP intake was linked to an increased risk of vitamin D insufficiency by 46%. Logistic regression analysis showed a positive association between CFDP and serum iron (β = 0.08; 95% CI: 0.05, 0.24), but a negative association with total iron binding capacity (β = -0.05; 95% CI: -0.44, -0.07).

Conclusions: The PCA-derived dietary pattern effectively identifies eating patterns, while the RRR-derived dietary pattern has a better estimate for disease associations in a specific population. The RRR-derived dietary pattern provides valuable insights for dietary guidelines targeting vitamin D insufficiency.

Keywords: principal component analysis, reduced rank regression analysis, serum vitamin D, anemia-related biomarkers

Introduction

Vitamin D deficiency and insufficiency have become a global health concern, and referred as “vitamin D deficiency epidemic” [1]. The prevalence of these conditions was widespread, with 15.7%, 47.9%, and 76.6% of individuals having serum 25(OH) vitamin D levels below 30, 50, and 75 nmol/L, respectively [2]. In northern Taiwan, the previous study found that 30.1% of pregnant women had low serum vitamin D levels (< 50 nmol/L) [3]. Vitamin D3, or calcitriol, plays a vital role in preventing metabolic diseases like rickets, osteomalacia, type 2 diabetes, cardiovascular disease, cancer, and chronic kidney disease [4]. During pregnancy, vitamin D supplementation has been linked to better maternal and fetal outcomes, including preeclampsia, gestational diabetes, improved fetal growth, preterm birth, and reduced risk of small for gestational age (SGA) [5].

Serum hemoglobin levels were positively correlated with serum vitamin D level in pregnant women [6], and the meta-analyses reported a 61% increased risk of anemia in pregnant women with vitamin D deficiency (OR = 1.61; 95% CI: 1.41, 1.83) [7]. Gestational anemia could lead to preterm delivery, SGA, reduced birth weight, low Apgar (appearance, pulse, grimace, activity, respiration) scores, and fetal complications [8]. Research by Bano et al. suggested that 50% of gestational anemia cases were due to insufficient iron intake or storage [9]. Factors like inadequate dietary diversity, consumption of tea or coffee with meals, insufficient meals, and multiparity increased anemia risk, while pregnant women consuming dairy, animal protein, and vitamin A-rich fruits and vegetables were associated with a reduced risk of anemia [10-13]. Our previous study in Taiwan also found positive associations between dietary patterns (DPs), such as plant-based, carnivore, dairy, and non-dairy DPs, and serum vitamin D, folate, and vitamin B12 levels [14]. Additionally, an adequate DP and frequent vitamin D supplementation were positively associated with serum ferritin and vitamin D levels [15].

Understanding DPs is essential for analyzing the complex relationships between diet and disease outcomes, as it offers a comprehensive assessment of overall dietary habits and their impact on health [16]. Statistical methods like principal component analysis (PCA) and reduced rank regression (RRR) are used to identify dietary patterns in populations [17], and PCA transforms correlated food groups into uncorrelated indexes, and preserves the maximum variance of the original foods [18]. However, PCA has subjectivity in selecting food groups, determining component numbers, and identifying factor loadings [18]. In contrast, RRR combines both a priori and posteriori approaches, and captures foods related to disease risk and biomarkers [19-21]. A previous study suggested RRR-derived DP with high intakes of plant-based and dairy products reduced the risk of micronutrient deficiencies affecting erythropoiesis in expectant mothers [22]. Few studies have examined the associations of PCA- [14] or RRR-derived DP [22] with gestational anemia and vitamin D insufficiency/deficiency in expectant mothers.

Given the limited research on comparing the association of PCA- or RRR-derived DP with anemia-related biomarkers in expectant mothers, our study aimed to compare PCA- and RRR-derived DPs and evaluate their associations with anemia-related biomarkers and vitamin D levels among expectant mothers in Taiwan. Based on our previous findings, we hypothesized that RRR-derived DP might show stronger associations with anemia biomarkers and vitamin D status in expectant mothers.

Methods

Study population

The present cross-sectional multi-center study utilized the data from the Nationwide Nutrition and Health Survey data of Taiwanese pregnant women (NHSDT 2017-2019). A total of 1502 expectant mothers were recruited from various regions of Taiwan including the north, south, central, east, and west regions. We recruited the expectant mothers who aged > 15 years, had a maternal handbook, and signed the inform consent. The expectant mothers with multiparity > 3 or without responsiveness were excluded in this study (Supplementary Figure 1). After considering the inclusion and exclusion criteria, all participants (n = 1502) were categorized into two groups: (1) serum vitamin D insufficiency (≤ 75 nmol/L) (n = 1067) and (2) normal serum vitamin D (> 75 nmol/L) (n = 435). The study was conducted in accordance with the Declaration of Helsinki, and was approved by the Joint Institutional Review Board of Taipei Medical University (TMU-JIRB No. N201707039).

Data collection

Initially, a self-reported survey was employed to collect comprehensive personal details such as socio-economic status and pregnancy-related issues. The self-reported survey included maternal demographic information such as age and region of residence and other data such as body measurements (weight and height), parity, stage of pregnancy, and sun exposure. An in-person interview by the well-trained dietitian was then carried out to determine eating habits and dietary intake. All information, including responses from the self-reported survey and dietary intake, and blood samples were collected during a single prenatal visit among expectant mothers in the first, second, or third trimester.

Dietary assessment

Dietary data collection was conducted by referring to a validated semi-quantitative food frequency questionnaire (sFFQ) [14, 22]. Food frequency was recorded as daily, weekly, or monthly consumption frequency, and consumption frequency was converted into monthly consumption frequency for the analysis of dietary patterns. A total of 59 food groups were initially identified, and 32 food groups were finally developed for the assessment of dietary patterns and the comparisons based on the nutrient compositions of the food groups [14]. To mitigate recording biases, the data collection of sFFQ was assisted by registered dietitians.

Assessment of anthropometric data

Anthropometric data, including pre-pregnancy body mass index (pBMI), were obtained through self-reported data collected during the initial trimester of pregnancy. The pBMI value was calculated by dividing the reported body weight (in kilograms) by the square of the reported height (in square meters) [23].

Evaluation of biochemical data

For the measurement of anemia-related parameters in blood samples, approximately 8 mL of peripheral blood samples were collected during the perinatal visit. Serum, plasma, and buffy coat were segregated by centrifugation, and stored at -80°C for subsequent analysis. The process for measuring blood parameters was reported in details previously [14].

Categorization of BMI and anemia-related biomarkers

The categorization of BMI and anemia-related biomarkers was followed the criteria demonstrated previously [14].

Statistical methods

Data expression and analysis

Statistical analyses were performed using SPSS (version 22.0, IBM Corp., Armonk, NY, USA) or SAS (version 9.4, SAS Institute Inc., Chicago, IL, USA). The characteristics of the study population with insufficient or normal serum vitamin D levels were compared using Student's t test or chi-square test, respectively, for continuous or categorical variables. Results were presented as mean ± standard deviation. Data for binomial regression or generalized logistic regression analysis were indicated as odds ratios (ORs) and 95% confidence intervals (CIs) or β and 95% CIs, respectively, to determine the association of DP scores with vitamin D status among expectant mothers. A p-value < 0.05 was considered significant for all statistical analyses.

DP assessment

The DPs were derived using PCA and RRR methods by employing PROC FACTOR and PROC PLUS functions of the SAS program. The PCA method was employed with 32 food groups and orthogonal varimax rotation, and retained only one factor for the final comparison. The DP was developed from the RRR method using 32 food groups as predictor variables and 6 blood parameters including hemoglobin, iron, ferritin, total iron-binding capacity (TIBC), folate, and vitamin B12 as response variables. The first factor presenting the maximum percentage (1.70%) of variations in response variables was retained among the 6 response factors. An absolute factor loading value was set at ≥ 0.20 [17] for both DPs. The DP scores for both patterns were calculated by summing up the consumption frequency score of the food groups weighed by their respective factor loading values. Based on the compositions of the DPs, PCA- or RRR-derived DP was renamed, respectively, as plant and marine-based DP (PMDP) or convenience food DP (CFDP). For further analysis, the DP scores of each DP were divided into tertiles (T) with 3 covariate-adjusted models: model 1 - unadjusted, model 2 - adjusted for age and parity, and model 3 - adjusted for age, parity, trimester of pregnancy, region of residence, and monthly family income.

Results

Participant characteristics

Supplementary Table 1 summarizes the general characteristics of the expectant mothers across different serum vitamin D levels. Significant differences were found in sociodemographic or pregnancy related factors, including region of residence, parity, trimester of pregnancy, and monthly family income, between the expectant mothers with normal or insufficient serum vitamin D levels. Expectant mothers with normal serum vitamin D levels were mostly resided in southern Taiwan (150 individuals, 34.6%), predominantly primiparous (213 individuals, 49.2%), in the third trimester (236 individuals, 54.3%), and had a monthly family income between NTD 30,000 and 59,999 (204 individuals, 48.0%). However, pBMI status was similar between the expectant mothers with normal or insufficient serum vitamin D levels.

Serum biochemical characteristics of the expectant mothers across serum vitamin D levels

Table 1 summarizes serum biochemical characteristics of the expectant mothers with different levels of serum vitamin D. Participants with normal serum vitamin D levels had significantly higher serum hemoglobin (7.4 ± 1.2 g/dL), iron (14.1 ± 8.1 µmol/L), TIBC (85.4 ± 16.5 µmol/L), folate (32.5 ± 17.1 nmol/L, range: 13.5-45.3 nmol/L; 63.4%), and vitamin B12 (254.1 ± 178.3 pmol/L) compared to those with insufficient serum vitamin D levels.

Dietary patterns (DPs)

Supplementary Figure 2 illustrates the dietary patterns derived from PCA or RRR analysis. The PCA-derived dietary pattern, characterized by high consumption of bamboo shoots and melons, mushrooms, roots and tubers, dark-colored vegetables, legumes, marine plants, nuts, soy products, aquatic fish, herbs and spices, fresh fruits, pickled vegetables, and salt, was named as the plant and marine-based dietary pattern (PMDP). In contrast, the RRR-derived dietary pattern was positively associated with canned and pickled fruits and jam, but negatively associated with legumes, semi-fat meat, poultry meat, lean meat, and dark-colored vegetables. This pattern was denoted as the convenience food dietary pattern (CFDP).

 Table 1 

Serum biochemical characteristics of expectant mothers across different levels of vitamin D

Serum variablesTotal population
(n = 1502)
Insufficient vitamin D
≤ 75 nmol/L
(n = 1067)
Normal
vitamin D
> 75 nmol/L
(n = 435)
p
Hemoglobin, mmol/L, n (%)7.3 ± 1.27.2 ± 1.27.4 ± 1.20.017
< 4.3410 (0.7)9 (0.9)1 (0.2)0.217
4.34-6.21123 (8.2)87 (8.2)36 (8.3)
6.21-6.83189 (12.6)141 (13.2)48 (11.0)
> 6.83-8.691108 (73.8)785 (73.6)323 (74.3)
> 8.6971 (4.7)44 (4.1)27 (6.2)
Iron, µmol/L12.9 ± 7.112.4 ± 6.614.1 ± 8.10.000
Ferritin, nmol/L0.05 ± 0.090.08 ± 0.110.05 ± 0.050.053
TIBC, µmol/L83.5 ± 18.282.8 ± 18.885.4 ± 16.50.012
Transferrin saturation, %16.5 ± 9.916.3 ± 9.716.9 ± 10.30.301
Folate, nmol/L, n (%)28.9 ± 16.627.6 ± 16.332.5 ± 17.10.000
< 6.833 (2.2)27 (2.5)6 (1.4)0.000
6.8-13.4220 (14.6)175 (16.4)45 (10.3)
13.5-45.3985 (65.6)709 (66.5)276 (63.5)
> 45.3264 (17.6)156 (14.6)108 (24.8)
Vitamin B12, pmol/L232.1 ± 147.9223.2 ± 132.6254.1 ± 178.30.000

Continuous data were represented as mean ± standard deviation, while categorical data were expressed as frequencies and percentages enclosed in the parentheses. Statistical significance was assessed through the calculation of p-values using Student's t test for continuous variables and chi-square test for categorical variables. A p-value below 0.05 was indicated as statistical significance between insufficient and normal serum vitamin D groups. TIBC: total iron binding capacity.

Notably, the dietary patterns from the two methods showed distinct characteristics. The RRR-derived dietary pattern explained a higher percentage of variation (6.55%) compared to the PCA-derived dietary pattern (4.37%). The cumulative percentage explained by RRR was approximately 1.70%, with the highest variations attributed to serum TIBC (5.11%) and iron (1.95%).

Associations between DP scores and serum vitamin D status

Table 2 displayed the associations between the scores of DPs derived from PCA or RRR and vitamin D status. Expectant mothers exhibiting the highest intake of the PCA-derived PMDP demonstrated a reduced risk (30% to 35%) of serum vitamin D insufficiency across all three models. Conversely, expectant mothers with moderate consumption (T2) of the RRR-derived CFDP were associated with an increased risk of vitamin D insufficiency by 46% to 52% in all three models. Similar results were observed among expectant mothers with the highest consumption (T3) of the RRR-derived CFDP in models 1 and 2, with an increased risk of vitamin D insufficiency by 36% and 35%, respectively.

 Table 2 

Associations between PCA- or RRR-derived dietary pattern and vitamin D insufficiency across the tertiles of dietary pattern scores

Dietary patternsTertiles (T) of dietary pattern scores (n = 1502)
T1 ORT2 OR (95% CI)T3 OR (95% CI)
Total population, n505486511
Vitamin D insufficiency, n (%)358 (70.9)351 (72.2)358 (70.1)
PCA-derived dietary pattern
Model 11.000.91 (0.64, 1.21)0.65 (0.49, 0.86)**
Model 21.000.92 (0.69, 1.23)0.68 (0.51, 0.90)**
Model 31.000.92 (0.69, 1.23)0.70 (0.52, 0.93)*
RRR-derived dietary pattern
Model 11.001.52 (1.15, 2.00)**1.36 (1.04, 1.78)*
Model 21.001.51 (1.15, 2.00)**1.35 (1.03, 1.78)*
Model 31.001.46 (1.00, 1.95)**1.17 (0.88, 1.56)

PCA-derived dietary pattern renamed as the plant and marine-based dietary pattern. RRR-derived dietary pattern renamed as the convenience food dietary pattern. Serum levels of 25(OH) vitamin D were categorized into two groups: insufficient status, defined as ≤ 75 nmol/L (≤ 30 ng/mL), and normal status, defined as > 75 nmol/L (> 30 ng/mL). Tertile 1 (T1): reference. Model 1: unadjusted. Model 2: adjusted for age and parity. Model 3: adjusted for age, parity, trimester of pregnancy, region of residence, and monthly family income. Statistical significance is denoted by * for p < 0.05 and ** for p ≤ 0.01.

Associations of PCA- or RRR-derived DP with serum anemia-related biomarkers

Table 3 presents the associations between PCA- and RRR-derived DPs and anemia-related biomarkers. Expectant mothers consuming the PCA-derived PMDP showed negative associations with serum ferritin (β = -0.06, -0.06; 95% CI: -0.28, -0.02 and -0.28, -0.02) and positive associations with TIBC (β = 0.08, 0.09; 95% CI: 0.01, 0.06 and 0.01, 0.06) in models 1 and 2.

In contrast, expectant mothers consuming the RRR-derived CFDP showed positive associations with serum hemoglobin (β = 0.06, 0.06; 95% CI: 0.00, 0.01 and 0.00, 0.01) and ferritin (β = 0.10, 0.09; 95% CI: 0.22, 0.72 and 0.20, 0.70) in models 1 and 2. Additionally, serum iron was positively associated in all three models (β = 0.10, 0.09, 0.04; 95% CI: 0.02, 0.05; 0.01, 0.05; and -0.05, 0.44) among expectant mothers with the RRR-derived CFDP. However, expectant mothers with the RRR-derived CFDP showed a negative association with serum TIBC across all three models (β = -0.17, -0.17, -0.05; 95% CI: -0.18, -0.10; -0.18, -0.10; and -0.44, -0.07).

 Table 3 

Association between PCA- and RRR-derived dietary patterns with serum anemia-related biomarkers

Serum variablesDietary patterns (DP) (n = 1502)
PCA-derived DP
β (95% CI)
RRR-derived DP
β (95% CI)
Hemoglobin, mmol/L
Model 1-0.01 (-0.00, 0.00)0.06 (0.00, 0.01)*
Model 2-0.01 (-0.00, 0.00)0.06 (0.00, 0.01)*
Model 30.00 (-0.00, 0.00)0.04 (-0.00, 0.01)
Iron, µmol/L
Model 1-0.05 (-0.02, 0.00)0.10 (0.02, 0.05)***
Model 2-0.05 (-0.02, 0.00)0.09 (0.01, 0.05)***
Model 3-0.04 (-0.02, 0.00)0.08 (0.05, 0.24)**
Ferritin, nmol/L
Model 1-0.06 (-0.28, -0.02) *0.10 (0.22, 0.72)***
Model 2-0.06 (-0.28, -0.02) *0.09 (0.20, 0.70)***
Model 3-0.03 (-0.20, 0.05)0.04 (-0.05, 0.44)
TIBC, µmol/L
Model 10.08 (0.01, 0.06) **-0.17 (-0.18, -0.10)***
Model 20.09 (0.01, 0.06) **-0.17 (-0.18, -0.10)***
Model 30.02 (-0.01, 0.02)-0.05 (-0.44, -0.07)**
Transferrin saturation, %
Model 10.00 (-0.01, 0.01)0.00 (-0.02, 0.02)
Model 20.00 (-0.01, 0.01)0.00 (-0.02, 0.02)
Model 30.01 (-0.01, 0.01)-0.00 (-0.03, 0.02)
Folate, nmol/L
Model 10.02 (-0.01, 0.03)0.02 (-0.02, 0.06)
Model 20.00 (-0.02, 0.02)0.02 (-0.03, 0.05)
Model 30.03 (-0.01, 0.03)-0.02 (-0.05, 0.02)
Vitamin B12, pmol/L
Model 1-0.03 (-0.26, 0.08)0.04 (-0.11, 0.58)
Model 2-0.04 (-0.30, 0.06)0.03 (-0.14, 0.55)
Model 3-0.01 (-0.21, 0.14)-0.01 (-0.40, 0.29)

PCA derived dietary pattern renamed as plant and marine based dietary pattern. RRR derived dietary pattern renamed as convenience food dietary pattern. The values of β and the data enclosed in the parentheses represent regression coefficients and 95% confidence intervals (CIs) following the adjustment for covariates in various models. Model 1: unadjusted. Model 2: adjusted for age and parity. Model 3: adjusted for age, parity, trimester of pregnancy, region of residence, and monthly family income. Statistical significance is denoted by * for p < 0.05, ** for p ≤ 0.01, and *** for p ≤ 0.001. DP: dietary pattern; TIBC: total iron binding capacity.

PCA-derived DP scores and serum anemia-related biomarkers

Table 4 shows the PCA-derived DP scores and anemia-related biomarkers. Expectant mothers in the highest tertile (T3) of the PCA-derived PMDP had 1.41 (95% CI: 1.10, 1.82) and 1.47 (95% CI: 1.13, 1.91) times higher odds of serum ferritin deficiency in models 1 and 2, respectively. Conversely, expectant mothers in the highest tertile of the PCA-derived PMDP reduced odds of serum TIBC deficiency by 34% (OR = 0.66; 95% CI: 0.50, 0.87) and 36% (OR = 0.64; 95% CI: 0.49, 0.85) in models 1 and 2, respectively.

 Table 4 

Associations of PCA-derived dietary pattern with serum anemia-related biomarkers across the tertiles of dietary pattern scores

Serum variablesTertiles (T) of PCA-derived dietary pattern scores (n = 1502)
T1 (n = 505)
OR
T2 (n = 486)
OR (95% CI)
T3 (n = 511)
OR (95% CI)
Hemoglobin, mmol/L
Model 11.001.61 (0.52, 4.96)1.34 (0.46, 3.89)
Model 21.001.63 (0.53, 5.02)1.36 (0.48, 1.89)
Model 31.001.88 (0.55, 6.36)1.25 (0.42, 3.70)
Iron, µmol/L
Model 11.001.01 (0.78, 1.30)1.26 (0.97, 1.62)
Model 21.001.04 (0.80, 1.35)1.30 (1.00, 1.68)
Model 31.000.95 (0.73, 1.25)1.18 (0.90, 1.55)
Ferritin, nmol/L
Model 11.001.06 (0.83, 1.37)1.41 (1.10, 1.82)**
Model 21.001.12 (0.87, 1.45)1.47 (1.13, 1.91)**
Model 31.000.96 (0.72, 1.29)1.16 (0.86, 1.56)
TIBC, µmol/L
Model 11.000.93 (0.71, 1.21)0.66 (0.50, 0.87)**
Model 21.000.89 (0.68, 1.16)0.64 (0.49, 0.85)**
Model 31.001.07 (0.76, 1.49)0.86 (0.61, 1.21)
Transferrin saturation, %
Model 11.000.93 (0.72, 1.20)0.87 (0.67, 1.12)
Model 21.000.92 (0.71, 1.18)0.85 (0.66, 1.10)
Model 31.000.88 (0.68, 1.14)0.84 (0.65, 1.09)
Folate, nmol/L
Model 11.000.73 (0.53, 1.03)0.73 (0.52, 1.03)
Model 21.000.81 (0.57, 1.16)0.83 (0.58, 1.19)
Model 31.000.76 (0.53, 1.11)0.70 (0.48, 1.02)
Vitamin B12, pmol/L
Model 11.000.90 (0.67, 1.21)1.05 (0.78, 1.40)
Model 21.000.94 (0.69, 1.26)1.08 (0.81, 1.46)
Model 31.000.86 (0.63, 1.18)0.93 (0.69, 1.27)

PCA-derived dietary pattern renamed as plant and marine-based dietary pattern. Odds ratios (ORs) and 95% confidence intervals (CIs) are expressed for the risk of low serum biochemical variables across the tertiles of PCA-derived DP, and tertile 1 (T1) was used as the reference group. Three distinct models were executed for the binomial logistic regression analysis. Model 1: unadjusted. Model 2: adjusted for age and parity. Model 3: adjusted for age, parity, trimester of pregnancy, region of residence, and monthly family income. Variables were categorized into two tiers based on serum cutoff values: hemoglobin, 6.52 mmol/L (10.5 g/dL); iron, 10.7 µmol/L (60 µg/dL); ferritin, 0.034 nmol/L (15 ng/mL); TIBC, 43.0 µmol/L (240 µg/dL); transferrin saturation, 16%; folate, 13.6 nmol/L (6 ng/mL); and vitamin B12, 149.8 pmol/L (203 pg/mL). Statistical significance is denoted by ** for p ≤ 0.01. TIBC: total iron binding capacity.

RRR-derived DP scores and serum anemia-related biomarkers

Table 5 displays the results for the RRR-derived dietary pattern score and serum anemia-related biomarkers. Expectant mothers with T2 of the RRR-derived CFDP score had lower odds of 0.72 (95% CI: 0.54, 0.96) and 0.74 (95% CI: 0.55, 0.99) for serum iron deficiency in both models 1 and 2, respectively. Similarly, those with T2 and T3 of the RRR-derived CFDP score had lower odds of serum ferritin deficiency (T2-model 1: OR = 0.72; 95% CI: 0.54, 0.96; model 2: OR = 0.74; 95% CI: 0.55, 1.00; T3-model 1: OR = 0.61; 95% CI: 0.45, 0.81; model 2: OR = 0.61; 95% CI: 0.46, 0.83) in both models. In contrast, expectant mothers with T2 of the RRR-derived CFDP score elevated the risk for serum TIBC deficiency by 51% (95% CI: 1.09, 2.10) and 45% (95% CI: 1.04, 2.02), and those with T3 of the RRR-derived CFDP score increased the risk for serum TIBC deficiency by 74% (95% CI: 1.26, 2.41) and 71% (95% CI: 1.23, 2.37) in models 1 and 2, respectively.

 Table 5 

Associations of RRR-derived dietary pattern with serum anemia-related biomarkers across the tertiles of dietary pattern scores

Serum variablesTertiles (T) of RRR-derived dietary pattern scores (n = 1502)
T1 (n = 505)
OR
T2 (n = 486)
OR (95% CI)
T3 (n = 511)
OR (95% CI)
Hemoglobin, mmol/L
Model 11.001.68 (0.40, 7.06)1.68 (0.40, 7.06)
Model 21.001.67 (0.40, 7.06)1.68 (0.40, 7.10)
Model 31.002.05 (0.51, 8.32)0.70 (0.24, 2.05)
Iron, µmol/L
Model 11.000.72 (0.54, 0.96)*0.76 (0.57, 1.02)
Model 21.000.74 (0.55, 0.99)*0.79 (0.59, 1.06)
Model 31.000.83 (0.64, 1.08)0.84 (0.64, 1.09)
Ferritin, nmol/L
Model 11.000.72 (0.54, 0.96)*0.61 (0.45, 0.81)**
Model 21.000.74 (0.55, 1.00)*0.61 (0.46, 0.83)**
Model 31.000.94 (0.70, 1.26)0.91 (0.68, 1.22)
TIBC, µmol/L
Model 11.001.51 (1.09, 2.10)*1.74 (1.26, 2.41)**
Model 21.001.45 (1.04, 2.02)*1.71 (1.23, 2.37)**
Model 31.001.42 (0.95, 2.14)1.25 (0.84, 1.87)
Transferrin saturation, %
Model 11.001.19 (0.90, 1.59)1.10 (0.83, 1.47)
Model 21.001.19 (0.89, 1.58)1.09 (0.81, 1.45)
Model 31.001.21 (0.90, 1.62)1.06 (0.79, 1.42)
Folate (nmol/L)
Model 11.000.88 (0.60, 1.29)1.04 (0.71, 1.51)
Model 21.000.92 (0.61, 1.39)1.13 (0.76, 1.69)
Model 31.000.98 (0.64, 1.51)1.43 (0.93, 2.18)
Vitamin B12 (pmol/L)
Model 11.000.92 (0.66, 1.30)1.00 (0.71, 1.40)
Model 21.000.96 (0.68, 1.35)1.04 (0.74, 1.46)
Model 31.001.01 (0.71, 1.44)1.25 (0.87, 1.78)

RRR derived dietary pattern renamed as convenience food dietary pattern. Odds ratios (ORs) and 95% confidence intervals (CIs) are expressed for the risk of low serum biochemical variables across the tertiles of RRR-derived DP, and tertile 1 (T1) was used as the reference group. Three distinct models were executed for the binomial logistic regression analysis. Model 1: unadjusted. Model 2: adjusted for age and parity. Model 3: adjusted for age, parity, trimester of pregnancy, region of residence, and monthly family income. Variables were categorized into two tiers based on serum cutoff values: hemoglobin, 6.52 mmol/L (10.5 g/dL); iron, 10.7 µmol/L (60 µg/dL); ferritin, 0.034 nmol/L (15 ng/mL); TIBC, 43.0 µmol/L (240 µg/dL); transferrin saturation, 16%; folate, 13.6 nmol/L (6 ng/mL); and vitamin B12, 149.8 pmol/L (203 pg/mL). Statistical significance is denoted by * for p < 0.05 and ** for p ≤ 0.01. TIBC: total iron binding capacity.

Discussion

Comparisons between PCA- and RRR-derived DPs

Results from the present study revealed that the PCA-derived PMDP was associated with a decreased risk of low serum vitamin D status. Conversely, the RRR-derived CFDP showed an increased risk for low serum vitamin D status. Comparing the PCA- and RRR-derived DPs, PCA analysis elucidated dietary behaviors within the population, while RRR primarily focused on outcome variables, revealing the associations with disease-related variables or biomarkers [24]. Consequently, both approaches identified different DPs. The PCA-derived DP was characterized by higher intakes of plant foods, fish, pickled vegetables, and salt. In contrast, the RRR-derived DP was characterized by an increased intake of processed foods and decreased consumption of dark-colored vegetables, lean or semi-fat meat, and legumes and various beans. Consequently, the PCA-derived DP exhibited a negative association with ferritin, whereas the RRR-derived DP showed positive associations with serum anemia-related biomarkers such as hemoglobin, iron, and ferritin. These results highlighted the distinct characteristics of RRR (disease-associated) [25] and PCA (eating pattern-associated) [26] methods. Previous studies have generally reported similar DPs derived from both PCA and RRR methods [17, 27], though some studies have shown contrasting results like ours [24, 28, 29].

PCA- or RRR-derived DP, anemia-related biomarkers, and vitamin D status

Regarding the association of DPs with serum anemia-related biomarkers and vitamin D status, our study found that the PCA-derived DP contained the hindering food groups for iron absorption, potentially explaining an inverse association with ferritin [14]. Additionally, the presence of vitamin D-enriched food items in the PCA-derived DP (such as mushrooms, marine plants, and soy products) could contribute to improved vitamin D status, consistent with the previous research [14, 30-33]. Furthermore, our findings were consistent with other studies indicating that traditional DP with increased consumption of rice, pork, and vegetables [34], consumption of non-heme iron and iron-inhibiting foods (such as coffee, soft drinks, and sugar-sweetened beverages) [35], fast food DP [36], and inflammatory DP (comprising of eggs and animal products in combination with processed, fried, and sweetened foods) [37] could increase the risk of anemia across various populations.

Surprisingly, we observed a positive association between RRR-derived CFDP with high intakes of canned and pickled fruits and jam and serum ferritin levels, possibly attributed to the improvement in gut microbiota profile. Pickles, in particular, are rich sources of beneficial gut bacteria, which could enhance gut microbial functions [38]. This finding aligns with the report indicating that intestinal gut microbiota played a crucial role in iron absorption [39]. Furthermore, we noted an inverse association between ferritin and TIBC status, consistent with previous research highlighting TIBC as a biomarker inversely related to iron status [40, 41]. While the association between serum vitamin D and ferritin status remains controversial [7, 42-45]. An inverse association observed between serum vitamin D and ferritin levels could be explained by the opposite properties of vitamin D as an anti-inflammatory role and ferritin as an inflammatory biomarker [46]. However, vitamin D could reduce inflammatory cytokines and hepcidin levels to ensure sufficient iron storage [47]. In the present study, an inverse association between serum vitamin D and ferritin status was not found in the completed adjusted model. A previous study reported that serum vitamin D was positively associated with serum hemoglobin levels, but the role of DP was not correlated with this association among women of reproductive age [48].

Strengths and limitations

To our knowledge, the present study was the first research to explore two different statistical methods (PCA and RRR) deriving DPs, and examine the associations of DPs with serum vitamin D and anemia-related biomarkers among expectant mothers. A major strength of this study was to use the nationwide nutrition survey data, cluster sampling, and covariate-adjusted models to identify the most relevant associations, along with validated sFFQ [28]. The large sample size accurately represented the characteristics of the population. However, despite these strengths, this study has several limitations, including the absence of data analysis for 24-hour dietary recall and dietary supplements, and missing information on inflammatory biomarkers, parathyroid hormone, hepcidin, and pregnancy-related eating disorders (such as pica and morning sickness).

Conclusions

In summary, PCA- and RRR-derived DPs were compatible. The PCA-derived DP depicted population eating behavior, while the RRR-derived DP identified diet-disease associations. The PCA-derived plant and marine-based DP with high intakes of plant foods, nuts, fish, and seafood products was positively associated with serum vitamin D, but not correlated with serum anemia-related biomarkers. The RRR-derived convenience food DP with high consumption of canned or pickled fruits and jam showed negative associations with serum vitamin D and TIBC, but a positive association with serum iron. The RRR-derived DP may provide the insight for dietary guidelines targeting vitamin D insufficiency. Further research should explore DP scores and gestational anemia risk, incorporating additional anemia and vitamin D biomarkers.

Abbreviations

BMI: body mass index; CFDP: convenience food dietary pattern; CI: confidence interval; DP: dietary pattern; OR: odds ratio; PCA: principal component analysis; PMBDP: plant marine-based dietary pattern; RRR: reduced rank regression; sFFQ: semi-quantitative food frequency questionnaire; TIBC: total iron-binding capacity.

Supplementary Material

Supplementary figures and table.

Attachment

Acknowledgements

We extended our appreciation to the Nationwide Nutrition and Health Survey in Pregnant Women (Taiwan) for providing us with access to the database for our research endeavors.

Funding

This research, utilizing secondary data, did not receive any external financial support.

Data availability statement

Access to the data is restricted solely for research purposes, and the data are not available to the public.

Ethical approval

The research was granted approval by the Joint Institutional Review Board of Taipei Medical University, Taiwan, under approval number N201707039, dated November 7, 2018, and followed the ethical guidelines set forth in the Declaration of Helsinki. Prior to participating in the study, all individuals provided informed consent.

Author contributions

A Das, Y-C Chen, and JC-J Chao were pivotal in conceptualizing the study. C-Y Hsu, C-H Bai, J-S Chang, Y-L Huang, and F-F Wang contributed to data curation. A Das conducted formal analysis. A Das, Y-C Chen, and JC-J Chao initially drafted the manuscript. NTH Nguyen and JC-J Chao contributed to manuscript revision and proofreading. Y-C Chen and JC-J Chao supervised the research. All authors have reviewed and endorsed the final manuscript.

Competing Interests

The authors have declared that no competing interest exists.

References

1. Rebelos E, Tentolouris N, Jude E. The role of vitamin D in health and disease: a narrative review on the mechanisms linking vitamin D with disease and the effects of supplementation. Drugs. 2023;83:665-85

2. Cui A, Zhang T, Xiao P. et al. Global and regional prevalence of vitamin D deficiency in population-based studies from 2000 to 2022: a pooled analysis of 7.9 million participants. Front Nutr. 2023;10:1070808

3. Huang YL, Pham TTM, Chen YC. et al. Effects of climate, sun exposure, and dietary intake on vitamin D concentrations in pregnant women: a population-based study. Nutrients. 2023;15:1182

4. Courbebaisse M, Cavalier E. Vitamin D in 2020: an old pro-hormone with potential effects beyond mineral metabolism. Nutrients. 2020;12:3378

5. Pérez-López FR, Pilz S, Chedraui P. Vitamin D supplementation during pregnancy: an overview. Curr Opin Obstet Gynecol. 2020;32:316-21

6. Hitesh T, Khatuja R, Agrawal P. et al. Unlocking the mystery of the role of vitamin D in iron deficiency anemia in antenatal women: a case control study in a tertiary care hospital in New Delhi. BMC Pregnancy Childbirth. 2023;23:749

7. Lima MS, Pereira M, Castro CT. et al. Vitamin D deficiency and anemia in pregnant women: a systematic review and meta-analysis. Nutr Rev. 2022;80:428-38

8. Shah T, Khaskheli MS, Ansari S. et al. Gestational anemia and its effects on neonatal outcome, in the population of Hyderabad, Sindh, Pakistan. Saudi J Biol Sci. 2022;29:83-7

9. Bano T, Nazar B, Tahir MR. APGAR score of neonates born to anemic mothers versus non-anemic mothers. J Islamabad Med Dent Coll. 2018;7:246-9

10. Gibore NS, Ngowi AF, Munyogwa MJ. et al. Dietary habits associated with anemia in pregnant women attending antenatal care services. Curr Dev Nutr. 2021;5:nzaa178

11. Ayensu J, Annan R, Lutterodt H. et al. Prevalence of anaemia and low intake of dietary nutrients in pregnant women living in rural and urban areas in the Ashanti region of Ghana. PLoS One. 2020;15:e0226026

12. Habe S, Haruna M, Yonezawa K. et al. Factors associated with anemia and iron deficiency during pregnancy: a prospective observational study in Japan. Nutrients. 2024;16:418

13. Jugha VT, Anchang-Kimbi JK, Anchang JA. et al. Dietary diversity and its contribution in the etiology of maternal anemia in conflict hit mount cameroon area: a cross-sectional study. Front Nutr. 2021;7:625178

14. Das A, Bai C-H, Chang J-S. et al. Associations of dietary patterns and vitamin D levels with iron status in pregnant women: a cross-sectional study in Taiwan. Nutrients. 2023;15:1805

15. Wong RS, Tung KTS, Chan YWK. et al. Adequate dietary intake and vitamin D supplementation: a study of their relative importance in determining serum vitamin D and ferritin concentrations during pregnancy. Nutrients. 2022;14:3083

16. Agedew E, Abebe Z, Ayelign A. Dietary patterns in relation with nutritional outcomes and associated factors among adolescents: implications for context-specific dietary intervention for the agrarian community, northwest Ethiopia. Front Nutr. 2023;10:1274406

17. Kurniawan AL, Hsu CY, Lee HA. et al. Comparing two methods for deriving dietary patterns associated with risk of metabolic syndrome among middle-aged and elderly Taiwanese adults with impaired kidney function. BMC Med Res Methodol. 2020;20:255

18. Zhao J, Li Z, Gao Q. et al. A review of statistical methods for dietary pattern analysis. Nutr J. 2021;20:37

19. Li Y, Yatsuya H, Wang C. et al. Dietary patterns derived from reduced rank regression are associated with the 5-year occurrence of metabolic syndrome: aichi workers' cohort study. Nutrients. 2022;14:3019

20. Lazarova SV, Jessri M. Associations between dietary patterns and cardiovascular disease risk in Canadian adults: a comparison of partial least squares, reduced rank regression, and the simplified dietary pattern technique. Am J Clin Nutr. 2022;116:362-77

21. Jacobs S, Kroeger J, Schulze MB. et al. Dietary patterns derived by reduced rank regression are inversely associated with type 2 diabetes risk across 5 ethnic groups in the multiethnic cohort. Curr Dev Nutr. 2017;1:e000620

22. Mayasari NR, Bai C-H, Chao J. et al. Relationships between dietary patterns and erythropoiesis-associated micronutrient deficiencies (iron, folate, and vitamin B12) among pregnant women in Taiwan. Nutrients. 2023;15:2311

23. Khanna D, Peltzer C, Kahar P. et al. Body mass index (BMI): a screening tool analysis. Cureus. 2022;14:e22119

24. Batis C, Mendez MA, Gordon-Larsen P. et al. Using both principal component analysis and reduced rank regression to study dietary patterns and diabetes in Chinese adults. Public Health Nutr. 2016;19:195-203

25. Sun Q, Wen Q, Lyu J. et al. Dietary pattern derived by reduced-rank regression and cardiovascular disease: a cross-sectional study. Nutr Metab Cardiovasc Dis. 2022;32:337-45

26. Schwedhelm C, Iqbal K, Knüppel S. et al. Contribution to the understanding of how principal component analysis-derived dietary patterns emerge from habitual data on food consumption. Am J Clin Nutr. 2018;107:227-35

27. Sauvageot N, Leite S, Alkerwi AA. et al. Association of empirically derived dietary patterns with cardiovascular risk factors: a comparison of PCA and RRR methods. PLoS One. 2016;11:e0161298

28. Hoffmann K, Boeing H, Boffetta P. et al. Comparison of two statistical approaches to predict all-cause mortality by dietary patterns in German elderly subjects. Br J Nutr. 2005;93:709-16

29. Manios Y, Kourlaba G, Grammatikaki E. et al. Comparison of two methods for identifying dietary patterns associated with obesity in preschool children: the GENESIS study. Eur J Clin Nutr. 2010;64:1407-14

30. Mayasari NR, Bai C-H, Hu T-Y. et al. Associations of food and nutrient intake with serum hepcidin and the risk of gestational iron-deficiency anemia among pregnant women: a population-based study. Nutrients. 2021;13:3501

31. Beck KL, Conlon CA, Kruger R. et al. Dietary determinants of and possible solutions to iron deficiency for young women living in industrialized countries: a review. Nutrients. 2014;6:3747-76

32. Cardwell G, Bornman JF, James AP. et al. A review of mushrooms as a potential source of dietary vitamin D. Nutrients. 2018;10:1498

33. Tamama K. Potential benefits of dietary seaweeds as protection against COVID-19. Nutr Rev. 2021;79:814-23

34. Xu X, Hall J, Byles J. et al. Dietary pattern, serum magnesium, ferritin, C-reactive protein and anaemia among older people. Clin Nutr. 2017;36:444-51

35. Wiafe MA, Apprey C, Annan RA. Patterns of dietary iron intake, iron status, and predictors of haemoglobin levels among early adolescents in a rural Ghanaian district. J Nutr Metab. 2020;2020:3183281

36. Ma J, Huang J, Zeng C. et al. Dietary patterns and association with anemia in children aged 9-16 years in Guangzhou, China: a cross-sectional study. Nutrients. 2023;15:4133

37. Paramastri R, Hsu CY, Lee HA. et al. Association between dietary pattern, lifestyle, anthropometric status, and anemia-related biomarkers among adults: a population-based study from 2001 to 2015. Internat J Environ Res Public Health. 2021;18:3437

38. Behera SS, El Sheikha AF, Hammami R. et al. Traditionally fermented pickles: how the microbial diversity associated with their nutritional and health benefits? J Funct Foods. 2020;70:103971

39. Yilmaz B, Li H. Gut microbiota and iron: the crucial actors in health and disease. Pharmaceuticals. 2018;11:98

40. Faruqi A, Mukkamalla SKR. Iron binding capacity. In: StatPearls. Treasure Island, FL: StatPearls Publishing; 2024. https://www.ncbi.nlm.nih.gov/books/NBK559119

41. Raza N, Sarwar I, Munazza B. et al. Assessment of iron deficiency in pregnant women by determining iron status. J Ayub Med Coll Abbottabad. 2011;23:36-40

42. Mogire RM, Muriuki JM, Morovat A. et al. Vitamin D deficiency and its association with iron deficiency in African children. Nutrients. 2022;14:1372

43. Malczewska-Lenczowska J, Sitkowski D, Surała O. et al. The association between iron and vitamin D status in female elite athletes. Nutrients. 2018;10:167

44. Madar AA, Stene LC, Meyer HE. et al. Effect of vitamin D3 supplementation on iron status: a randomized, double-blind, placebo-controlled trial among ethnic minorities living in Norway. Nutr J. 2016;15:74

45. Mohammed M, Abed M, Shweash M. et al. Relationship between vitamin D deficiency and serum ferritin level in healthy women. Int J Res Pharm Sci. 2019;10:2095-8

46. Yoon H, Bae YN, Gi YM. et al. The association between serum ferritin and 25-hydroxyvitamin D and metabolic syndrome in Korean women: the Korea National Health and Nutrition Examination Survey 2010-2012. J Clin Biochem Nutr. 2017;61:60-6

47. Soepnel LM, Mabetha K, Draper CE. et al. A cross-sectional study of the associations between biomarkers of vitamin D, iron status, and hemoglobin in south African women of reproductive age: the healthy life trajectories initiative, south Africa. Curr Dev Nutr. 2023;7:100072

48. Michalski ES, Nguyen PH, Gonzalez-Casanova I. et al. Serum 25-hydroxyvitamin D but not dietary vitamin D intake is associated with hemoglobin in women of reproductive age in rural northern Vietnam. J Clin Translat Endoc. 2017;8:41-8

Author contact

Corresponding address Corresponding authors: Jane C.-J. Chao, Tel: +886-2-27361661 ext. 6548, E-mail: chenjuiedu.tw; Yi-Chun Chen, Tel: +886-2-27361661 ext. 6559, E-mail: yichunedu.tw.


Citation styles

APA
Das, A., Hsu, C.Y., Bai, C.H., Chang, J.S., Huang, Y.L., Wang, F.F., Nguyen, N.T.H., Chen, Y.C., Chao, J.C.J. (2026). Comparative analysis of two analytical methodologies for deriving dietary patterns associated with vitamin D insufficiency and anemia among expectant mothers. International Journal of Medical Sciences, 23(1), 325-333. https://doi.org/10.7150/ijms.124006.

ACS
Das, A.; Hsu, C.Y.; Bai, C.H.; Chang, J.S.; Huang, Y.L.; Wang, F.F.; Nguyen, N.T.H.; Chen, Y.C.; Chao, J.C.J. Comparative analysis of two analytical methodologies for deriving dietary patterns associated with vitamin D insufficiency and anemia among expectant mothers. Int. J. Med. Sci. 2026, 23 (1), 325-333. DOI: 10.7150/ijms.124006.

NLM
Das A, Hsu CY, Bai CH, Chang JS, Huang YL, Wang FF, Nguyen NTH, Chen YC, Chao JCJ. Comparative analysis of two analytical methodologies for deriving dietary patterns associated with vitamin D insufficiency and anemia among expectant mothers. Int J Med Sci 2026; 23(1):325-333. doi:10.7150/ijms.124006. https://www.medsci.org/v23p0325.htm

CSE
Das A, Hsu CY, Bai CH, Chang JS, Huang YL, Wang FF, Nguyen NTH, Chen YC, Chao JCJ. 2026. Comparative analysis of two analytical methodologies for deriving dietary patterns associated with vitamin D insufficiency and anemia among expectant mothers. Int J Med Sci. 23(1):325-333.

This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/). See https://ivyspring.com/terms for full terms and conditions.
Popup Image