- Research article
- Open Access
Paraspinal back muscles in asymptomatic volunteers: quantitative and qualitative analysis using computed tomography (CT) and magnetic resonance imaging (MRI)
BMC Musculoskeletal Disorders volume 21, Article number: 403 (2020)
To evaluate paraspinal back muscles of asymptomatic subjects using qualitative and quantitative analysis on CT and MRI and correlate the results with demographic data.
Twenty-nine asymptomatic subjects were enrolled prospectively (age: mean 34.31, range 23–50; 14 men, 15 women) from August 2016 to April 2017. Qualitative analysis of muscles was done using Goutallier’s system on CT and MRI. Quantitative analysis entailed cross sectional area (CSA) on CT and MRI, Hounsfield unit (HU) on CT, fat fraction using two-point Dixon technique on MRI. Three readers independently analyzed the images; intra- and inter-observer agreements were measured. Linear regression and Spearman’s analyses were used for correlation with demographic data.
CSA values were significantly higher in men (p < 0.001). Fat fraction was higher (22.53% vs. 14.35%) and HU lower (36.00 vs. 47.43) in women (p < 0.001). Intra- and inter-observer reliabilities of the two methods were greater than 0.8, except for CSA of L5/S1 on MRI; however, regarding quantitative analysis, decreasing HU and increasing fat fraction were correlated with increasing age, female gender and lower lumbar segment (p < 0.001).
MRI and CT can be reliably used for qualitative and quantitative analysis of paraspinal back muscles, regarding fat content. Fat fraction and HU showed highest reliabilities.
Atrophy and degenerative changes in the back muscles are known to be associated with chronic lower back pain [1, 2]. Studies have been conducted to investigate the relationship between low back pain, sarcopenia, and related pathologies [1, 3,4,5]. However, there have been few studies on analysis of paraspinal back muscles in asymptomatic persons [6,7,8,9,10] and fewer studies in young adults [1, 6, 10]. Studies of lumbar paraspinal muscles are divided into two categories: qualitative and quantitative analysis by cross-sectional area (CSA). For qualitative analysis, Goutallier grade (GG) system has been used extensively in CT and MRI studies of rotator cuff muscles [10,11,12]. However, there has been no standardized grading system to evaluate degenerative changes of paraspinal muscles in patients; several studies have applied the GG system to lumbar paraspinal muscles [2, 6, 10, 11, 13]. Lumbar paraspinal muscles are composed of various muscles and degree of fatty change varies according to lumbar level 18; therefore, GG system has to be further validated in application to paraspinal muscles.
Some quantitative studies of paraspinal muscles have used the ratio of CSAs on CT or MRI [3, 5, 14,15,16,17]. However, it may be difficult to generalize CSAs to represent degenerative changes in lumbar paraspinal muscles because of differences in people’s body composition. There have been studies evaluating functional CSA (fCSA), measuring areas without fatty changes, or total CSAs of the paraspinal muscles [3, 7, 16,17,18]. Other studies have used various fat quantification techniques to overcome shortcomings of quantitative techniques, including MR spectroscopy and chemical shift [1, 2, 4, 6, 10, 19, 20]. We used the two-point dixon technique because fat fraction (FF) can readily be obtained on a clinical scanner within reasonable time .
Although there have been several studies evaluating quality of lumbar paraspinal muscles, few studies have used CT and MRI comparing qualitative and quantitative methods ; fewer studies evaluated all lumbar segments. Therefore, in this study, we analyzed the lumbar paraspinal muscles both quantitatively and qualitatively on CT and MRI at all lumbar segments in young asymptomatic adults, regarding reliability of these methods and analyzed the correlation with demographic variables, especially regarding fat content.
This was a prospective study of asymptomatic healthy volunteers (age: mean 34.31, range 23–50; 14 men, 15 women) from August 2016 to April 2017; they were recruited from a health screening program, which usually gives the choice of having CT or MRI performed upon the subjects’ choice; Institutional Review Board approval and written informed consent were obtained. Exclusion criteria were previous procedure and/or surgery of spine, hip, or knee, poliomyelitis or congenital anomalies in spine or lower extremity, and contraindications for MRI. Twenty-nine subjects were enrolled, clinically examined, and measured for height and weight with body mass index (BMI) as weight/height2 (kg/m2); all subjects underwent lumbar CT and MRI according to standardized protocols.
CT and MR imaging
Axial and sagittal reformatted images were acquired on a multidetector CT scanner (Somatom Definition AS or Somatom Definition Flash, Siemens, Erlangen, Germany) from T12 upper endplate margin to S2 lower endplate margin. CT scanning parameters were as follows: 100 to 120 kV, 250–750 mAs, 0.6 mm collimation, and 2 mm slice thickness. MRI was performed using a 3.0 T scanner (Skyra; Siemens, Erlangen, Germany). T2-weighted FSE (fast spin echo) axial and sagittal, T1-weighted axial sequences were acquired from L1–2 to L5-S1 centered at each intervertebral disc. Additionally, axial two-point Dixon sequence was obtained parallel to each vertebra (Table 1).
Two musculoskeletal radiologists (reader 1 with six-year experience, reader 3 with four-year experience) and a trainee (reader 2) independently analyzed images on Picture archiving and communicating system (PACS; Infinitt Co., Ltd., Seoul, Korea). We obtained axial images parallel to intervertebral discs on CT and MRI and selected an axial plane passing through the disc center (Fig. 1). On axial CT and MRI, regions of interest (ROIs) were manually drawn along the thoracolumbar fascia (Fig. 2); this corresponded to the “total CSA” referred to in previous studies, not “fCSA” [9, 16]. We assessed inter- and intraobserver reliabilities of qualitative and quantitative measurements on CT and MRI. First, the three readers were trained by one experienced observer how to draw ROIs and measure GGs under consensus and analyzed images blinded to each other’s results. Measurements were repeated after 2 weeks for intraobserver agreement. We noted the presence of disc pathologies if there were any.
Paraspinal muscles within the thoracolumbar fascia were evaluated using GGs on CT and MRI at L1–2 to L5-S1 segments as follows: 0 - all muscle, no fat, 1 – fatty streaks within muscle or fat stripe around lamina and facet joint, 2 - more muscle than fat, 3 - muscle equal to fat, 4 - more fat than muscle (Fig. 3).
On CT, three radiologists drew ROIs on axial images to measure Hounsfield units (HU) and CSAs (Fig. 2a). On MRI, in-phase and fat only image were obtained using two-point Dixon method with ROIs drawn on in-phase image (Fig. 2b), which were copied and pasted onto fat only image (Fig. 2c); CSA and mean signal intensity (SI) were measured. FFs were calculated by dividing SI of the fat image ROI by SI of the in-phase image ROI. Additionally, on both CT and MRI, the values were corrected using the ROI of the vertebral body (VB) to reduce bias such as body size and gender that could affect the CSA. The ROI was drawn along the margin of the VB at the inferior endplate of each lumbar level and then divides the measured CSA by this value [19, 22].
Demographic data were analyzed using T-test for parametric variables and Mann-Whitney U-test for non-parametric variables. For qualitative analysis, Kappa statistic was used for intra-observer agreement, Kendall’s coefficient of concordance for inter-observer agreement. For quantitative analysis, intra-class correlation coefficient (ICC) was used for intra- and inter-observer agreements. Since GGs of paraspinal muscles were analyzed at 5 lumbar segments, it was impossible to obtain one representative value. We evaluated the relationship between demographic variables, including age, sex, BMI, and quantitative values. Also, we analyzed associations between quantitative values and each lumbar segment, i.e. whether there was an increase according to lumbar segment. Associations were analyzed using simple linear regression analysis; additional multiple linear regression analysis was performed for age, sex, and BMI, regarding their influence. Relationship between GG and quantitative values were analyzed using simple linear regression. Spearman’s correlation analysis was used to evaluation of correlation between mean FF/HU and each lumbar segment, and then determine which lumbar segment best reflected the mean FF and HU. Kappa values can be interpreted as follows: under 0.20 slight agreement, 0.21–0.40 fair agreement, 0.41–0.60 moderate; correlation coefficient (r) indicated the degree of relevance as follows: 0.2–0.4 week, 0.4–0.7 moderate, 0.7–0.9 strong, over 0.9 very strong. Statistical analyses were performed using SPSS 20.0 software (SPSS Inc., Chicago, IL, USA) with significance for P < .05.
Regarding demographic data, height, weight, and BMI were significantly higher in males as expected (Table 2). Upon analysis of incidental disc pathologies, the results were as follows: there 5 disc protusions in L3/4, L4/5, L5/S1 and no disc extrusion. Qualitative and quantitative values of paraspinal muscles of each lumbar segment are summarized (Table 3). GGs scored between 0 and 2 on CT and MRI. Mean GG increased down the lumbar segments and was higher in women; GGs on MRI were higher than CT in both genders.
Mean CSA of men and women were 2519.67 mm2/2297.59mm2 and 1848.24 mm2/1729.00mm2 on CT/MRI, respectively: mean CSA and the CSA ratio of men were mostly significantly higher than women on CT/MRI. CSAs decreased at lower segments. CSAs on CT and MRI were comparable, except for L5-S1 level. Mean HU values were 47.43(±4.30) in men and 36.00(±6.38) in women; HU values decreased at lower levels. Mean FF was 14.35%(±3.68) in men and 22.53%(±5.93) in women; FFs increased at lower segments. On analysis of incidental disc pathologies and degenerative changes of 145 lumbar segments, 6 disc protrusions and 11 bulging discs were found, however, all asymptomatic.
Intra- and inter-observer reliability
For intraobserver reliability, regarding qualitative analysis on CT, in reader 1, there was substantial to almost perfect agreement (Kappa value 0.78–0.84) in all levels except for L1–2 level. In readers 2 and 3, Kapppa values were 0.74–1.0 and 0.76–1.0, respectively. On MRI, values of reader 1, 2, and 3 ranged between 0.78–1.0, 0.73–0.93, and 0.81–1.0, respectively. Regarding quantitative analysis, Kappa values were higher than 0.9 for FF and HU in all readers. For CSAs, values were 0.9 or higher at all levels, except for L5/S1 level on MRI in all readers.
For inter observer reliability, ICCs for GGs were 0.83–0.92 and 0.87–0.93 for CT and MRI, respectively. ICCs for CSA on CT were 0.95–0.97, on MRI were 0.94–0.98 at levels L1–2 to L4–5; the lowest ICC was 0.61 at level L5-S1. For HU and FFs on CT and MRI, ICCs were 0.9 or higher at all lumbar segments.
Mean quantitative values were analyzed regarding the relationship between age, sex, and BMI; the influence of each variable was measured by simple regression analysis (Table 4). CSAs showed no significant difference according to lumbar segment on CT but significantly decreased down lower segments on MRI (p < 0.001). HU decreased and FF significantly increased down lumbar segments (p < 0.001). Additionally, we analyzed correlation between qualitative and quantitative values; right and left side values were comparable. As GGs increased, CSA decreased on CT and MRI, respectively, which was significant on CT (p = 0.002) but not on MRI. However, as GGs increased, HU decreased and FF increased (p < 0.001).
Regarding CSAs on CT and MRI, there were significant correlations with sex and BMI (Table 5). CSA was smaller in women and as BMI increased, CSA significantly increased on CT and MRI (p < 0.001). HU and FF were statistically correlated with age and sex but not with BMI. Lower HU and higher FF were observed in women (p < 0.001). Gender was the most influential variable for HU and FF; BMI was the most influential variable for CSA.
When each lumbar level was analyzed for the most representative FF and HU, FF at L3–4 was the most representative in all readers; FF showed a strong correlation with r value of 0.9 or more, except at L5-S1 level. HU on CT showed the highest values at levels L2–3 and L4–5, with r values of 0.9 or higher.
Muscle degeneration with fatty change is associated with lumbar spine pathology affecting disease progression and life quality [5, 15, 20, 23]. Although CT and MRI are known to be accurate and sensitive in muscle evaluation [24, 25], there has been a few studies on paraspinal muscles in asymptomatic young adults using both modalities [6,7,8,9, 19]. As MRI techniques have evolved recently, validation and comparison in healthy adults are needed. In our study, subjects were relatively young adults, matched by gender with a mean BMI of 23.0 kg/m2.
GGs ranged between 0 and 2 in our study, consistent with previous studies [2, 10], increasing down lower lumbar segments, showing highest degree of fatty degeneration in L5-S1 segment, consistent with previous studies . Although GGs correlated with quantitative analysis, we found a limitation in applying GGs to evaluate back muscles of young adults since higher degree of muscle degeneration (grades 3, 4) were not seen and fatty changes were seen only in parts of paraspinal muscles close to spine; in spite of this, interobserver agreement of GGs was high.
GGs showed statistically significant differences between genders on CT and MRI; scores on MRI tended to be higher because MRI is more sensitive in detecting fat [19, 20, 27]. However, regarding CSA values, there was no statistically significant difference between CT and MRI, except for L5-S1 segment. Axial images using two-point Dixon technique were acquired parallel to each vertebra unlike conventional axial images. The angle between L5 and S1 vertebral bodies was large, and so CSA on two-point Dixon sequence and axial plane of CT were measured differently, resulting in lower interobserver agreement at L5-S1 segment on MRI.
CSAs were significantly higher in men on CT and MRI but did not show a significant difference according to age, consistent with previous studies [6, 14] and contradictory to other studies hypothesizing increased degenerative muscle changes with aging [15, 23]. This may be due to our study population, consisting of relatively young asymptomatic volunteers. Increased CSA was strongly associated with increasing BMI. As CSA does not accurately reflect muscle quality in young adults, BMI has limitations in assessing muscle quality. In our study, association of BMI with HU or FF was not statistically significant. Only quantifying fat could accurately evaluate the quality of paraspinal muscles.
HU and FF data were consistent with the hypothesis that fatty degeneration of paraspinal muscles increases with age [6, 15]; fatty change tended to increase down lower segments, more pronounced in women, consistent with previous studies [20, 28]. Fortin et al.  proposed the most abundant fatty change to occur at L5-S1 segment because the gravitational centerline passes through this segment rendering it to be the most weight-bearing; also, a larger lordotic angle and greater motion at this level may contribute to this finding .
Previous studies have attempted to find the lumbar level, most representative of the mean ; Crawford et al.  noted L4–5 segment to be the most representative FF on MRI. We expected that the mid-lumbar level would best reflect fatty changes, because fatty change increased down lumbar levels, reflected in decreasing HU and increasing FF. From our results, all lumbar segments seemed to reflect the mean value well on CT and MRI. This may be because the study included only asymptomatic adults and difference in the correlation coefficient among the levels was not significant. The level that best reflected the mean was L2–3 segment for HU and L3–4 segment for FF; correlation coefficient was lower at L5-S1 segment than other levels. We hypothesized a fatty change with muscle atrophy in the lower lumbar segments, most prominent in L5/S1. Similar to our study, a 15-year prospective study of quantitative analysis of paraspinal muscles also showed greater atrophy and fatty changes in muscle at L5/S1 .
Regarding qualitative analysis, intra- and interobserver agreements for CSA were high, consistent with previous studies [9, 18], confirming CSA reliable for muscle evaluation. Also using FF and HU, more reliable values were obtained, comparable to CSA. Recently, two-point Dixon technique has been used to quantify fat in paraspinal muscles . Since two-point Dixon technique acquires four phases with one acquisition, images are obtained in a reasonable scan time. In addition, on CT, it is easy to measure the degree of fatty change using only HU. Therefore, FF and HU are feasible and reliable tools for evaluation of muscle quality, especially regarding fatty degeneration.
This study has several limitations. First, sample size was small and did not include a wide age range. However, we obtained data of lumbar paraspinal muscles in asymptomatic relatively young adults, which have not been studied much. Secondly, this study reflects the characteristics of a certain ethnicity and may not reflect a diverse population. Thirdly, although asymptomatic healthy adults were recruited, there were some disc pathologies; however, they were all asymptomatic and probably did not have a significant effect on our results. Fourthly, there was no gold standard for muscle degeneration; however, it is neither feasible nor ethical to obtain muscle tissue specimens in a study like this and lack of gold standard is exactly the reason for such studies. Finally, we did not subdivide muscle groups; we analyzed all muscles in the thoracolumbar fascia as a whole as this seemed to be a simple and easy method.
Female, older age, and lower lumbar segment were associated with higher fat content of paraspinal muscles. MRI and CT can be reliably used for qualitative and quantitative analyses of paraspinal back muscles in young healthy adults, especially regarding fat content, with good correlation between the two methods. FF and HU could be useful tools for evaluating muscle degeneration with fatty change in paraspinal muscles. The level that best reflected the mean was L2–3 segment for HU and L3–4 segment for FF. This study could serve as a baseline study for future studies regarding muscles.
Availability of data and materials
The dataset analyzed are not publicly available but are available from the corresponding author on reasonable request.
Magnetic resonance imaging
Cross sectional area
Functional cross sectional area
Body mass index
Fast spin echo
Regions of interests
Intra-class correlation coefficient
Paalanne N, Niinimaki J, Karppinen J, Taimela S, Mutanen P, Takatalo J, et al. Assessment of association between low back pain and paraspinal muscle atrophy using opposed-phase magnetic resonance imaging: a population-based study among young adults. Spine (Phila Pa 1976). 2011;36(23):1961–8.
Mengiardi B, Schmid MR, Boos N, Pfirrmann CW, Brunner F, Elfering A, et al. Fat content of lumbar paraspinal muscles in patients with chronic low back pain and in asymptomatic volunteers: quantification with MR spectroscopy. Radiology. 2006;240(3):786–92.
Fortin M, Lazary A, Varga PP, McCall I, Battie MC. Paraspinal muscle asymmetry and fat infiltration in patients with symptomatic disc herniation. Eur Spine J. 2016;25(5):1452–9.
Bhadresha A, Lawrence OJ, McCarthy MJ. A comparison of magnetic resonance imaging muscle fat content in the lumbar Paraspinal muscles with patient-reported outcome measures in patients with lumbar degenerative disk disease and focal disk prolapse. Global Spine J. 2016;6(4):401–10.
Wan Q, Lin C, Li X, Zeng W, Ma C. MRI assessment of paraspinal muscles in patients with acute and chronic unilateral low back pain. Br J Radiol. 2015;88(1053):20140546.
Crawford RJ, Filli L, Elliott JM, Nanz D, Fischer MA, Marcon M, et al. Age- and level-dependence of fatty infiltration in lumbar paravertebral muscles of healthy volunteers. AJNR Am J Neuroradiol. 2016;37(4):742–8.
Niemelainen R, Briand MM, Battie MC. Substantial asymmetry in paraspinal muscle cross-sectional area in healthy adults questions its value as a marker of low back pain and pathology. Spine (Phila Pa 1976). 2011;36(25):2152–7.
Hides J, Gilmore C, Stanton W, Bohlscheid E. Multifidus size and symmetry among chronic LBP and healthy asymptomatic subjects. Man Ther. 2008;13(1):43–9.
Danneels LA, Vanderstraeten GG, Cambier DC, Witvrouw EE, De Cuyper HJ. CT imaging of trunk muscles in chronic low back pain patients and healthy control subjects. Eur Spine J. 2000;9(4):266–72.
Yanik B, Keyik B, Conkbayir I. Fatty degeneration of multifidus muscle in patients with chronic low back pain and in asymptomatic volunteers: quantification with chemical shift magnetic resonance imaging. Skeletal Radiol. 2013;42(6):771–8.
Battaglia PJ, Maeda Y, Welk A, Hough B, Kettner N. Reliability of the Goutallier classification in quantifying muscle fatty degeneration in the lumbar multifidus using magnetic resonance imaging. J Manipulative Physiol Ther. 2014;37(3):190–7.
Goutallier D, Postel JM, Bernageau J, Lavau L, Voisin MC. Fatty muscle degeneration in cuff ruptures. Pre- and postoperative evaluation by CT scan. Clin Orthop Relat Res. 1994;304:78–83.
Bumann H, Nuesch C, Loske S, Byrnes SK, Kovacs B, Janssen R, et al. Severity of degenerative lumbar spinal stenosis affects pelvic rigidity during walking. Spine J. 2020;20(1):112–20.
Shahidi B, Parra CL, Berry DB, Hubbard JC, Gombatto S, Zlomislic V, et al. Contribution of lumbar spine pathology and age to Paraspinal muscle size and fatty infiltration. Spine (Phila Pa 1976). 2017;42(8):616–23.
Takayama K, Kita T, Nakamura H, Kanematsu F, Yasunami T, Sakanaka H, et al. New predictive index for lumbar Paraspinal muscle degeneration associated with aging. Spine (Phila Pa 1976). 2016;41(2):E84–90.
Fortin M, Videman T, Gibbons LE, Battie MC. Paraspinal muscle morphology and composition: a 15-yr longitudinal magnetic resonance imaging study. Med Sci Sports Exerc. 2014;46(5):893–901.
D'Hooge R, Cagnie B, Crombez G, Vanderstraeten G, Dolphens M, Danneels L. Increased intramuscular fatty infiltration without differences in lumbar muscle cross-sectional area during remission of unilateral recurrent low back pain. Man Ther. 2012;17(6):584–8.
Hu ZJ, He J, Zhao FD, Fang XQ, Zhou LN, Fan SW. An assessment of the intra- and inter-reliability of the lumbar paraspinal muscle parameters using CT scan and magnetic resonance imaging. Spine (Phila Pa 1976). 2011;36(13):E868–74.
Hyun SJ, Bae CW, Lee SH, Rhim SC. Fatty degeneration of the Paraspinal muscle in patients with degenerative lumbar kyphosis: a new evaluation method of quantitative digital analysis using MRI and CT scan. Clin Spine Surg. 2016;29(10):441–7.
Sasaki T, Yoshimura N, Hashizume H, Yamada H, Oka H, Matsudaira K, et al. MRI-defined paraspinal muscle morphology in Japanese population: the Wakayama spine study. PLoS One. 2017;12(11):e0187765.
Ma J. Dixon techniques for water and fat imaging. J Magn Reson Imaging. 2008;28(3):543–58.
Arbanas J, Pavlovic I, Marijancic V, Vlahovic H, Starcevic-Klasan G, Peharec S, et al. MRI features of the psoas major muscle in patients with low back pain. Eur Spine J. 2013;22(9):1965–71.
Thompson DD. Aging and sarcopenia. J Musculoskelet Neuronal Interact. 2007;7(4):344–5.
Cooper C, Fielding R, Visser M, van Loon LJ, Rolland Y, Orwoll E, et al. Tools in the assessment of sarcopenia. Calcif Tissue Int. 2013;93(3):201–10.
Shaw SC, Dennison EM, Cooper C. Epidemiology of sarcopenia: determinants throughout the Lifecourse. Calcif Tissue Int. 2017;101(3):229–47.
Sun D, Liu P, Cheng J, Ma Z, Liu J, Qin T. Correlation between intervertebral disc degeneration, paraspinal muscle atrophy, and lumbar facet joints degeneration in patients with lumbar disc herniation. BMC Musculoskelet Disord. 2017;18(1):167.
Kang CH, Shin MJ, Kim SM, Lee SH, Lee CS. MRI of paraspinal muscles in lumbar degenerative kyphosis patients and control patients with chronic low back pain. Clin Radiol. 2007;62(5):479–86.
Kalichman L, Carmeli E, Been E. The association between imaging parameters of the Paraspinal muscles, spinal degeneration, and low Back pain. Biomed Res Int. 2017;2017:2562957.
We thank Deok Yeon Jo, MS, for helping in statistical analysis and editing of the manuscript.
This study was supported by Hallym University Research Fund 2016 (HURF-2016-35) and by a grant from the Central Medical Service (CMS) Research Fund.
Ethics approval and consent to participate
This study has been approved by the Hallym University Dontan Sacred Heart Hospital IRB (2016–246-I) and written informed consent was obtained.
Consent for publication
The authors declare that they have no conflict of interest.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Khil, E.K., Choi, J., Hwang, E. et al. Paraspinal back muscles in asymptomatic volunteers: quantitative and qualitative analysis using computed tomography (CT) and magnetic resonance imaging (MRI). BMC Musculoskelet Disord 21, 403 (2020). https://doi.org/10.1186/s12891-020-03432-w
- Cross-sectional area
- Fatty infiltration
- Muscle atrophy
- Two point Dixon
- Fat fraction
- Goutallier score
- Paraspinal muscle