Age- and muscle-specific reliability of muscle architecture measurements assessed by two-dimensional panoramic ultrasound
BioMedical Engineering OnLine volume 21, Article number: 15 (2022)
Age-related changes in muscle properties affect daily functioning, therefore a reliable assessment of such properties is required. We examined the effects of age on reliability, muscle quality and interrelation among muscle architecture (MA) parameters of the gastrocnemius medialis (GM), tibialis anterior (TA), and vastus lateralis (VL) muscles.
Three raters scored ultrasound (US) scans of 12 healthy younger and older adults, on fascicle length (FL), pennation angle (PA) and muscle thickness (MT). Intra- and inter-rater reliability of MA measures in rest and contraction was assessed by intraclass correlation coefficients (ICC) and standard error of measurements (SEM, SEM%). The relationship between MA parameters was examined using Pearson correlation coefficients. Muscle quality (MQ) was examined using mean pixel intensity.
Reliability was moderate to excellent for TA in both groups (ICCs: 0.64–0.99, SEM% = 1.6–14.8%), and for VL in the younger group (ICCs: 0.67–0.98, SEM% = 2.0–18.3%). VL reliability was poor to excellent in older adults (ICCs: 0.22–0.99, SEM% = 2.7–36.0%). For GM, ICCs were good to excellent (ICCs: 0.76–0.99) in both groups, but GM SEM% were higher in older adults (SEM%Younger = 1.5–10.7%, SEM%Older = 1.6–28.1%). Muscle quality was on average 19.0% lower in older vs. younger adults. In both groups, moderate to strong correlations were found for VL FL and MT (r ≥ 0.54), and TA PA and MT (r ≥ 0.72), while TA FL correlated with MT (r ≥ 0.67) in younger adults only.
In conclusion, age- and muscle-specificities were present in the relationships between MT and PA, and MT and FL at rest. Furthermore, the reliability of MA parameters assessed with 2D panoramic US is acceptable. However, the level of reliability varies with age, muscle and MA measure. In older adults notably, the lowest reliability was observed in the VL muscle. Among the MA parameters, MT appears to be the simplest and most easily reproducible parameter in all muscles and age groups.
Healthy aging is associated with a transformation of muscle composition . This age-related change in muscle quality, namely the proportion of contractile over non-contractile tissue, results in an impaired force-generating capacity and increased intramuscular fat , which eventually impact skeletal muscle function . Additionally, muscle quality alterations are related to muscle thickness and the arrangement of muscles fibers, i.e., muscle architecture (MA) , which are strong determinants of muscle mass and muscle strength, respectively [4,5,6]. Therefore, the use of muscle quality and MA as biomarkers of skeletal muscle function, as well as for diagnostic and evaluation of exercise interventions, is important but relies on good reliability of the measurement procedure.
MA is quantified by fascicle length (FL), pennation angle (PA) and muscle thickness (MT), which are all affected by age [7,8,9,10,11,12]. For example, MT of the gastrocnemius medialis and vastus lateralis muscles is negatively correlated (r = − 0.4) with age , and both FL and PA of the gastrocnemius medialis muscle decline by ~ 10% by age 81 years . Muscle status is often assessed by brightness-mode (B-mode) ultrasound (US) when priority is given to low-cost, rapid and non-invasive assessment [13,14,15]. Aggregate evidence suggests that US MA reliability estimates are highly variable and inconsistent in a variety of muscles and across studies [16,17,18,19]. One source of inconsistency is the use of incorrect types of intraclass correlation coefficients, which could inflate reliability or mask unreliability of MA parameters [16, 17, 19]. A second source of inconsistency is the use of reliability estimates of MA parameters that are not comparable between studies [16, 17, 19]. It is also often the case that due to a large number of images, two or more raters determine MA, however the reliability between raters analyzing the same image is scarcely reported . Furthermore, reliability of US estimates of MA has only been scarcely examined in the older population. Therefore, the potential effects of aging on reliability, which results from the age-related alterations in muscle quality and architecture, remain unclear.
In fact, the age-related changes in muscle quality vary per individual, depend on co-existing pathologies and are reflected in sonographic changes, of which some of the most reoccurring ones are changes in echogenicity and loss of heterogeneity . These changes concern, respectively, the ability of the tissue to reflect echo waves and the changes in the proportion of contractile and non-contractile tissue. Both these properties are quantifiable by ultrasound measures of muscle quality, in particular echo intensity (EI) . Therefore, if both the MA reliability of lower limb muscles in aging and the effects of muscle quality on MA reliability remain unclear, the potential value of ultrasound for diagnostic and prognosis is also debatable. Additionally, since MA is a strong determinant of muscle strength [4,5,6], the conclusions on age-related loss of strength and low physical performance based on US measurements are less impactful, and so is the relationship between these metrics and muscle loss, or sarcopenia, which is associated with mobility-disability and mortality [23,24,25,26].
Although the use of US in clinical settings is promising , on the basis of the evidence presented, a clarification of the reliability of MA features for lower extremity muscles in older adults would increase the clinical value of MA and would allow clinicians to make more informed treatment decisions. Therefore, the purpose of this study is to provide a comprehensive analysis of relative and absolute intra-rater and inter-rater reliability of muscle architecture, namely FL, PA, and MT using panoramic two-dimensional (2D) US of three lower extremity muscles at rest and during contraction in healthy younger and older adults. The tibialis anterior, vastus lateralis and gastrocnemius medius muscles were targeted since their MA is affected by aging and because of the important role they play during balance control while standing [28, 29] and power production during walking [30, 31]
The second aim is to elucidate the effect of age on MA reliability and measures of muscle quality. Accordingly, the association between different MA parameters, will also be examined. This process would ultimately help reveal not only the structure of interdependence among MA parameters, but also facilitate the selection of those parameters that are reliable while being highly correlated to other MA parameters. We expect to find that reliability of MA parameters would vary by muscle. Furthermore, we expect significant positive correlations between FL and MT and FT and PA [32, 33]. As age affects muscles, connective tissue volume and intramuscular fat in the lower extremity muscles differently , which in turn affect muscle quality , we expect that reliability of MA would be lower in older compared with younger adults.
Reliability of MA parameters
Table 1 shows the demographics and muscle property data.
Table 2 shows the ICCs for each rater and ICC averaged over the three raters for PA, FL, and MT in the three muscles at rest and during contraction in the two age groups. All ICCs were significant. For the GM, average intra-rater ICCs were good to excellent (ICCs = 0.78–0.99) in the two age groups. Table 3 shows the corresponding absolute SEM and relative SEM values. Averaged intra-rater SEM values for the GM muscle were ≤ 1.84° for PA, ≤ 0.46 cm for FL and ≤ 0.05 cm for MT in both groups. For the TA, relative intra-rater reliability was moderate to excellent (ICCs = 0.70–0.98) in both age groups, and absolute SEM values were ≤ 1.24° for PA, ≤ 0.61 cm for FL and ≤ 0.04 cm for MT. For the VL, relative intra-rater reliability was moderate to excellent (ICCs = 0.59–0.98) in the two age groups. Corresponding absolute SEM values were ≤ 1.29° for PA in both groups. For FL, SEMs were ≤ 1.01 cm in the old group and ≤ 0.75 cm in the younger group. For MT, SEM values were ≤ 0.13 cm in the two age groups.
Table 4 shows the ICCs and SEMs for the inter-rater reliability for PA, FL, and MT in the three muscles at rest and during contraction in the two age groups. For the MA of the GM muscle, inter-rater reliability was good to excellent (ICCs = 0.76–0.99, all P < 0.05) in the two age groups. Absolute PA SEM values were below 1.83° in the younger group and ranged between 1.80° to 3.63°in the older group. FL SEMs were ≤ 0.55 cm in the younger group, and ranged between 1.03 and 1.84 cm in the older group. Absolute MT SEM values were ≤ 0.06 cm in the two age groups. Inter-rater reliability for TA MA was moderate to excellent (ICCs = 0.64–0.99, all P < 0.05) in the two age groups. SEM values for the PA, FL and MT of the TA were, respectively, ≤ 2.89°, ≤ 0.93 cm, and ≤ 0.05 cm. Inter-rater reliability for VL MA was moderate to excellent (ICCs = 0.67–0.98, all P < 0.05) in the younger group and poor to excellent in the older group (ICCs = 0.22–0.99). More specifically, in the older group, ICCs varied from 0.44 to 0.55 for PA, between 0.22 and 0.37 for FL, and between 0.94 and 0.99 for MT. However, the ICCs values for PA and FL in the older group were not significant. PA SEM values were ≤ 2.60° in the two age groups. FL SEM values were ≤ 4.05 cm in the older group and ≤ 2.04 cm in the younger group. SEM MT values were ≤ 0.17 cm in the younger group and ≤ 0.09 cm in the older group.
Echo intensity and correlations among MA parameters
Figure 1 shows the echo intensity values at rest and during contraction in the three muscles and the two age groups. Mean and SD values of the echo intensity values are presented in Additional file 1: Table S1.
For the GM, EI in older adults was higher by 21.4% (Cohens’ d = 1.35) and 17.2% (Cohen’s d = 1.14) at rest and during contraction respectively, compared to younger adults. In older adults, TA EI was higher by 19.7% at rest (Cohens’ d = 1.14) and 28.0% (Cohens’ d = 1.47) during contraction, compared to younger adults. For the VL at rest, EI was 17.2% (Cohens’ d = 0.63) higher in older compared to younger adults, whereas during contraction it was 10.2% (Cohens’ d = 0.63) lower in younger adults. Large effects are reported in all muscles and conditions, except for the VL in contraction which showed a medium effect. Differences in EI between younger and older adults were only significant for GM in rest, TA in contraction and VL in rest (P < 0.05).
Figure 2 shows the correlations between MA parameters at rest and during contraction in the three muscles and two age groups. In the younger group, GM, MT, and PA correlated moderately (r = 0.68, P < 0.05) at rest , but not during contraction (r = 0.22). In the older group, no correlations between MT and PA were found in the GM (rrest = 0.01; rcon = 0.11). MT and PA correlated moderate to strong at rest and during contraction in TA in both groups (Young: rrest = 0.83, rcon = 0.70; Old: rrest = 0.78, rcon = 0.72, all P < 0.05). In VL, no correlation was found in the younger group at rest (r = 0.13), but a moderate correlation was found during contraction (r = 0.54, P < 0.05). MT and PA did not correlate in both conditions in older group (rrest = 0.01, rcon = 0.11, respectively).
MT and FL in GM correlated similarly at rest in the younger (r = 0.55) and older group (r = 0.50), but different during contraction (young: r = 0.72, P < 0.05; old: r = 0.58). MT and FL in TA at rest and in contraction correlated in young (rrest = 0.87, rcon = 0.67, both P < 0.05), but not in older adults (rrest = 0.15, rcon = 0.19). MT and FL in VL at rest and during contraction correlated moderately to strongly in both age groups (young: rrest = 0.77; rcon = 0.54; old: rrest = 0.75; rcon = 0.70, all P < 0.05).
Age-related changes in MA parameters impact daily function. Reliable estimates of such measures are important for diagnostics and treatment of muscle-related disorders such as sarcopenia. A comprehensive reliability assessment of MA parameters in lower leg muscles has been lacking so far, especially in older adults. In the present study, we aimed to determine the relative and absolute intra-rater and inter-rater reliability of MA measured using 2D panoramic US in GM, TA, and VL at rest and during contraction. Furthermore, we wanted to elucidate the effects of age on both MA reliability and muscle quality, as well as on the interdependence of different MA parameters. As hypothesized, we found age- and muscle-specificity in the interdependence of MA parameters and in the reliability of MA parameters. Furthermore, as expected, muscle quality, expressed by EI, decreased with age, which could underlie the age-sensitivity of MA reliability estimates.
Overall, the values for FL, PA, and MT in the three muscles (Table 1) were within the limits of those reported previously [7, 8, 36]. Intra- and inter-rater reliability of the MA parameters was good to excellent in GM (ICCs: 0.76–0.99) and moderate to excellent in TA (ICCs: 0.64–0.99) in each age group (Tables 2, 4). In contrast, intra- and inter-rater reliability of MA parameters in VL was age-dependent, as it was moderate to excellent (ICCs: 0.67–0.98) in younger and poor to excellent (ICCs: 0.22–0.99) in older adults. These results are in line with the aggregate data reviewed previously [16,17,18]. However, when comparing our data with the data presented in these reviews, several points should be considered. First, the studies reviewed focused on test–retest, between scan or inter-operator ICCs. Second, different types or even incorrect reliability estimates were used and/or the type and model of the ICC were not reported. This may have inflated reliability or masked unreliability [16, 17, 19]. A study with a comparable methodology reported higher reliability (ICCs: 0.99–1.00) for MA parameters in GM . These higher ICCs are probably the result of a longer probe being used which captured the entire length of the GM (100 mm  vs. 39 mm in the present study). Because panoramic US is sensitive to changes in position of the probe , our probe movement could have reduced the reliability. Furthermore, it is important to notice that in the current study ICCs were calculated based on absolute agreement, which could lower reliability compared to consistency measures. In the current study, SEM% values ranged from 2.1 to 12.9% for intra-rater reliability of PA, FL, and MT in the three muscles. For inter-rater reliability of the MA parameters, SEM% values ranged between 3.5–26.0% in the older group, and between 3.5–22.1% in the younger group.
Comparison of reliability among MT, FL and PA
In all muscles, MT had the highest absolute and relative intra- and inter-rater reliability (ICCs > 0.87and SEM% < 7.2%) compared to PA (ICCs: 0.44–0.99; SEM%: 4.4–17.5%) and FL (ICCs: 0.22–0.99; SEM%: 2.7–14.0%). Additionally, MT confidence intervals were smaller than those of FL and PA, and did not overlap in most cases with the intervals of FL and PA, implying significant higher reliability for MT. As fascicles have variable lengths and arrangements within a muscle, the associated PA and FL may differ from fascicle to fascicle [4, 38], which may explain the lower reliability found for these two parameters.
Comparison of reliability between rest and contraction
A previous study found improvements in image contrast and measurement accuracy for MA outcomes derived during contraction . In the current study, we observed higher ICC estimates during muscle contraction compared with rest (~ 62%). However, when reliability of MA parameters was poor, as was the case for the FL and PA of the VL in older adults, reliability of MA parameters did not improve with contraction. Confidence intervals between rest and contraction were overlapping in most cases. Therefore, contraction does not seem to affect reliability of MA parameters.
Comparison of reliability between the two age groups
Although ICC intervals are wide and overlap between both age groups, we observed that MA parameters were more variable in older compared to younger adults, in particular the PA and FL in VL, but not in GM and TA (Table 4). This sensitivity is reflected in the lower muscle quality in the older group (Fig. 2), possibly as a result of decreased muscle volume and increased fat infiltration [1, 35]. Furthermore, the larger volume of VL vs. GM and TA could have reduced probe stability during scanning , which influences image quality, since panoramic US is sensitive to probe re-positioning . As older adults exhibit greater relative force fluctuations during isometric contractions [41, 42] and the large volume of VL lengthens scanning time, these force fluctuations could further compromise probe stability, image quality and ultimately the reliability of the VL MA. Our data also draw attention to using both ICC and SEM to assess reliability. While GM FL at rest had good reliability in both age groups (ICCs ≥ 0.76), SEM% were sensitive to age (young: 10.8%, old: 28.1%, Table 4), underlining the importance of assessing both absolute and relative reliability, especially when examining participants prone to muscle atrophy [16, 19].
Muscle quality changes and correlations between MA parameters
As hypothesized, we observed that muscle quality, quantified using EI, was lower (P < 0.05) in older compared to younger adults . The lower muscle quality in VL is consistent with the quadriceps’s lower quality in older adults reported in the literature [43, 44]. The lower muscle quality could have affected the ability to reliably identify the MA parameters, as it appears to be reflected in the larger range of the ICCs found in the older population. Therefore, the current results also highlight the relevance of a MA reliability analysis in an older population, including those having a specific disability that compromises muscle quality. Moreover, the MA data collected in three muscles and two age groups allowed us to examine the age- and muscle-specificity of the relationship among MT, PA, and FL. Such analyses could help us better understand the structural and functional mechanisms underlying senile sarcopenia and muscles’ responses to mechanical loading. This process also helps reveal the interdependence among MA parameters, and facilitates the selection of those parameters that have higher reliability while being highly correlated to other MA parameters. We observed, in agreement with our hypothesis and supported by prior data [32, 33], a muscle-specificity in the relationships between MT and PA, as there was such an association in GM (r = 0.68) and TA (r = 0.83) in younger adults, but not in the VL (r = 0.13) (Fig. 2). The MT–PA relationship was affected by age, as such association was only present in TA (r = 0.72, P < 0.05) in older adults, but not in GM (r = 0.01) and VL (r = 0.07). How age and specific muscles might affect the MT–FL relationship is important for understanding if muscle changes occur uniformly cross-sectionally (MT) and longitudinally (FL) . We observed age- and muscle-specificity in the MT–FL relationship, so that such relationship occurred in both age groups in VL (Fig. 2. Younger: r = 0.75, P < 0.05, older: r = 0.70, P < 0.05), in neither group in GM (younger: r = 0.55; older: r = 0.50) and for the TA only in the younger adults (younger: r = 0.87, P < 0.05; older: r = 0.15). The data seem to suggest that old age affects the interdependence among measures of MA at rest. We note that muscle contraction did affect these associations between PA–MT and FL–MT for the younger (rrest = 0.68; rcon = 0.22), but not for older adults. An explanation for this finding could be that we observed higher MVCs with more variation in the younger group for the GM and VL compared to the older group (Table 1). This indicates that there is also more variability in the absolute force level at which the muscles were contracted at 20–30% MVC. The selection of a MA measure that is highly correlated with other MA parameters while also being highly reliable may thus remain age- and muscle-dependent. However, on the basis of our analysis, we found that MT appears the simplest and most easily reproducible MA parameter (ICCs > 0.87, SEM% < 7.47%) in the older adults and it supports the use of MT in clinical settings, as done previously in the context of muscle strength research .
A greater sample size could have narrowed the 95% confidence intervals of the ICC in particular for the PA and FL of all muscles. Wider confidence intervals in the current study for PA and FL, and not for MT, have also been reported previously , when determining test–retest reliability of VL and GM muscles in 21 older adults. In the present study, FL was determined with a custom-made MATLAB tool, on 2D images, ignoring the curvature of the muscle which could have influenced reliability .
The current study examined intra-rater and inter-rater reliability of muscle architecture in three lower limb muscles in healthy younger and healthy older individuals. Additionally, we clarified the effect of age on MA reliability and measures of muscle quality. In conclusion, we observed the presence of age- and muscle-specificity in the relationships between MT and PA and MT and FL at rest. Furthermore, we conclude that MA parameters can be reliably assessed with 2D panoramic US, but the level of reliability is likely influenced by muscle quality and varies with age, muscle, and MA measure. Among the MA parameters, MT appears to be the simplest and most easily reproducible MA parameter in older adults.
Healthy younger (N = 12, 5M, mean age: 23.3 SD: ± 3.8 years) and healthy older independently living volunteers (N = 12, 6M, age 67.9 ± 2.1 years) participated in the study. Exclusion criteria were: neurological disorders and orthopedic disabilities that limited mobility function, a hip or knee replacement in the last 3 years, inability to walk for 5 min without a walking aid, and a history of falls in the last year. The Local Ethical Committee approved the study protocol, which was executed in accordance with the declaration of Helsinki. Participants gave written informed consent prior to the start of the study.
Muscle architecture measurements
In vivo MA of the GM, TA and VL muscles was examined by 2D B-mode US (Echoblaster, Telemed, Vilnius, Lithuania) with a 128-element linear-array probe (transducer field of view = 39 mm). The PanoView (Echowave II, Telemed, Lithuania) software was used to create panoramic images of the muscles. US settings were optimized to ensure the best contrast between muscle fascicles and background. The settings, including gain (70%), depth (50 mm for VL, 40 mm TA and GM) and frequency (8 MHz), were set prior to testing and held constant between participants and across trials. US scans were performed on the right leg.
To ensure the thickest part of the muscle was captured within the time limit for scanning, the proximal and distal ends of the most medial part of the muscle were identified, and the panoramic scan was made between ~ 20–80% of the total muscle length. The probe was moved along the longitudinal axis of the muscle oriented parallel to the muscle fascicles and perpendicular to the skin. Minimal pressure was applied with the probe on the skin to avoid muscle compression during scanning. Water-soluble transmission gel was applied to the skin to aid acoustic coupling. Directly after each trial the US scan was inspected and repeated if a movement artefact contaminated the image. For all three muscles, US scans were recorded both at rest and during 20–30% of maximal voluntary (MVC) contraction.
For all muscles, US scans were first made at rest. For the VL, US scans were made while the participant was seated on a custom-made chair  with the right leg strapped to a lever arm mounted on the chair with the knee and hip 90° flexed. For the analysis of the TA, participants sat on the seat of the dynamometer (KinCom AP125; Chattecx Inc., Chattanooga, TN., USA) with the back supported and the knee fully extended. The subject’s right foot was strapped to the foot plate attachment of the dynamometer. Two crossover upper-body belts and a thigh strap were used to minimize extraneous movements. The GM muscle was examined in a prone position on an examination table. Two straps were secured around the upper and lower leg to minimize extraneous movements. The subject’s right foot was secured to the foot plate attachment of the KinCom with the knee fully extended and the ankle at 90°. After a scan at rest, participants produced a weak and medium effort contraction of the muscle followed by an MVC with 30 s of rest between trials. To determine the MVC, participants contracted the quadriceps, plantar flexors and dorsiflexors, respectively, as rapidly and forcefully for 5 s. Afterwards the muscles were US-scanned during 20–30% of the participants’ MVC with the force target displayed on a monitor.
A total of 144 scans were collected (24 participants, 3 muscles, rest, contraction) by the same experimenter.
Ultrasound data analysis
Using a custom-made graphical user interface in MATLAB (version r2019a; The MathWorks Inc., Natick, MA) PA, FL and MT were determined (Fig. 3). PA was defined as the angle between a clearly visible fascicle and the deep aponeurosis, FL as the length of the fascicular path between the deep and superficial aponeuroses  and MT as the perpendicular distance between the superficial and deep aponeuroses at the widest distance [46, 51]. FL and MT were converted from pixels to centimeters using the graduated scale on the original US scan.
To determine the absolute and relative reliability of the US scans, three raters with 6-month to 3-year experience on interpreting US scan images, analyzed the scans. All scans were presented three times  to each rater in random order. Due to a technical problem, data of one younger participant were missing for the VL at rest and for the GM during contraction. This led to a total of 630 ratings in the younger group and 648 ratings in the older group that were used for further statistical analyses.
To assess whether the reliability of ultrasound is influenced by muscle quality, for each scan the EI was calculated. As non-contractile and contractile elements have different pixel intensities, where skeletal muscles appear black and intramuscular adipose and fibrous tissues appear white , EI can be used to examine muscle composition. For each ultrasound image, a region of muscle tissue of interest was selected with the exclusion of subcutaneous and fibrous tissue . EI was than calculated as the mean pixel intensity of that region . Figure 4 shows an ultrasound scan of the m. tibialis anterior during contraction of a younger (left) and older participant (right), with the regions of interest annotated in yellow and the mean EI values displayed at the top of the figure. EI values range between 0 and 255, with low values being indicative of good muscle quality .
Statistical software (R version 3.6.1, R core team, 2019), including the IRR package (v0.84.1, Gamer, 2012) was used for all calculations. Intraclass correlation coefficients (ICC) with 95% confidence intervals were calculated as a measure of relative reliability, and the standard error of measurements (SEM) and its percentage (SEM%) were calculated as a measure of absolute reliability.
Reliability of MA parameters
Intra-rater reliability was determined by comparing the three ratings of the same image for each rater. Inter-rater reliability was determined by comparing the mean scores of the three ratings of each US scan among the three observers. For the intra-rater reliability, the ICC and SEM were based on a single-rating, absolute agreement, 2-way mixed-effects model with the following equation (for all reliability equations see ):
with MSR = mean square for rows, MSE = mean square for error; MSC = mean square for columns; n = number of subjects; k = number of measurements.
For the inter-rater reliability, the ICC and SEM were based on a mean-rating (k = 3), absolute agreement, 2-way random model with the following equation:
Reliability was classified as poor (ICC < 0.5), moderate (0.5 ≤ ICC ≤ 0.75), good (0.75 < ICC ≤ 0.9) or excellent (ICC > 0.9) . To examine differences between ICCs of the different architectural parameters, younger and older adults, and rest and contraction, confidence intervals were compared . The SEM for both models was calculated using the following equation :
where MSE is the mean square error. A low SEM implies high reliability, though no generally accepted scales exist to interpret these values.
Additionally, SEM% was computed as follows :
Echo intensity and correlations among MA parameters
To examine whether measures of EI significantly differ between younger and older adults, independent t-tests were performed. To examine the relationship among the three MA parameters, MA parameters were first averaged across raters and trials and Pearson correlations coefficients were computed between the averaged MA values. Correlations were classified as weak (≤ 0.35), moderate (0.36–0.67), or strong (0.68–0.89) or very strong (≥ 0.90) . For all analyses, statistical significance was set to P < 0.05.
Availability of data and materials
The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.
Intraclass correlation coefficient
Standard error of measurement
Relative standard error of measurement
Doherty TJ. Invited review: aging and sarcopenia. J Appl Physiol. 2003;95:1717–27. https://doi.org/10.1152/japplphysiol.00347.2003.
Fragala MS, Kenny AM, Kuchel GA. Muscle quality in aging: a multi-dimensional approach to muscle functioning with applications for treatment. Sport Med. 2015;45:641–58. https://doi.org/10.1007/s40279-015-0305-z.
Barbat-Artigas S, Rolland Y, Zamboni M, Aubertin-Leheudre M. How to assess functional status: a new muscle quality index. J Nutr Health Aging. 2012;16:67–77. https://doi.org/10.1007/s12603-012-0004-5.
Lieber RL, Fridén J. Functional and clinical significance of skeletal muscle architecture. Muscle Nerve. 2000;23:1647–66. https://doi.org/10.1002/1097-4598(200011)23:11<1647::aid-mus1>3.0.co;2-m.
Rutherford OM, Jones DA. Measurement of fibre pennation using ultrasound in the human quadriceps in vivo. Eur J Appl Physiol Occup Physiol. 1992;65:433–7. https://doi.org/10.1007/BF00243510.
Selva Raj I, Bird SR, Shield AJ. Ultrasound measurements of skeletal muscle architecture are associated with strength and functional capacity in older adults. Ultrasound Med Biol. 2017;43:586–94. https://doi.org/10.1016/j.ultrasmedbio.2016.11.013.
Morse CI, Thom JM, Birch KM, Narici MV. Changes in triceps surae muscle architecture with sarcopenia. Acta Physiol Scand. 2005;183:291–8. https://doi.org/10.1111/j.1365-201X.2004.01404.x.
Narici MV, Maganaris CN, Reeves ND, Capodaglio P. Effect of aging on human muscle architecture. J Appl Physiol. 2003;95:2229–34. https://doi.org/10.1152/japplphysiol.00433.2003.
Stenroth L, Peltonen J, Cronin NJ, Sipilä S, Finni T. Age-related differences in Achilles tendon properties and triceps surae muscle architecture in vivo. J Appl Physiol. 2012;113:1537–44. https://doi.org/10.1152/japplphysiol.00782.2012.
Lexell J, Taylor CC, Sjöström M. What is the cause of the ageing atrophy? Total number, size and proportion of different fiber types studied in whole vastus lateralis muscle from 15- to 83-year-old men. J Neurol Sci. 1988;84:275–94. https://doi.org/10.1016/0022-510x(88)90132-3.
Fukunaga T, Kubo K, Kawakami Y, Fukashiro S, Kanehisa H, Maganaris CN. In vivo behaviour of human muscle tendon during walking. Proc R Soc Lond Ser B Biol Sci. 2001;268:229–33. https://doi.org/10.1098/rspb.2000.1361.
Kubo K, Kanehisa H, Azuma K, Ishizu M, Kuno SY, Okada M, et al. Muscle architectural characteristics in women aged 20–79 years. Med Sci Sports Exerc. 2003;35:39–44. https://doi.org/10.1097/00005768-200301000-00007.
Narici M. Human skeletal muscle architecture studied in vivo by non-invasive imaging techniques: functional significance and applications. J Electromyogr Kinesiol. 1999;9:97–103. https://doi.org/10.1016/s1050-6411(98)00041-8.
Lieber RL, Ward SR. Skeletal muscle design to meet functional demands. Philos Trans R Soc B Biol Sci. 2011;366:1466–76. https://doi.org/10.1098/rstb.2010.0316.
Narici MV, Binzoni T, Hiltbrand E, Fasel J, Terrier F, Cerretelli P. In vivo human gastrocnemius architecture with changing joint angle at rest and during graded isometric contraction. J Physiol. 1996;496(Pt 1):287–97. https://doi.org/10.1113/jphysiol.1996.sp021685.
English C, Fisher L, Thoirs K. Reliability of real-time ultrasound for measuring skeletal muscle size in human limbs in vivo: a systematic review. Clin Rehabil. 2012;26:934–44. https://doi.org/10.1177/0269215511434994.
Nijholt W, Scafoglieri A, Jager-Wittenaar H, Hobbelen JSM, van der Schans CP. The reliability and validity of ultrasound to quantify muscles in older adults: a systematic review. J Cachexia Sarcopenia Muscle. 2017;8:702–12. https://doi.org/10.1002/jcsm.12210.
Kwah LK, Pinto RZ, Diong J, Herbert RD. Reliability and validity of ultrasound measurements of muscle fascicle length and pennation in humans: a systematic review. J Appl Physiol. 2013;114:761–9. https://doi.org/10.1152/japplphysiol.01430.2011.
Hebert JJ, Koppenhaver SL, Parent EC, Fritz JM. A systematic review of the reliability of rehabilitative ultrasound imaging for the quantitative assessment of the abdominal and lumbar trunk muscles. Spine (Phila Pa 1976). 2009;34:E848–56. https://doi.org/10.1097/BRS.0b013e3181ae625c.
McMahon JJ, Turner A, Comfort P. Within- and between-session reliability of medial gastrocnemius architectural properties. Biol Sport. 2016;33:185–8. https://doi.org/10.5604/20831862.1200511.
Walker FO, Cartwright MS, Wiesler ER, Caress J. Ultrasound of nerve and muscle. Clin Neurophysiol. 2004;115:495–507. https://doi.org/10.1016/j.clinph.2003.10.022.
Stock MS, Thompson BJ. Echo intensity as an indicator of skeletal muscle quality: applications, methodology, and future directions. Eur J Appl Physiol. 2021;121:369–80. https://doi.org/10.1007/s00421-020-04556-6.
Visser M, Goodpaster BH, Kritchevsky SB, Newman AB, Nevitt M, Rubin SM, et al. Muscle mass, muscle strength, and muscle fat infiltration as predictors of incident mobility limitations in well-functioning older persons. J Gerontol Ser A Biol Sci Med Sci. 2005;60:324–33. https://doi.org/10.1093/gerona/60.3.324.
Janssen I, Heymsfield SB, Ross R. Low relative skeletal muscle mass (Sarcopenia) in older persons is associated with functional impairment and physical disability. J Am Geriatr Soc. 2002;50:889–96. https://doi.org/10.1046/j.1532-5415.2002.50216.x.
Visser M, Deeg DJH, Lips P, Harris TB, Bouter LM. Skeletal muscle mass and muscle strength in relation to lower-extremity performance in older men and women. J Am Geriatr Soc. 2000;48:381–6. https://doi.org/10.1111/j.1532-5415.2000.tb04694.x.
Cruz-Jentoft AJ, Pierre Baeyens J, Bauer JM, Boirie Y, Cederholm T, Landi F, et al. Sarcopenia: European consensus on definition and diagnosis Report of the European Working Group on Sarcopenia in Older People. Age Ageing. 2010;39:412–23. https://doi.org/10.1093/ageing/afq034.
Tosato M, Marzetti E, Cesari M, Savera G, Miller RR, Bernabei R, et al. Measurement of muscle mass in sarcopenia: from imaging to biochemical markers. Aging Clin Exp Res. 2017;29:19–27. https://doi.org/10.1007/s40520-016-0717-0.
Afschrift M, de Groote F, Jonkers I. Similar sensorimotor transformations control balance during standing and walking. PLoS Comput Biol. 2021;17:e1008369. https://doi.org/10.1371/journal.pcbi.1008369.
Papegaaij S, Baudry S, Négyesi J, Taube W, Hortobágyi T. Intracortical inhibition in the soleus muscle is reduced during the control of upright standing in both young and old adults. Eur J Appl Physiol. 2016;116:959–67. https://doi.org/10.1007/s00421-016-3354-6.
Waanders JB, Murgia A, Hortobágyi T, DeVita P, Franz JR. How age and surface inclination affect joint moment strategies to accelerate and decelerate individual leg joints during walking. J Biomech. 2020;98:109440. https://doi.org/10.1016/j.jbiomech.2019.109440.
DeVita P, Hortobagyi T. Age causes a redistribution of joint torques and powers during gait. J Appl Physiol. 2000;88:1804–11. https://doi.org/10.1152/jappl.2000.88.5.1804.
Abe T, Brechue WF, Fujita S, Brown JB. Gender differences in FFM accumulation and architectural characteristics of muscle. Med Sci Sports Exerc. 1998;30:1066–70. https://doi.org/10.1097/00005768-199807000-00007.
Kawakami Y, Abe T, Kanehisa H, Fukunaga T. Human skeletal muscle size and architecture: variability and interdependence. Am J Hum Biol. 2006;18:845–8. https://doi.org/10.1002/ajhb.20561.
Degens H, Korhonen MT. Factors contributing to the variability in muscle ageing. Maturitas. 2012;73:197–201. https://doi.org/10.1016/j.maturitas.2012.07.015.
Strobel K, Hodler J, Meyer DC, Pfirrmann CWA, Pirkl C, Zanetti M. Fatty atrophy of supraspinatus and infraspinatus muscles: accuracy of US. Radiology. 2005;237:584–9. https://doi.org/10.1148/radiol.2372041612.
de Boer MD, Seynnes OR, di Prampero PE, Pišot R, Mekjavić IB, Biolo G, et al. Effect of 5 weeks horizontal bed rest on human muscle thickness and architecture of weight bearing and non-weight bearing muscles. Eur J Appl Physiol. 2008;104:401–7. https://doi.org/10.1007/s00421-008-0703-0.
Weng L, Tirumalai AP, Lowery CM, Nock LF, Gustafson DE, Von Behren PL, et al. US extended-field-of-view imaging technology. Radiology. 1997;203:877–80. https://doi.org/10.1148/radiology.203.3.9169720.
Gans C, de Vree F. Functional bases of fiber length and angulation in muscle. J Morphol. 1987;192:63–85. https://doi.org/10.1002/jmor.1051920106.
Mairet S, Maïsetti O, Portero P. Homogeneity and reproducibility of in vivo fascicle length and pennation determined by ultrasonography in human vastus lateralis muscle. Sci Sport. 2006;21:268–72. https://doi.org/10.1016/j.scispo.2006.08.004.
Giannakou E, Aggeloussis N, Arampatzis A. Reproducibility of gastrocnemius medialis muscle architecture during treadmill running. J Electromyogr Kinesiol. 2011;21:1081–6. https://doi.org/10.1016/j.jelekin.2011.06.004.
Tracy BL, Enoka RM. Older adults are less steady during submaximal isometric contractions with the knee extensor muscles. J Appl Physiol. 2002;92:1004–12. https://doi.org/10.1152/japplphysiol.00954.2001.
Bazzucchi I, Felici F, Macaluso A, De Vito G. Differences between young and older women in maximal force, force fluctuations, and surface emg during isometric knee extension and elbow flexion. Muscle Nerve. 2004;30:626–35. https://doi.org/10.1002/mus.20151.
Abe T, Sakamaki M, Yasuda T, Bemben MG, Kondo M, Kawakami Y, et al. Age-related, site-specific muscle loss in 1507 Japanese men and women aged 20 to 95 years. J Sport Sci Med. 2011;10:145–50.
Maden-Wilkinson TM, Degens H, Jones DA, McPhee JS. Comparison of MRI and DXA to measure muscle size and age-related atrophy in thigh muscles. J Musculoskelet Neuronal Interact. 2013;13:320–8.
Narici MV, Maganaris CN. Plasticity of the muscle-tendon complex with disuse and aging. Rev. 2007;35:126–34. https://doi.org/10.1097/jes.0b013e3180a030ec.
Strasser EM, Draskovits T, Praschak M, Quittan M, Graf A. Association between ultrasound measurements of muscle thickness, pennation angle, echogenicity and skeletal muscle strength in the elderly. Age (Omaha). 2013;35:2377–88. https://doi.org/10.1007/s11357-013-9517-z.
Raj IS, Bird SR, Shield AJ. Reliability of ultrasonographic measurement of the architecture of the vastus lateralis and gastrocnemius medialis muscles in older adults. Clin Physiol Funct Imaging. 2012;32:65–70. https://doi.org/10.1111/j.1475-097X.2011.01056.x.
Noorkoiv M, Stavnsbo A, Aagaard P, Blazevich AJ. In vivo assessment of muscle fascicle length by extended field-of-view ultrasonography. J Appl Physiol. 2010;109:1974–9. https://doi.org/10.1152/japplphysiol.00657.2010.
dos Santos PCR, Hortobágyi T, Zijdewind I, Bucken Gobbi LT, Barbieri FA, Lamoth C. Minimal effects of age and prolonged physical and mental exercise on healthy adults’ gait. Gait Posture. 2019;74:205–11. https://doi.org/10.1016/j.gaitpost.2019.09.017.
Baroni BM, Geremia JM, Rodrigues R, De Azevedo Franke R, Karamanidis K, Vaz MA. Muscle architecture adaptations to knee extensor eccentric training: Rectus femoris vs. vastus lateralis. Muscle Nerve. 2013;48:498–506. https://doi.org/10.1002/mus.23785.
Palmer TB, Akehi K, Thiele RM, Smith DB, Thompson BJ. Reliability of panoramic ultrasound imaging in simultaneously examining muscle size and quality of the hamstring muscles in young, healthy males and females. Ultrasound Med Biol. 2015;41:675–84. https://doi.org/10.1016/j.ultrasmedbio.2014.10.011.
Koppenhaver SL, Parent EC, Teyhen DS, Hebert JJ, Fritz JM. The effect of averaging multiple trials on measurement error during ultrasound imaging of transversus abdominis and lumbar multifidus muscles in individuals with low back pain. J Orthop Sports Phys Ther. 2009;39:604–11. https://doi.org/10.2519/jospt.2009.3088.
Pillen S, Tak RO, Zwarts MJ, Lammens MMY, Verrijp KN, Arts IMP, et al. Skeletal muscle ultrasound: correlation between fibrous tissue and echo intensity. Ultrasound Med Biol. 2009;35:443–6. https://doi.org/10.1016/j.ultrasmedbio.2008.09.016.
Young HJ, Southern WM, Mccully KK. Comparisons of ultrasound-estimated intramuscular fat with fitness and health indicators. Muscle Nerve. 2016;54:743–9. https://doi.org/10.1002/mus.25105.
Koo TK, Li MY. A guideline of selecting and reporting intraclass correlation coefficients for reliability research. J Chiropr Med. 2016;15:155–63. https://doi.org/10.1016/j.jcm.2016.02.012.
Kowalik D, Choi YH, Zou GY. Confidence interval estimation for a difference between two dependent intraclass correlation coefficients with variable class sizes. J Stat Theory Pract. 2011;5:613–25. https://doi.org/10.1080/15598608.2011.10483734.
de Vet HCW, Terwee CB, Knol DL, Bouter LM. When to use agreement versus reliability measures. J Clin Epidemiol. 2006;59:1033–9. https://doi.org/10.1016/j.jclinepi.2005.10.015.
Lexell JE, Downham DY. How to assess the reliability of measurements in rehabilitation. Am J Phys Med Rehabil. 2005;84:719–23. https://doi.org/10.1097/01.phm.0000176452.17771.20.
Taylor R. Interpretation of the correlation coefficient: a basic review. J Diagn Med Sonogr. 1990;6:35–9. https://doi.org/10.1177/875647939000600106.
This work was supported by the French National Research Agency in the framework of the Investissements d’avenir program (ANR-10-AIRT-05 and ANR-15-IDEX-02). The sponsors had no involvement in the review and approval of the manuscript for publication. This work forms part of a broader interdisciplinary project, GaitAlps.
Ethics approval and consent to participate
The study protocol was approved by The Ethical Committee of the Center for Human Movement Sciences, University Medical Center Groningen (Protocol ID 201900228). All participants signed the informed consent document before taking part in the study. The study was performed in accordance with the Declaration of Helsinki (World Medical Association Declaration of Helsinki, 2013).
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Hagoort, I., Hortobágyi, T., Vuillerme, N. et al. Age- and muscle-specific reliability of muscle architecture measurements assessed by two-dimensional panoramic ultrasound. BioMed Eng OnLine 21, 15 (2022). https://doi.org/10.1186/s12938-021-00967-4
- Echo intensity
- Muscle architecture