- Open Access
Smartphone-based behavioral monitoring and patient-reported outcomes in adults with rheumatic and musculoskeletal disease
BMC Musculoskeletal Disorders volume 23, Article number: 566 (2022)
Rheumatic and musculoskeletal diseases (RMD) are associated with depression, fatigue, and disturbed sleep – symptoms that often impact behavior and activity. Patient reported outcomes (PROs) are a way of collecting information on the patient symptom experience directly from the individual. The purpose of this study was to measure and compare user smartphone sensor and activity data in adults with RMDs and assess associations with PROs.
We invited adults with RMDs enrolled in the FORWARD Databank to participate by installing a custom app on their smartphone and answering PROs (pain, global, HAQ-II) questions daily and weekly over 3 years. Passive data collected included mobility distance, unique calls and text messages, call durations, and number of missed calls. Confounders included sociodemographic, clinical, passive phone behavior, and seasonal factors. Kappa statistics between PRO and flares were computed to measure agreement. The agreement between daily and weekly VAS pain was estimated using the intraclass (ICC) correlation of a two-way random effect model. The relationship between the weekly PRO outcomes and the passive phone data was analyzed with a linear mixed-effect model (LMM), including a random intercept for participant and slope for time in the study with an unstructured covariate structure.
Of the 446 participants, the mean (SD) age was 54 (12) years, most (65.5%) had rheumatoid arthritis (RA), the vast majority (91%) were female, and the US Northeast has the least representation (12%). Longer reaction times, interaction diversity, and higher mobility were associated with worse PROs while longer text messages were associated with better PROs. Participants in this study showed good levels of adherence which holds promise for future interventions using passive behavior measures in self-management and clinical follow-up.
This is the first study to examine passive smartphone behavior with PROs in RMDs and we found significant associations between these behaviors and important health outcomes of pain and function. As smartphone usage continues to change, future studies should validate and expand on our findings with a goal of finding changes in patient symptoms passively through mobile device monitoring.
Mobile smartphones have become a standard tool for healthcare delivery and health self-monitoring . Smartphones are equipped with various sensors which can measure movement, activity, sleep, and geographical location. Sensor data combined with user data such as which applications are used, for how long, and in what capacity can give information about an individual’s activity, behavior, and personality [2, 3]. Combining this passively collected data (sensing data and user activity data) allows scientists to make assumptions about behavioral patterns .
Patient-reported outcomes (PROs) are traditionally considered health outcomes as reported by the individual patient . Whereas the clinician is limited to measurement of health variables within the healthcare setting, PROs are valuable because they measure the individual’s experience and capture a “real-time” ongoing clinical picture of their quality of life, symptomatology, and disease experience [5, 6]. PROs are essential for understanding the day-to-day experience of living with an illness and may improve patient communication, decision making, satisfaction, and confidence .
Using passively collected data from a smartphone to collect information about the individual in their day-to-day life expands upon the principle of PROs while also reducing the bias of self-report . Collecting PROs combined with passive smartphone data is a novel way of incorporating the individual symptom experience with objective behavioral data outside of the clinical setting. Additionally, combining PRO and passive data allows scientists to better draw generalizations about behavioral patterns and health status. While this data combination has been utilized to predict disease course in psychiatric populations, limited information is available on individuals with chronic disease .
Individuals with rheumatic and musculoskeletal diseases (RMD) like rheumatoid arthritis (RA) have daily fluctuations in associated symptoms, including pain, fatigue, physical disability, and mood disturbances . These symptoms are associated with alterations in activity. As symptoms increase in frequency or severity, physical activity levels and cognitive response times tend to decrease . When individuals are well, and symptoms subside, activity levels and cognitive processing times increase. These symptom and disease burden associated activity changes have the potential to be measured through smartphone sensing technology and have been correlated to rheumatic illness activity .
Smartphones collect passive data on individuals without requiring adherence to survey tool completion, which is a common PRO challenge. Barriers to PRO include the time required to complete the PRO tools, difficulty using electronic devices to complete PROs, and that an individual is less likely, or unable to complete PRO tools when they are unwell [12, 13]. Passive smartphone activity data collected throughout the day and over time has the potential to predict individual health changes like disease flare. Combining smartphone user activity data with PROs would provide more in-depth real-time data about the individual’s clinical picture.
Several studies exist that focus on wearable activity trackers and their relationship to RMD disease activity, however, there is limited literature available on how passive smartphone user data may correlate with RMD PROs. Available literature suggests that research on RA symptoms and smartphone passive data has been found to be feasible, with high levels of patient engagement and may predict RA disease activity . Additionally, sensor measures such as wrist motion which may quantify meaningful RMD specific symptoms have also been shown to be feasible . These findings demonstrate the potential to quantify meaningful RMD clinical information passively and remotely from an individual’s smartphone.
Using passive smartphone data combined with PROs may improve symptom monitoring and management and may have utility as a diagnostic and disease management tool in individuals with suspected rheumatic illness. The purpose of this study was to: 1. Measure and compare user phone activity via smartphone sensing activity in individuals with RMDs (RA and non-RA), and 2. Assess associations between passively collected behavioral data and participant’s PRO disease severity.
Six hundred and twenty-nine active participants of FORWARD, The National Databank for Rheumatic Diseases, were invited to participate in this study based on their prior response of owning a smartphone and expressing willingness to be invited. The FORWARD Databank is an ongoing United States (US)-based longitudinal observational study that enrolls participants from rheumatology clinics . Participants complete semi-annual, comprehensive questionnaires covering several aspects of their diseases, including demographics, clinical characteristics, disease severity, and treatments.
To be eligible to participate, individuals were required to: (1) be over 18 years old (2); own a smartphone with an iOS (iPhone) or Android operating system (3); be willing to download and use an app to respond to daily or weekly surveys and share passive measures on smartphone activity (4); have an RMD diagnosis including rheumatoid arthritis (RA), systemic lupus erythematosus (SLE), osteoarthritis (OA), fibromyalgia, and musculoskeletal chronic pain.
Two data sources were used in the study, including the custom created smartphone application (hereby called “the App”) data collected via smartphone (passive phone data and PROs) and the FORWARD questionnaires (enrolment and 6-month phase), which were linked using the most immediate prior FORWARD questionnaire from the day participants started using the app.
The app data
The PROs that were selected were chosen due to their common use in rheumatology populations in both research and clinical practice, and our experience using these specific measures in our research . No additional measures were selected to limit subject burden. The PROs that were collected included the daily Visual Analogue Scale (VAS) pain and physical function and weekly VAS Pain, VAS global patient assessment, and HAQ-II; these last three variables combined mean formed the Patient Activity Scale-II (PAS-II), a measure of RMD activity. Daily passive measures describing smartphone activity (collected in Android only) included the number of unreturned calls, average duration per call (minutes), the average length of SMS per message (characters), call count, SMS text message count, interaction diversity (number of unique phone numbers), missed interaction (including texts and calls) and aggregated communication (number of calls and SMS). Mobility as the approximate distance (in miles) covered while walking, biking, running, etc. was collected with the intent to capture physical activity; mobility radius (in miles), an approximate for overall traveling, measured the radius of the circle around the locations gathered for the participant that day. Additionally, the time-of-day participants completed their PROs on their phone (morning, afternoon, and evening/night), season (winter, spring, summer, and fall), working days versus weekends, and reaction time (in hours, the length of time taken by a participant to respond to a given questionnaire). Passively collected measures were limited to when participants kept their smartphone active and with them.
FORWARD variables included demographics, such as diagnosis (RA vs. non-RA), age, sex, race, educational years, employment status, marital status, total household income (US dollars thousands), insurance (Medicare vs. other), number of persons living in the household; clinical and severity variables such as Rheumatic Disease Comorbidity Index (RDCI), Health Assessment Questionnaire-II (HAQ-II), Patient Activity Scale-II (PAS-II), Rheumatoid Arthritis Disease Activity Index (RADAI), EuroQoL-5D (EQ-5D), SF-36 Physical Component Summary (PCS) and Mental Component Summary (MCS) scores, and Visual Analog Scales (VAS), which included pain, fatigue, and patient global assessment. Geographical area (rural versus urban) and US region defined by first zip code digit (Southeast, Northeast, Midwest, West) were also considered to possibly affect smartphone behavior and health outcomes.
This study was conducted in two phases that took place over 3 years. During Phase I (Sep 2013 to May 2014), 190 participants downloaded the app and agreed to respond to daily VAS pain and HAQ-II for 3 months and weekly to VAS pain, global assessment, and HAQ-II. Participants also agreed to share phone activity and sensing data, including the number of unique calls and texts, duration of calls, distance traveled, etc.
For Phase II (May 2013 to August 2015), 256 new subjects enrolled. Identical in data collection, this phase included an update on flare assessment and improved graphics. Phase I participants who were willing to continue were allowed to participate in the subsequent phase.
Baseline characteristics and phone/passive measures were compared by diagnosis using T-tests and Chi-square/Fisher’s test for continuous and categorical variables, respectively, i.e., when participants started using the App. Similar analyses were performed comparing phone operating systems as iOS and Android had different rules on what data from the smartphone would be made available to the App. Correct adherence to the App was computed daily and weekly over the study period to the point of discontinuation (i.e., no App use ≥ 1 month (implementation)). Kaplan Meier estimates were used to analyze time to discontinuation and Cox regression for predictors. Confounders included sociodemographic, clinical, passive phone behavior, and seasonal factors. Kappa statistics between PRO and flares were computed to measure agreement.
The agreement between daily and weekly VAS pain was estimated using the intraclass (ICC) correlation of a two-way random effect model, with the method as the first level (daily or weekly pain) and the participant as the second level. If the agreement between daily or weekly measures is high, the weekly measurement was preferred, imputing each weekly value in the corresponding seven prior days. The agreement analysis was limited to the first 3 months, where both weekly and daily measures were collected.
Passive mobile variables were log-transformed due to the skewness of the distributions and further transformed using moving averages of the previous 7 days (sensitivity analyses with different lags). The relationship between the four weekly PRO outcomes (pain, global assessment, HAQ-II, and PAS-II) and the passive phone data was analyzed with a linear mixed-effect model (MRM), including a random intercept for participant and slope for time in the study with an unstructured covariate structure. Random intercept models were also compared with random slope models using likelihood ratio tests.
Best models (for fixed effects) were obtained using backward selection with a 20% significance level at which variables could be removed from the models, estimated in the two samples: overall sample and for RA participants only. The saturated model started with the following list of covariates (as fixed effects) for the overall sample: diagnoses, sex, age, white ethnicity, total income, education level, number of persons living in the household, RD comorbidity index, education years, employment status, prior HAQ-II, prior pain, sleep scale, phase initiators, study phase, season, region and interaction between season and region, rural area, workdays, time of day, time in the study as a quadratic function and moving averages of passive data: mobility, mobility radius, average SMS length, average call duration, interaction diversity, missed interactions and log of reaction time. Disease-modifying antirheumatic drugs (DMARDs) or biologic treatment were also included for RA only. All analyses were performed using STATA/MP version 14.2, and all tests were two-sided, with a 5% significance level.
Sample and baseline characteristics
Of the 624 FORWARD participants with RMDs that were invited to participate, 541 participants accepted, and 446 (292 with RA) downloaded, installed, and had data collected on the study App (Fig. 1). Prior FORWARD questionnaires were collected 2 months (median, IQR 1–3 months) before the start of the study. By phase, the sample split was 190 in Phase I and 256 in Phase II, with 88 participating in both phases. Restricting the sample to Android users having all the passive variables collected, the final sample was composed of 167 participants, 107 with RA diagnosis, with a total of ~ 85,000 time points and 38,000 observations of passive data. The average time follow-up was 60 days (SD 37) for those getting the daily questions in the first 3 months. The average time follow-up for the overall study where weekly questions were asked was 27 weeks (SD 4w) (median 19 weeks).
At initiation, participants were primarily female (91.8%), white (89%), average age of 54 years old (SD 12) with 15 years of education and a mean HAQ-II of 0.78 (SD 0.58) (Table 1). Most participants lived in urban areas (85%) and in midwest-southeast regions (73%). Some differences in disease severity were found between diagnoses such as in PAS-II. Between phone operating systems, iPhone users tended to have better disease severity outcomes and higher total income.
Overall, 62% of participants reported daily measures in the App (Table 2). Daily and weekly correct implementation was 68 and 90%. The probability of remaining in the study at 6 months was 0.78 and 1 year, 0.64. No differences were found by diagnosis (P log ranks 0.92).
Younger age HR 0.98 (95% CI 0.97–0.99), pain 1.08 (1.02–1.15), HAQ-II 0.75 (0.60–0.94), Global 1.06 (1.00–1.13), PAS-2 1.13 (1.02–1.24), and summer vs other seasons 2.98 (1.94–4.56) were significant predictors of discontinuation using univariate cox regression.
On average, participants had four calls per day with a duration of 4 minutes per call, sent 17 text messages with 72 characters length, interacted with 4 to 5 distinct people and missed one call per day maximum (Table 2). Figure 2 presents an example of the passive phone data for one participant. Passive data variables were log-transformed due to the skewness of the distributions. No differences were found in phone behavior at baseline between RA and non-RA participants, except on some active PROs such as pain, global assessment and PAS-II. Non-RA participants had worse pain, global assessment and disease activity in comparison to RA participants (driven mainly by Fibromyalgia participants), but no differences were found on function or HAQ-II. Participants took 2 hours on average to respond to the question in the app. Most of the measures on the app were collected early in the morning, during the week (77%) and during the winter (48%).
Agreement between daily and week PROs
The agreement between daily and weekly pain was 83.3%. Based on this estimate, weekly measurements were used during the entire study, imputing the weekly PROs in the previous 6 days of the week.
The relationship between PROs and passive phone data
The relationship between passive phone data and the PROs outcomes are listed in Table 3 for the overall sample and Table 4 for RA only. Using the predicted values from the MRM models by increasing values of SMS length, interaction diversity, reaction time and mobility variables, keeping all the other variables at mean values. The best final models searched by backward selection are presented in Table 5.
This study included 446 individuals with RMDs participating in a study to measure passive sensing data and PROs using smartphones. With this information, we sought to compare user phone activity (time of use, number and duration of calls, number, and length of text messages) and smartphone sensing activity (mobility, distance travelled, geographic location) in individuals with RMDs (RA and non-RA), and assess associations between these passively collected behavioral data and participants PRO disease severity (PAS-II, pain, and HAQ-II).
Longer reaction times, interaction diversity, and higher mobility were associated with worse PRO disease severity. It is unsurprising that slower reaction times would be consistent with more pain and disability. Previous research on other chronic and disabling conditions has associated slower reaction time, as measured through smartphone sensing data, as predictive of disease severity  It is unclear why more mobility and miles traveled, and more interaction diversity would be associated with worse PRO disease severity. We would expect that individuals with more severe and disabling disease would be less mobile with less interaction, due to limited range of motion and pain. One possibility is that those with more symptoms may need to seek more care, and this would require interacting with more phone numbers and traveling greater distances. Alternatively, perhaps a certain level of mobility and interaction contributes to an overexerted state that contributes to increased RMD symptomatology. Associations between reaction time, interaction diversity and mobility with worse PRO were statistically significant, but it is unclear if the differences demonstrate clinical significance.
Longer length of SMS text message was associated with better PRO outcomes (less pain, better function, less disease activity). Similar results have been found in depressed individuals during symptom remission, indicating individuals may be better able to formulate ideas, concentrate, and have the motivation to compose and respond to text messages when feeing well . Although this finding was statistically significant, the magnitude of linear associations was modest.
Participants in this study showed good levels of adherence which holds promise for future interventions using passive behavior measures in self-management and clinical follow-up. Examining predictors of discontinuation of response to smartphone data collection is also important as additional reminders can encourage continued participation.
Limitations include the observational nature of the study and self-selection of the sample. Data collection was completed over 5 years ago, and thus, results were limited by the technology at the time. Participants were active participants in the longitudinal study FORWARD who owned smartphones, limiting the generalizability of these findings. At the time of study, the Android operating system was the only system where we could evaluate all passive variables, therefore limiting the sample further. The analyses also assumed patients were carrying their phone at all times (for mobility). The recruitment strategy included extending participation of individuals in Phase I which reduced the number of unique participants. Additionally, the novel nature of our study required new analysis techniques for both PROs and sensing data, which have not been previously tested.
A significant challenge with RMDs like RA is the unpredictability of symptom flares. While many studies have been conducted to predict the prodrome for rheumatic disease flares, each person has specific triggers and an individualized course of symptom flare and quiescence. The behavioral patterns associated with the RA prodrome and symptom experience have the potential to be measured and predicted with passively collected smartphone data. Smartphones are a feasible option to collect data about disease activity in individuals with both rheumatic diseases and other chronic health concerns. Correlating PROs with passive measures can strengthen our knowledge about both variables. Future work should focus on ways to passively collect PRO-type data that has clinical relevance in the individual’s treatment plan. Future research may find that smartphone passive data may be a less intrusive means to identify worsening disease burden in participants with rheumatic and other disease states. Additional studies on the behavior of individuals and their rheumatic symptomatology and the measurement of these variables using smartphone technology should confirm and expand these findings.
Availability of data and materials
The data that support the findings of this study are available upon reasonable request. The data are not publicly available due to their containing information that could compromise the privacy of research participants. For data requests contact Kaleb Michaud, email@example.com.
Sim I. Mobile devices and health. N Engl J Med. 2019;381(10):956–68.
International Telecommunication Union. Measuring the Information Society Report 2018. Volume 1. Geneva: ITU; 2018. Retrieved June 9, 2022. https://www.itu.int/en/ITU-D/Statistics/Documents/publications/misr2018/MISR-2018-Vol-1-E.pdf. ISBN: 978-92-61-27231-9.
Stachl C, Au Q, Schoedel R, Gosling SD, Harari GM, Buschek D, et al. Predicting personality from patterns of behavior collected with smartphones. Proc Natl Acad Sci. 2020;117(30):17680–7.
Insel TR. Digital phenotyping: technology for a new science of behavior. JAMA. 2017;318(13):1215–6.
Parrish RG. Measuring population health outcomes. Prev Chronic Dis. 2010;7(4):1–11
Dowling NM, Bolt DM, Deng S, Li C. Measurement and control of bias in patient reported outcomes using multidimensional item response theory. BMC Med Res Methodol. 2016;16(1):1–12.
Bartlett SJ, De Leon E, Orbai A-M, Haque UJ, Manno RL, Ruffing V, et al. Patient-reported outcomes in RA care improve patient communication, decision-making, satisfaction and confidence: qualitative results. Rheumatology. 2020;59(7):1662–70.
He-Yueya J, Buck B, Campbell A, Choudhury T, Kane JM, Ben-Zeev D, et al. Assessing the relationship between routine and schizophrenia symptoms with passively sensed measures of behavioral stability. NPJ Schizophr. 2020;6(1):1–8.
Taylor PC, Moore A, Vasilescu R, Alvir J, Tarallo M. A structured literature review of the burden of illness and unmet needs in patients with rheumatoid arthritis: a current perspective. Rheumatol Int. 2016;36(5):685–95.
Hernández-Hernández V, Ferraz-Amaro I, Díaz-González F. Influence of disease activity on the physical activity of rheumatoid arthritis patients. Rheumatology. 2014;53(4):722–31.
Crouthamel M, Quattrocchi E, Watts S, Wang S, Berry P, Garcia-Gancedo L, et al. Using a ResearchKit smartphone app to collect rheumatoid arthritis symptoms from real-world participants: feasibility study. JMIR mHealth and uHealth. 2018;6(9):e177.
Nguyen H, Butow P, Dhillon H, Sundaresan P. A review of the barriers to using patient-reported outcomes (PROs) and patient-reported outcome measures (PROMs) in routine cancer care. J Med Rad Sci. 2021;68(2):186–95.
Long C, Beres LK, Wu AW, Giladi AM. Patient-level barriers and facilitators to completion of patient-reported outcomes measures. Qual Life Res. 2022;31(6):1711–18.
Reade S, Spencer K, Sergeant JC, Sperrin M, Schultz DM, Ainsworth J, et al. Cloudy with a chance of pain: engagement and subsequent attrition of daily data entry in a smartphone pilot study tracking weather, disease severity, and physical activity in patients with rheumatoid arthritis. JMIR mHealth and uHealth. 2017;5(3):e6496.
Hamy V, Garcia-Gancedo L, Pollard A, Myatt A, Liu J, Howland A, et al. Developing smartphone-based objective assessments of physical function in rheumatoid arthritis patients: the PARADE study. Digit Biomark. 2020;4(1):26–44.
Wolfe F, Michaud K. The National Data Bank for rheumatic diseases: a multi-registry rheumatic disease data bank. Rheumatology. 2010;50(1):16–24.
Zhan A, Mohan S, Tarolli C, Schneider RB, Adams JL, Sharma S, et al. Using smartphones and machine learning to quantify Parkinson disease severity: the mobile Parkinson disease score. JAMA Neurol. 2018;75(7):876–80.
Moukaddam N, Truong A, Cao J, Shah A, Sabharwal A. Findings from a trial of the smartphone and OnLine usage-based eValuation for depression (SOLVD) application: what do apps really tell us about patients with depression? Concordance between app-generated data and standard psychiatric questionnaires for depression and anxiety. J Psychiatr Pract. 2019;25(5):365–73.
We would like to acknowledge the time and support contributed by Dr. Leanne Kaye, Naomi Kincler, Shishir Dash, Dr. Sai Moturu, and Dr. Karan Singh for creating the smartphone app, collecting data, and providing data to our team all during the high-demand early days of their healthcare company, Ginger. Additionally, we want to acknowledge the time and effort provided by Dr. Virginia Haynes who helped provide support of the data analysis from Eli Lilly and Company. Last, but not least, we want to thank all of the FORWARD participants who made this study possible.
There was no funding for the study while ongoing. Funding for the post-study analysis was received by Eli Lilly and Company.
Ethics approval and consent to participate
All methods were carried out in accordance with relevant guidelines and regulations, and this study was approved by Ascension Via Christi Hospitals Wichita, Inc. Institutional Review Board (IRB#: IRB00001674 FWA#: FWA00001005). Informed consent was obtained from all participants.
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Mollard, E., Pedro, S., Schumacher, R. et al. Smartphone-based behavioral monitoring and patient-reported outcomes in adults with rheumatic and musculoskeletal disease. BMC Musculoskelet Disord 23, 566 (2022). https://doi.org/10.1186/s12891-022-05520-5
- Rheumatic and musculoskeletal disease
- Mobile health
- Patient reported outcomes
- Physical function