01531nas a2200133 4500008004100000245010800041210006900149260000900218520098900227653002301216100001501239700001901254856012401273 2022 eng d00aCombating False Information by Sharing the Truth: A Study on the Spread of Fact-checks on Social Media0 aCombating False Information by Sharing the Truth A Study on the c20223 aMisinformation on social media has become a horrendous problem in our society. Fact-checks on information often fall behind the diffusion of misinformation, which can lead to negative impacts on society. This research studies how different factors may affect the spread of fact-checks over the internet. We collected a dataset of fact-checks in a six-month period and analyzed how they spread on Twitter. The spread of fact-checks is measured by the total retweet count. The factors/variables include the truthfulness rating, topic of information, source credibility, etc. The research identifies truthfulness rating as a significant factor: conclusive fact-checks (either true or false) tend to be shared more than others. In addition, the source credibility, political leaning, and the sharing count also affect the spread of fact-checks. The findings of this research provide practical insights into accelerating the spread of the truth in the battle against misinformation online.10aBusiness Analytics1 aLi, Jiexun1 aChang, Xiaohui u/biblio/combating-false-information-sharing-truth-study-spread-fact-checks-social-media01524nas a2200133 4500008004100000245010400041210006900145260000900214520098900223653002301212100001501235700001901250856012101269 2022 eng d00aCombating Misinformation by Sharing the Truth: a Study on the Spread of Fact-Checks on Social Media0 aCombating Misinformation by Sharing the Truth a Study on the Spr c20223 aMisinformation on social media has become a horrendous problem in our society. Fact-checks on information often fall behind the diffusion of misinformation, which can lead to negative impacts on society. This research studies how different factors may affect the spread of fact-checks over the internet. We collected a dataset of fact-checks in a six-month period and analyzed how they spread on Twitter. The spread of fact-checks is measured by the total retweet count. The factors/variables include the truthfulness rating, topic of information, source credibility, etc. The research identifies truthfulness rating as a significant factor: conclusive fact-checks (either true or false) tend to be shared more than others. In addition, the source credibility, political leaning, and the sharing count also affect the spread of fact-checks. The findings of this research provide practical insights into accelerating the spread of the truth in the battle against misinformation online.10aBusiness Analytics1 aLi, Jiexun1 aChang, Xiaohui u/biblio/combating-misinformation-sharing-truth-study-spread-fact-checks-social-media01140nas a2200121 4500008004100000245010500041210006900146260000900215520069800224653002300922100001900945856005400964 2022 eng d00aImproving Student Engagement and Connection in Online Learning: Part II, Methodologies and Practices0 aImproving Student Engagement and Connection in Online Learning P c20223 aThe first article in the series appeared last December. Since then, we have received plenty of feedback from other instructors who are actively engaged in online education. Almost all of them agreed that teaching well online remains a challenging task. In this article, I discussed six specific practices that I have found particularly helpful for online teaching and learning.

Practice 1: Adopt a variety of communication methods
Practice 2: Create a Q&A Discussion Board
Practice 3: Estimate the amount of time taken for each assignment
Practice 4: Ensure timely replies
Practice 5: Synchronize assignments with the Canvas calendar
Practice 6: Reorganize course content10aBusiness Analytics1 aChang, Xiaohui uhttps://blogs.oregonstate.edu/inspire/2021/12/07/00850nas a2200121 4500008004100000245009300041210006900134260000900203520033100212653002300543100001900566856014300585 2021 eng d00aImproving Student Engagement and Connection in Online Learning through Proactive Support0 aImproving Student Engagement and Connection in Online Learning t c20213 aXiaohui Chang associate professor of Business Analytics, doesn't hold office hours. She holds "ask me anything hours" as a part of her methods to engage, connect and show empathy for her online students. Her first essay on the Ecampus teaching journey has great tips for all of our increased interactions in the virtual space.10aBusiness Analytics1 aChang, Xiaohui uhttps://blogs.oregonstate.edu/inspire/2021/12/07/improving-student-engagement-and-connection-in-online-learning-through-proactive-support/02320nas a2200157 4500008004100000245009900041210006900140260000900209300001400218490000700232520173700239653002301976100001501999700001902014856012902033 2020 eng d00aImproving Mobile Health Apps Usage: A Quantitative Study on mPower Data of Parkinson's Disease0 aImproving Mobile Health Apps Usage A Quantitative Study on mPowe c2020 a399–4200 v343 aPurpose
The emergence of mobile health (mHealth) products has created a capability of monitoring and managing the health of patients with chronic diseases. These mHealth technologies would not be beneficial unless they are adopted and used by their target users. This study identifies key factors affecting the usage of mHealth apps based on user usage data collected from an mHealth app.

Design/methodology/approach
Using a data set collected from an mHealth app named mPower, developed for patients with Parkinson’s disease (PD), this paper investigated the effects of disease diagnosis, disease progression, and mHealth app difficulty level on app usage, while controlling for user information. App usage is measured by five different activity counts of the app.

Findings
The results across five measures of mHealth app usage vary slightly. On average, previous professional diagnosis and high user performance scores encourage user participation and engagement, while disease progression hinders app usage.

Research limitations/implications
The findings potentially provide insights into better design and promotion of mHealth products and improve the capability of health management of patients with chronic diseases.

Originality/value
Studies on the mHealth app usage are critical but sparse because large-scale and reliable mHealth app usage data are limited. Unlike earlier works based solely on survey data, this research used a large user usage data collected from an mHealth app to study key factors affecting app usage. The methods presented in this study can serve as a pioneering work for the design and promotion of mHealth technologies.10aBusiness Analytics1 aLi, Jiexun1 aChang, Xiaohui u/biblio/improving-mobile-health-apps-usage-quantitative-study-mpower-data-parkinsons-disease00618nas a2200169 4500008004100000245009000041210006900131260000900200300001400209490000800223653002300231100001700254700001800271700001900289700001500308856012500323 2020 eng d00aModeling and Regionalization of China's PM2.5 Using Spatial-Functional Mixture Models0 aModeling and Regionalization of Chinas PM25 Using SpatialFunctio c2020 a116–1320 v11610aBusiness Analytics1 aLiang, Decai1 aZhang, Haozhe1 aChang, Xiaohui1 aHuang, Hui u/biblio/modeling-and-regionalization-chinas-pm25-using-spatial-functional-mixture-models01955nas a2200181 4500008004100000245008100041210006900122260000900191300000900200490000700209520140300216653002301619100001801642700002101660700001901681700001901700856005401719 2020 eng d00aNoise Accumulation in High Dimensional Classification and Total Signal Index0 aNoise Accumulation in High Dimensional Classification and Total c2020 a1-230 v213 aGreat attention has been paid to Big Data in recent years. Such data hold promise for scientific discoveries but also pose challenges to analyses. One potential challenge is noise accumulation. In this paper, we explore noise accumulation in high dimensional two-group classification. First, we revisit a previous assessment of noise accumulation with principal component analyses, which yields a different threshold for discriminative ability than originally identified. Then we extend our scope to its impact on classifiers developed with three common machine learning approaches—random forest, support vector machine, and boosted classification trees. We simulate four scenarios with differing amounts of signal strength to evaluate each method. After determining noise accumulation may affect the performance of these classifiers, we assess factors that impact it. We
conduct simulations by varying sample size, signal strength, signal strength proportional to the number predictors, and signal magnitude with random forest classifiers. These simulations suggest that noise accumulation affects the discriminative ability of high-dimensional classifiers developed using common machine learning methods, which can be modified by sample size, signal strength, and signal magnitude. We developed the measure total signal index (TSI) to track the trends of total signal and noise accumulation.10aBusiness Analytics1 aElman, Miriam1 aMinnier, Jessica1 aChang, Xiaohui1 aChoi, Dongseok uhttp://jmlr.org/papers/volume21/19-117/19-117.pdf01488nas a2200181 4500008004100000245011200041210006900153260000900222300001600231490000700247520082500254653002301079100001801102700001401120700001901134700001501153856013801168 2020 eng d00aRealized Volatility Forecasting and Volatility Spillovers: Evidence from Chinese Non-Ferrous Metals Futures0 aRealized Volatility Forecasting and Volatility Spillovers Eviden c2020 a2713–27310 v263 aWe study the prediction of realized volatility of non-ferrous metals futures traded on the Shanghai Futures Exchange from March 2011 to December 2017. A dynamic model averaging model is employed to combine multiple prediction models using time-varying weights based on individual model performance. Empirical results also reveal that models incorporating volatility spillovers across metals are important for forecast combinations, and short-term spillovers have a stronger impact than long-term spillovers. This approach offers the best forecasting performance and allows users to identify the most dominant model at any given time and demonstrate when and how volatility transmission from another metal is valuable for forecasting. We also find evidence of distinct trading behaviors in emerging and developed markets.10aBusiness Analytics1 aWang, Donghua1 aXin, Yang1 aChang, Xiaohui1 aSu, Xingze u/biblio/realized-volatility-forecasting-and-volatility-spillovers-evidence-chinese-non-ferrous-metals01160nas a2200157 4500008004100000245007000041210006900111260000900180300001200189490000800201520062900209653002300838100001900861700001500880856010700895 2019 eng d00aBusiness Performance Prediction in Location-based Social Commerce0 aBusiness Performance Prediction in Locationbased Social Commerce c2019 a112-1230 v1263 aSocial commerce and location-based services provide a data platform for coexisting and competing businesses in geographical neighborhoods. Our research is aimed at mining data from such platforms to gain valuable insights for better support to strategic and operational business decisions. We develop a computational framework for predicting business performance that takes into account both intrinsic (e.g., attributes) and extrinsic (e.g., competitions) factors. Our experiments on synthetic and real datasets demonstrated superiority of a hybrid prediction model that adopts both link-based and context-based assumptions.10aBusiness Analytics1 aChang, Xiaohui1 aLi, Jiexun u/biblio/business-performance-prediction-location-based-social-commerce00513nas a2200109 4500008004100000245009700041210006900138260002400207653002300231100001900254856013000273 2019 eng d00aLocation-based Data on Social Commerce Platforms can Provide Insights for Business Decisions0 aLocationbased Data on Social Commerce Platforms can Provide Insi aCorvallis, ORc201910aBusiness Analytics1 aChang, Xiaohui u/biblio/location-based-data-social-commerce-platforms-can-provide-insights-business-decisions01294nas a2200169 4500008004100000245007300041210006900114260000900183300001000192490000800202520073100210653002300941100001400964700001900978700001800997856010901015 2018 eng d00aFlexible and Efficient Estimating Equations for Variogram Estimation0 aFlexible and Efficient Estimating Equations for Variogram Estima c2018 a45-580 v1223 aVariogram estimation plays a vastly important role in spatial modeling. Different methods
for variogram estimation can be largely classified into least squares methods and likelihood
based methods. A general framework to estimate the variogram through a set of estimating
equations is proposed. This approach serves as an alternative approach to likelihood based
methods and includes commonly used least squares approaches as its special cases. The
proposed method is highly efficient as a low dimensional representation of the weight
matrix is employed. The statistical efficiency of various estimators is explored and the lag
effect is examined. An application to a hydrology data set is also presented.10aBusiness Analytics1 aSun, Ying1 aChang, Xiaohui1 aGuan, Yongtao u/biblio/flexible-and-efficient-estimating-equations-variogram-estimation00511nas a2200145 4500008004100000245007500041210006900116260000900185653002300194653001700217100002100234700001900255700001900274856007200293 2018 eng d00aUsing a Q Matrix to Assess Students' Latent Skills in an Online Course0 aUsing a Q Matrix to Assess Students Latent Skills in an Online C c201810aBusiness Analytics10aSupply Chain1 aHsieh, Ping-Hung1 aChang, Xiaohui1 aOlstad, Andrew uhttps://ecampus.oregonstate.edu/research/publications/white-papers/01377nas a2200181 4500008004100000245007600041210006900117260000900186300001600195490000700211520077900218653002300997100001801020700001701038700001901055700001601074856010501090 2017 eng d00aThe Lead-Lag Relationship between the Spot and Futures Markets in China0 aLeadLag Relationship between the Spot and Futures Markets in Chi c2017 a1447–14560 v173 aBased on daily and one-minute high-frequency returns, this paper examines the
lead-lag dependence between the CSI 300 index spot and futures markets from 2010 to 2014. The
nonparametric and nonlinear thermal optimal path method is adopted. Empirical results of the
daily data indicate that the lead-lag relationship between the two markets is within one day but
this relationship is volatile since neither of the two possible situations (the futures leads or lags
behind the spot market) takes a dominant place. Besides, our results from high-frequency data
demonstrate that there is a price discovery in the Chinese futures market: the intraday one-minute
futures return leads the cash return by 0~5 minutes regardless of the price trend of the market.10aBusiness Analytics1 aWang, Donghua1 aTu, Jingqing1 aChang, Xiaohui1 aLi, Saiping u/biblio/lead-lag-relationship-between-spot-and-futures-markets-china00558nas a2200145 4500008004100000245007700041210006900118260002200187653002300209653001700232100002100249700001900270700001900289856010400308 2016 eng d00aEarly Detection of Placement for Success in an Online Quantitative Class0 aEarly Detection of Placement for Success in an Online Quantitati aChicago, ILc201610aBusiness Analytics10aSupply Chain1 aHsieh, Ping-Hung1 aChang, Xiaohui1 aOlstad, Andrew u/biblio/early-detection-placement-success-online-quantitative-class01790nas a2200217 4500008004100000245011300041210006900154260000900223300001200232490000700244520105400251653002301305100001901328700002201347700001101369700001101380700002001391700001301411700001301424856013501437 2015 eng d00aDisease Risk Estimation by Combining Case-Control Data with Aggregated Information on the Population at Risk0 aDisease Risk Estimation by Combining CaseControl Data with Aggre c2015 a114-1210 v713 aWe propose a novel statistical framework by supplementing case-control data with summary statistics on the population at risk for a subset of risk factors. Our approach is to first form two unbiased estimating equations, one based on the case-control data and the other on both the case data and the summary statistics, and then optimally combine them to derive another estimating equation to be used for the estimation. The proposed method is computationally simple and more efficient than standard approaches based on case-control data alone. We also establish asymptotic properties of the resulting estimator, and investigate its finite-sample performance through simulation. As a substantive application, we apply the proposed method to investigate risk factors for endometrial cancer, by using data from a recently completed population-based case-control study and summary statistics from the Behavioral Risk Factor Surveillance System, the Population Estimates Program of the US Census Bureau, and the Connecticut Department of Transportation.10aBusiness Analytics1 aChang, Xiaohui1 aWaagepetersen, R.1 aYu, H.1 aMa, X.1 aHolford, T., R.1 aWang, R.1 aGuan, Y. u/biblio/disease-risk-estimation-combining-case-control-data-aggregated-information-population-risk02960nas a2200181 4500008004100000245015600041210006900197260000900266300001200275490000600287520226000293653002302553100001802576710001802594700001302612700001902625856013402644 2015 eng d00aDynamic relation of Chinese stock price-volume pre- and post- the Split Share Structure Reform: New evidence from a two-state Markov-switching approach0 aDynamic relation of Chinese stock pricevolume pre and post the S c2015 a386-4010 v53 aPurpose – The purpose of this paper is to identify the bull and bear regimes in Chinese stock market and empirically analyze the dynamic relation of Chinese stock price-volume pre- and post- the Split Share Structure Reform.

Design/methodology/approach – The authors investigate the price-volume relationship in the Chinese stock market before and after the Split Share Structure Reform using Shanghai Composite Index daily data from July 1994 to April 2013. Using a two-state Markov-switching autoregressive model and a modified two-state Markov-switching vector autoregression model, this study identifies bull or bear market and also examine the existence of regime-dependent Granger causality.

Findings – Using a two-state Markov-switching autoregressive model, the authors detect structural changes in the market volatility due to the reform, and find evidence of a positive rather than an asymmetric price-volume contemporaneous correlation. There is a strong dynamic Granger causal relation from stock returns to trading volume before and after the reform regardless of the market conditions, but the causal effects of volume on returns are only seen in the bear markets before the reform. The model is robust when using different stock indices and time periods.

Originality/value – The work is different from previous studies in the following aspects: most of the existing empirical literature focus on the well-developed economies, but our interest lies in the emerging Chinese market that has witnessed rapid growth in the past decade; in contrast to many works in the literature that examine the price-volume relationship during one market condition, the authors compare the relationship in a bull market with that in a bear market, using a two-state MS-AR model; the authors also employ a modified two-state Markov-switching vector autoregression model to examine the existence of regime-dependent Granger causality; as the most massive systematic reform for the Chinese stock market since its inception in 2005, the Split Share Structure Reform has a profound impact on the Chinese stock market, thus it is of vital importance to explore its effects on both the price-volume relationship and the market structure.10aBusiness Analytics1 aWang, Donghua1 aEmptyAuthNode1 aLei, Man1 aChang, Xiaohui u/biblio/dynamic-relation-chinese-stock-price-volume-pre-and-post-split-share-structure-reform-new01615nas a2200157 4500008004100000245008100041210006900122260000900191300001200200490000600212520105900218653002301277100001901300700002301319856011501342 2014 eng d00aWavelet Methods in Interpolation of High-Frequency Spatial-Temporal Pressure0 aWavelet Methods in Interpolation of HighFrequency SpatialTempora c2014 a52–680 v83 aThe location-scale and whitening properties of wavelets make them more favorable for interpolating high-frequency monitoring data than Fourier-based methods. In the past, wavelets have been used to simplify the dependence structure in multiple time or spatial series, but little has been done to apply wavelets as a modeling tool in a space–time setting, or, in particular, to take advantage of the localization of wavelets to capture the local dynamic characteristics of high-frequency meteorological data. This paper analyzes minute-by-minute atmospheric pressure data from the Atmospheric Radiation Measurement program using different wavelet coefficient structures at different scales and incorporating spatial structure into the model. This approach of modeling space–time processes using wavelets produces accurate point predictions with low uncertainty estimates, and also enables interpolation of available data from sparse monitoring stations to a high density grid and production of meteorological maps on large spatial and temporal scales.10aBusiness Analytics1 aChang, Xiaohui1 aStein, Michael, L. u/biblio/wavelet-methods-interpolation-high-frequency-spatial-temporal-pressure01375nas a2200157 4500008004100000245008800041210006900129260000900198300001400207490000700221520079900228653002301027100001901050700002301069856012501092 2013 eng d00aDecorrelation Property of Discrete Wavelet Transform Under Fixed-Domain Asymptotics0 aDecorrelation Property of Discrete Wavelet Transform Under Fixed c2013 a8001-80130 v593 aTheoretical aspects of the decorrelation property of the discrete wavelet transform when applied to stochastic processes have been studied exclusively from the increasing-domain perspective, in which the distance between neighboring observations stays roughly constant as the number of observations increases. To understand the underlying data-generating process and to obtain good interpolations, fixed-domain asymptotics, in which the number of observations increases in a fixed region, is often more appropriate than increasing-domain asymptotics. In the fixed-domain setting, we prove that, for a general class of inhomogeneous covariance functions, with suitable choice of wavelet filters, the wavelet transform of a nonstationary process has mostly asymptotically uncorrelated components.10aBusiness Analytics1 aChang, Xiaohui1 aStein, Michael, L. u/biblio/decorrelation-property-discrete-wavelet-transform-under-fixed-domain-asymptotics02113nas a2200157 4500008004000000245006700040210006700107260001800174520157000192653002301762100001901785700001601804700001501820700001701835856010301852 0 engd00aAdditive Dynamic Models for Correcting Numerical Model Outputs0 aAdditive Dynamic Models for Correcting Numerical Model Outputs c2023 In Press3 a
Numerical air quality models are pivotal for the prediction and assessment of air pollution, but numerical model outputs may be systematically biased. An additive dynamic model is proposed to correct large-scale raw model outputs using data from other sources, including readings collected at ground monitoring networks and weather outputs from other numerical models. An additive partially linear model specification is employed for the nonlinear relationships between air pollutants and covariates. In addition, a multi-resolution basis function approximate is proposed to capture the different small-scale variations of biases, and a discretized stochastic
integro-differential equation is constructed to characterize the dynamic evolution of the random coefficients at each spatial resolution. An expectation-maximization algorithm is developed for parameter estimation and a multi-resolution ensemble-based scheme is embedded to accelerate the computation. For statistical inference, a conditional simulation technique is applied to quantify the uncertainty of parameter estimates and bias correction results. The proposed approach is used to correct the biased raw outputs of PM2.5 from the Community Multiscale Air
Quality (CMAQ) system for China’s Beijing-Tianjin-Hebei region. Our method improves the root mean squared error and continuous rank probability score by 43.70% and 34.76%, respectively. Compared to other statistical methods under different metrics, our model has advantages in both correction accuracy and computational efficiency.10aBusiness Analytics1 aChang, Xiaohui1 aChen, Yewen1 aHuang, Hui1 aLuo, Fangzhi u/biblio/additive-dynamic-models-correcting-numerical-model-outputs