SELECTED PUBLICATIONS
- High Dimensional Latent Panel Quantile Regression with an Application to Asset Pricing
- with Alexandre Belloni (Duke University, Fuqua School of Business), Oscar Hernan Madrid Padilla (UCLA, Department of Statistics & Data Science), and Kevin Wang (Harvard Business School)
Annals of Statistics, Volume 51, Issue 1, Feb 2023, Pages 96-121.
Abstract
We propose a generalization of the linear panel quantile regression model to accommodate both sparse and dense parts: sparse means while the number of covariates available is large, potentially only a much smaller number of them have a nonzero impact on each conditional quantile of the response variable; while the dense part is represented by a low-rank matrix that can be approximated by latent factors and their loadings. Such a structure poses problems for traditional sparse estimators, such as the \( \ell_1 \)-penalized Quantile Regression, and for traditional latent factor estimators, such as PCA. We propose a new estimation procedure, based on the ADMM algorithm, consisting of combining the quantile loss function with \( \ell_1 \) and nuclear norm regularization. We show, under general conditions, our estimator can consistently estimate both the nonzero coefficients of the covariates and the latent low-rank matrix.
Our proposed model has a "Characteristics + Latent Factor" Asset Pricing Model interpretation: we apply our model and estimator with a large-dimensional panel of financial data and find that (i) characteristics have sparser predictive power once latent factors were controlled, and (ii) the factors and coefficients at upper and lower quantiles are different from the median.
- Analysis of Networks via the Sparse β-Model
- with Kengo Kato (Cornell University, Department of Statistics and Data Science) and Chenlei Leng (Warwick)
Journal of the Royal Statistical Society, Series B, Volume 83, Issue 5, Nov 2021, Pages 887-910.
Abstract
Data in the form of networks are increasingly available in a variety of areas, yet statistical models allowing for parameter estimates with desirable statistical properties for sparse networks remain scarce. To address this, we propose the Sparse \( \beta \)-Model (S\( \beta \)M), a new network model that interpolates the celebrated Erdős–Rényi model and the \( \beta \)-model that assigns one different parameter to each node. By a novel reparameterization of the \( \beta \)-model to distinguish global and local parameters, our S\( \beta \)M can drastically reduce the dimensionality of the \( \beta \)-model by requiring some of the local parameters to be zero. We derive the asymptotic distribution of the maximum likelihood estimator of the S\( \beta \)M when the support of the parameter vector is known. When the support is unknown, we formulate a penalized likelihood approach with the \( \ell_0 \)-penalty. Remarkably, we show via a monotonicity lemma that the seemingly combinatorial computational problem due to the \( \ell_0 \)-penalty can be overcome by assigning non-zero parameters to those nodes with the largest degrees. We further show that a \( \beta\)-min condition guarantees our method to identify the true model and provide excess risk bounds for the estimated parameters. The estimation procedure enjoys good finite sample properties as shown by simulation studies. The usefulness of the S\(\beta \)M is further illustrated via the analysis of a microfinance take-up example.
- Nonlinear Factor Models for Network and Panel Data
- with Iván Fernández-Val (BU) and Martin Weidner (University of Oxford)
Journal of Econometrics, Volume 220, Issue 2, Feb 2021, Pages 296-324.
Abstract
Factor structures or interactive effects are convenient devices to incorporate latent variables in panel data models. We consider fixed effect estimation of nonlinear panel single-index models with factor structures in the unobservables, which include logit, probit, ordered probit and Poisson specifications. We establish that fixed effect estimators of model parameters and average partial effects have normal distributions when the two dimensions of the panel grow large, but might suffer of incidental parameter bias. We show how models with factor structures can also be applied to capture important features of network data such as reciprocity, degree heterogeneity, homophily in latent variables and clustering. We illustrate this applicability with an empirical example to the estimation of a gravity equation of international trade between countries using a Poisson model with multiple factors.
- A Comparative Study of Causal Reward Design
- with Xinlei Pan (UC Berkeley)
- Causal Machine Learning Workshop (ICML 2018)
- Spotlight presentation
- Counterfactual: An R Package for Counterfactual Analysis
- with Victor Chernozhukov (MIT), Iván Fernández-Val (BU), and Blaise Melly (University of Bern)
- The R Journal, Volume 9, Issue 1, June 2017, pp. 370-384
SUBMITTED/UNDER REVISION
- Quantile Graphical Models: Prediction and Conditional Independence with Applications to Systemic Risk
- with Alexandre Belloni (Duke) and Victor Chernozhukov (MIT)
Resubmitted to Journal of Econometrics, September 12, 2024
Abstract
We propose two types of Quantile Graphical Models: i) Conditional Independence Quantile Graphical Models (CIQGMs) characterize the conditional independence by evaluating the distributional dependence structure at each quantile index, as such, those can be used for validation of the graph structure in the causal graphical models; ii) Prediction Quantile Graphical Models (PQGMs) characterize the statistical dependencies through the graphs of the best linear predictors under asymmetric loss functions. PQGMs make weaker assumptions than CIQGMs as they allow for misspecification. One advantage of these models is that we can apply them to large collections of variables driven by non-Gaussian and non-separable shocks. Because of QGMs’ ability to handle large collections of variables and focus on specific parts of the distributions, we could apply them to quantify tail interdependence. The resulting tail risk network can be used for measuring systemic risk contributions that help make inroads in understanding international financial contagion and dependence structures of returns under downside market movements.
We develop estimation and inference methods focusing on the high-dimensional case, where the number of nodes in the graph is large as compared to the number of observations. For CIQGMs, these results include valid simultaneous choices of penalty functions, uniform rates of convergence, and confidence regions that are simultaneously valid. We also derive analogous results for PQGMs, which include new results for penalized quantile regressions in high-dimensional settings to handle misspecification, many controls, and a continuum of additional conditioning events.
- Deep Reinforcement Learning in a Monetary Model
with Andreas Joseph (Bank of England), Michael Kumhof (Bank of England), Xinlei Pan (UC Berkeley), Xuan Zhou (Reserve Bank of Australia)
Abstract
We study deep reinforcement learning (DRL) as a general approach to bounded rationality problems in dynamic stochastic general equilibrium models. Agents are represented by deep artificial neural networks and learn to maximize their intertemporal utility function by interacting with a model environment, of which they have no a priori knowledge. We apply our approach to a classical model from the adaptive learning literature, which looks at the interaction between monetary and fiscal policy. We find that, contrary to adaptive learning, the DRL household can solve the model, in the sense of learning the utility-maximizing steady state, in all policy regimes. However, learning may not be stable without the use of early stopping criteria. This has wider implications for the use of this class of models.
- An MM algorithm for Fixed Effects in Multinomial Models with an Application to Payment Choice in Grocery Stores
with Marc Rysman (BU), Shuang Wang (BU), Krzysztof Wozniak (Federal Reserve Board)
Abstract
Estimating multinomial models with many fixed effects faces prohibitive computational challenges. We develop a new method based on the Minorization-Maximization (MM) algorithm to address this issue. We provide a new proof of consistency of the MM algorithm. We apply our model to payment choice in grocery stores. Using rich transaction-level panel data of household purchases in a novel way, we estimate a multinomial logit discrete choice model with over 1 million fixed effects. We analyze switching in the short and long-run, focusing on determinants such as transaction size and the evolution of preferences.
- Estimation of Nonlinear Panel Models with Multiple Unobserved Effects
Reject &Resubmit, Review of Economics and Statistics
Abstract
I propose a fixed effects expectation-maximization (EM) estimator that can be applied to a class of nonlinear panel data models with unobserved heterogeneity, which is modeled as individual effects and/or time effects. Of particular interest is the case of interactive effects, ie when the unobserved heterogeneity is modeled as a factor analytical structure. The estimator is obtained through a computationally simple, iterative two-step procedure, where the two steps have closed form solutions. I show that estimator is consistent in large panels and derive the asymptotic distribution for the case of the probit with interactive effects. I develop analytical bias corrections to deal with the incidental parameter problem. Monte Carlo experiments demonstrate that the proposed estimator has good finite-sample properties. I illustrate the use of the proposed model and estimator with an application to international trade networks.
REFEREE EXPERIENCE
Journal of Econometrics (20), Journal of Business & Economic Statistics (6), Econometric Reviews, Econometric Theory (7), Statistica Sinica, Annals of Economics and Statistics, Economic Inquiry, Econometrics and Statistics, Biometrika, Journal of the American Statistical Association (4), Review of Economics and Statistics (4), Bank of England Staff Working Paper Series, Oxford Bulletin of Economics and Statistics, Annals of Statistics (2), Journal of Machine Learning Research, Journal of Applied Econometrics (2), Electronic Journal of Statistics, Review of Economic Studies (2), Journal of the Royal Statistical Society: Series B (2), Journal of Machine Learning Research, Quantitative Economics, Biometrics, Management Science