Large N, Small T, Multiple P: A Causal Matrix Completion Method for CRM Panel Data
dc.contributor.author | Jiang, Zhongming | |
dc.contributor.advisor | Feinberg, Fred | |
dc.date.accessioned | 2024-06-25T14:17:07Z | |
dc.date.available | 2024-06-25T14:17:07Z | |
dc.date.issued | 2024 | |
dc.identifier.uri | https://hdl.handle.net/2027.42/193938 | |
dc.description.abstract | The prototypical customer relationship management (CRM) panel structure is composed of many customers (large N), with short histories (small T), and multiple outcome metrics (multiple P). Our paper aims to tackle the challenges of causal inference that firms face in such CRM settings, which are additionally characterized by unobserved heterogeneity, time dynamics, and staggered adoption. Despite the success of synthetic control methods (SCM) in contemporary marketing applications, extant variants typically necessitate "small N, large T" data regimes to be performant - e.g., a handful of firm- or jurisdiction-level donor units, each with long time series. To extend to the "large N, small T, multiple P" setting, we bridge SCM to the broader causal matrix completion (MC) paradigm and leverage the "multiple P" ubiquitous to contemporary CRM: the presence of multiple outcomes enables a shared matrix singular value decomposition (cf. SCM's factorization), which helps jointly identify individual-level latent factors to establish conditional ignorability, compensating for overall short time series at the customer level. We employ a Bayesian causal inference approach, specifying a joint posterior of the nonrandom missingness of potential outcomes, together with the likelihood of the observed outcomes. We introduce two distinct variants of Bayesian causal MC models, each estimated independently through the implementation of the Gibbs sampling (independent multiple P's) and the Hamiltonian Monte Carlo (concurrent multiple P's) -based data augmentation procedure. We empirically illustrate our approach through a comprehensive customer-level database of gift card purchases and redemptions from a U.S. hospitality startup. We compare the effectiveness with extant SCM under the German reunification empirical study and devise a generalized framework for marketing and statistics researchers applicable to a wide range of CRM panel structures. | |
dc.subject | Customer-Base Analysis | |
dc.subject | Bayesian Causal Inference | |
dc.subject | Synthetic Control Method | |
dc.subject | Panel Data | |
dc.subject | Latent Factor Model | |
dc.title | Large N, Small T, Multiple P: A Causal Matrix Completion Method for CRM Panel Data | |
dc.type | Thesis | |
dc.description.thesisdegreename | Honors (Bachelor's) | |
dc.description.thesisdegreediscipline | Statistics | en_US |
dc.description.thesisdegreegrantor | University of Michigan | |
dc.subject.hlbsecondlevel | Statistics | |
dc.subject.hlbtoplevel | Science | |
dc.contributor.affiliationum | Statistics | |
dc.contributor.affiliationumcampus | Ann Arbor | |
dc.description.bitstreamurl | http://deepblue.lib.umich.edu/bitstream/2027.42/193938/1/jiangzm.pdf | |
dc.identifier.doi | https://dx.doi.org/10.7302/23420 | |
dc.working.doi | 10.7302/23420 | en |
dc.owningcollname | Honors Theses (Bachelor's) |
Files in this item
Remediation of Harmful Language
The University of Michigan Library aims to describe its collections in a way that respects the people and communities who create, use, and are represented in them. We encourage you to Contact Us anonymously if you encounter harmful or problematic language in catalog records or finding aids. More information about our policies and practices is available at Remediation of Harmful Language.
Accessibility
If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.