A Novel Grouped-Gram-Based Algorithm for Fast and Memory-Efficient Fixed Effects Estimation
Keywords:
- Fixed effects,
- Panel data,
- Grouped Gram matrix,
- Within transformation,
- Computational econometrics,
- Memory-efficient algorithms,
- Estimation in high-dimensional models
Abstract
Fixed effects models often rely on the within transformation, which constructs demeaned arrays prior to forming cross-products. This paper develops an estimator that avoids the formation of demeaned arrays by exploiting grouped summaries built from per-unit sufficient statistics. A complete derivation shows that the grouped Gram representation reproduces the classical estimator exactly. The difference lies in memory access patterns and byte movement. The grouped estimator concentrates operations into unit-level accumulations, avoiding the writes associated with array centering. Gains arise once the panel reaches a scale where memory traffic governs run time. Simulations examine coefficient accuracy, bootstrap dispersion, run time and memory use.
Downloads
References
Arellano M (2003) Panel data econometrics. Oxford Uni Press. https://books.google.co.in/books?hl=en&lr=&id=OQ1REAAAQBAJ&oi=fnd&pg=PR7&dq=1.%09Arellano,+M.+(2003).+Panel+Data+Econometrics.+Oxford+University+Press&ots=HBgCUj-Otc&sig=hxZ5vya99Nm3HYbkP2KrCYetrwo&redir_esc=y#v=onepage&q&f=false
Wooldridge JM (2010) Econometric analysis of cross section and panel data. MIT Press. https://books.google.co.in/books?hl=en&lr=&id=hSs3AgAAQBAJ&oi=fnd&pg=PP1&dq=19.%09Wooldridge,+J.+M.+(2010).+Econometric+Analysis+of+Cross+Section+and+Panel+Data.+MIT+Press&ots=VZRVoA_XTn&sig=EfXcYjj2g2THwuvQxZ_PUzqoTmA&redir_esc=y#v=onepage&q=19.%09Wooldridge%2C%20J.%20M.%20(2010).%20Econometric%20Analysis%20of%20Cross%20Section%20and%20Panel%20Data.%20MIT%20Press&f=false
Hausman JA (1978) Specification tests in econometrics. Econ Soc 46: 1251-1271. https://www.jstor.org/stable/1913827
Hsiao C (2014) Analysis of panel data. Cambridge Uni Press. https://www.cambridge.org/core/books/analysis-of-panel-data/A774C63FF969DA1944A3F91501702C65
Golub GH, Van Loan CF (2013) Matrix computations 4th edition. Johns Hopkins Uni Press. https://math.ecnu.edu.cn/~jypan/Teaching/books/2013 Matrix Computations 4th.pdf
Gentle JE (2007) Matrix algebra: Theory, computations and applications in statistics. Springer. https://link.springer.com/chapter/10.1007/978-0-387-70873-7_3
Chan TF, Golub GH, LeVeque RJ (1982) Updating formulae and a pairwise algorithm for computing sample variances. Physica, Heidelberg. https://link.springer.com/chapter/10.1007/978-3-642-51461-6_3
Graham SL, Snir M, Patterson CA (2005) Getting up to speed: The future of supercomputing. Natl Res Counc Rep. https://books.google.co.in/books?hl=en&lr=&id=oadVAgAAQBAJ&oi=fnd&pg=PA1&ots=pATGl8b2vG&sig=uQNCozh8-e2cOHDKn7Ib-LDLgWk&redir_esc=y#v=onepage&q&f=false
Williams SW, Waterman A, Patterson DA (2009) Roofline: An insightful visual performance model for floating-point programs and multicore architectures. Comm ACM 52: 65-76. https://dl.acm.org/doi/10.1145/1498765.1498785
Demmel J, Grigori L, Hoemmen M, Langou J (2007) Communication-avoiding linear algebra. SIAM Rev 53: 1-20.
Hong JW, Kung HT (1981) I/O complexity: The red-blue pebble game. 326-333. https://dl.acm.org/doi/abs/10.1145/800076.802486
Cameron AC, Gelbach JB, Miller DL (2008) Bootstrap-based improvements for inference with clustered errors. Rev Econ Stat 90: 414-427. https://direct.mit.edu/rest/article-abstract/90/3/414/57731/Bootstrap-Based-Improvements-for-Inference-with?redirectedFrom=fulltext
Davison AC, Hinkley DV (1997) Bootstrap methods and their application. Cambridge Uni Press. https://books.google.co.in/books?hl=en&lr=&id=4aCDbm_t8jUC&oi=fnd&pg=PR7&dq=4.%09Davison,+A.+C.,+%26+Hinkley,+D.+V.+(1997).+Bootstrap+Methods+and+Their+Application.+Cambridge+University+Press&ots=m_32iaV118&sig=Q_U_rI1-cSinyGonpx9erAl_bEk&redir_esc=y#v=onepage&q=4.%09Davison%2C%20A.%20C.%2C%20%26%20Hinkley%2C%20D.%20V.%20(1997).%20Bootstrap%20Methods%20and%20Their%20Application.%20Cambridge%20University%20Press&f=false
McCalpin J (2006) STREAM: Sustainable memory bandwidth in high performance computers. https://cir.nii.ac.jp/crid/1572824500380421760

