×

Econometrics pedagogy and cloud computing: training the next generation of economists and data scientists. (English) Zbl 1462.62763

Summary: This paper describes how cloud computing tools widely used in the instruction of data scientists can be introduced and taught to economics students as part of their curriculum. The demonstration centers around a workflow where the instructor creates a virtual server and the students only need Internet access and a web browser to complete in-class tutorials, assignments, or exams. Given how prevalent cloud computing platforms are becoming for data science, introducing these techniques into students’ econometrics training would prepare them to be more competitive when job hunting, while making instructors and administrators re-think what a computer laboratory means on campus.

MSC:

62R07 Statistical aspects of big data and data science
62P20 Applications of statistics to economics
62-08 Computational methods for problems pertaining to statistics
PDFBibTeX XMLCite
Full Text: DOI

References:

[1] Athey, S., and L. Michael. 2019. “Economists (and Economics) in Tech Companies.” Journal of Economic Perspectives 33(1): 209-30. URL https://www.aeaweb.org/articles?id=10.1257/jep.33.1.209.
[2] Bryan, J. 2018. “Excuse Me, Do You Have a Moment to Talk About Version Control?.” American Statistician 72(1): 20-7, doi:10.1080/00031305.2017.1399928. · Zbl 07663914 · doi:10.1080/00031305.2017.1399928
[3] Cicero, M. T. 44 BCE. “De Divinatione, II.(2).4.” In Loeb Classical Library (1923), English translation by W. A. Falconer, Cicero, Vol. 20, 375, as transcribed by Bill Thayer, https://penelope.uchicago.edu/Thayer/E/Roman/Texts/Cicero/de_Divinatione/2*.html#R2 (retrieved on September 23, 2020).
[4] Fiksel, J., L. R. Jager, J. S. Hardin, and M. A. Taub. 2019. “Using GitHub Classroom to Teach Statistics.” Journal of Statistics Education 27(2): 110-19, doi:10.1080/10691898.2019.1617089. · doi:10.1080/10691898.2019.1617089
[5] Hansen, B. 2020. Econometrics. Retrieve from https://www.ssc.wisc.edu/∼bhansen/econometrics/.
[6] Ho, A. T. Y., K. P. Huynh, D. T. Jacho-Chavez, and D. Rojas. forthcoming. “Data Science in Stata 16: Frames, Lasso, and Python Integration.” Journal of Statistical Software.
[7] Jupyter Project, B. Douglas, D. Bourgin, A. Brown, M. Bussonnier, J. Frederic, B. Granger, T. Griffiths, J. Hamrick, K. Kelley, M. Pacer, and L. Page. 2019. “nbgrader: A Tool for Creating and Grading Assignments in the Jupyter Notebook.” Journal of Open Source Education 2(11): 32, doi:10.21105/jose.00032. · doi:10.21105/jose.00032
[8] Kaplan, D. 2018. “Teaching Stats for Data Science.” American Statistician 72: 89-96, doi:10.1080/00031305.2017.1398107. · Zbl 07663923 · doi:10.1080/00031305.2017.1398107
[9] Koehler, F. J., and S. Kim. 2020. “Interactive Classrooms with Jupyter and Python.” The Mathematics Teacher 111: 304, doi:10.5951/mathteacher.111.4.0304. · doi:10.5951/mathteacher.111.4.0304
[10] Perkel, J. M. 2018. “Why Jupyter is Data Scientists’ Computational notebook of Choice.” Nature 563(7729): 145-6, doi:10.1038/d41586-018-07196-1. · doi:10.1038/d41586-018-07196-1
[11] Popescu, D. A., N. Zilberman, and A. W. Moore. 2017. Characterizing the Impact of Network Latency on Cloud-Based Applications’ performance. Tech. Rep. UCAM-CL-TR-914. University of Cambridge, Computer Laboratory. URL https://www.cl.cam.ac.uk/techreports/UCAM-CL-TR-914.pdf.
[12] Stackoverflow. 2019. Stack Overflow’s Annual Developer Survey. Online. URL https://insights.stackoverflow.com/survey/2019.
[13] Wikipedia. 2020a. Distributed Version Control. Wikipedia. URL https://en.wikipedia.org/wiki/Distributed_version_control.
[14] Wikipedia. 2020b. Graphical User Interface. Wikipedia. URL https://en.wikipedia.org/wiki/Graphical_user_interface.
[15] Wikipedia. 2020c. HTML. Wikipedia. URL https://en.wikipedia.org/wiki/HTML.
[16] Wikipedia. 2020d. Integrated Development Environment. Wikipedia. URL https://en.wikipedia.org/wiki/Integrated_development_environment.
[17] Wikipedia. 2020e. Jupyter Kernels. Wikipedia. URL https://en.wikipedia.org/wiki/Project_Jupyter#Jupyter_kernels.
[18] Wikipedia. 2020f. Live Coding. Wikipedia. URL https://en.wikipedia.org/wiki/Live_coding.
[19] Wikipedia. 2020g. Machine Learning. Wikipedia. URL https://en.wikipedia.org/wiki/Machine_learning.
[20] Wikipedia. 2020h. Markdown. Wikipedia. URL https://en.wikipedia.org/wiki/Markdown.
[21] Wikipedia. 2020i. Open Source. Wikipedia. URL https://en.wikipedia.org/wiki/Open_source.
[22] Wikipedia. 2020j. Programming Language. Wikipedia. URL https://en.wikipedia.org/wiki/Programming_language.
[23] Wikipedia. 2020k. Software Repository. Wikipedia. URL https://en.wikipedia.org/wiki/Software_repository.
[24] Wikipedia. 2020l. Virtual Machine. Wikipedia. URL https://en.wikipedia.org/wiki/Virtual_machine.
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.