Our keynote speakers are Dr. Julia Lane (Professor, Wagner School of Public Policy at New York University; Center for Urban Science and Progress; Provostial Fellow) and Dr. Tom Smith (Managing Director, Data Science Campus of the Office for National Statistics, UK). The keynote is moderated by Dr. Craig Hill (Senior Vice President, Survey, Computing, and Statistical Sciences, RTI International, USA).
Dr. Julia Lane will give a keynote on “Automating Metadata Documentation: A New Initiative.” The effective use of data depends critically on knowing how it has been produced and used before. The required elements are: what the data measure, what research has been done by which researchers, with what code, and with what results. Acquiring that knowledge has historically been a manual process carried out by survey specialists. In the big data world, data dictionaries are sparse, incorrect, or non-existent. There is almost no documentation of the thousands of pre-processing decisions that are inherent in any analysis. Reproducibility has been almost impossible.
This presentation describes a new initiative to build and develop automated tools to create the equivalent of an Amazon.com or TripAdvisor for the access and use of microdata. The goal of the project is to automate the collection and codification of knowledge about sensitive data sets from publications and people.
Dr. Tom Smith will give a keynote on “Data Science for Public Good.” The increasing availability of data sources (such as consumer transaction data, unstructured text, and high-resolution images) provides great opportunities to strengthen our statistical understanding of the world. This presentation shows how national statistics institutes are using non-traditional data sources alongside survey data, building up their big data and data science capabilities, and collaborating across sectors and disciplines.