All resources in Educators

Introduction to the Command Line for Economics

(View Complete Item Description)

Command line interface (OS shell) and graphic user interface (GUI) are different ways of interacting with a computer’s operating system. The shell is a program that presents a command line interface which allows you to control your computer using commands entered with a keyboard instead of controlling graphical user interfaces (GUIs) with a mouse/keyboard combination. There are quite a few reasons to start learning about the shell: The shell gives you power. The command line gives you the power to do your work more efficiently and more quickly. When you need to do things tens to hundreds of times, knowing how to use the shell is transformative. To use remote computers or cloud computing, you need to use the shell.

Material Type: Module

Authors: Andras Vereckei, Arieda Muço, Miklós Koren

Programming with R

(View Complete Item Description)

The best way to learn how to program is to do something useful, so this introduction to R is built around a common scientific task: data analysis. Our real goal isn’t to teach you R, but to teach you the basic concepts that all programming depends on. We use R in our lessons because: we have to use something for examples; it’s free, well-documented, and runs almost everywhere; it has a large (and growing) user base among scientists; and it has a large library of external packages available for performing diverse tasks. But the two most important things are to use whatever language your colleagues are using, so you can share your work with them easily, and to use that language well. We are studying inflammation in patients who have been given a new treatment for arthritis, and need to analyze the first dozen data sets of their daily inflammation. The data sets are stored in CSV format (comma-separated values): each row holds information for a single patient, and the columns represent successive days. The first few rows of our first file look like this: 0,0,1,3,1,2,4,7,8,3,3,3,10,5,7,4,7,7,12,18,6,13,11,11,7,7,4,6,8,8,4,4,5,7,3,4,2,3,0,0 0,1,2,1,2,1,3,2,2,6,10,11,5,9,4,4,7,16,8,6,18,4,12,5,12,7,11,5,11,3,3,5,4,4,5,5,1,1,0,1 0,1,1,3,3,2,6,2,5,9,5,7,4,5,4,15,5,11,9,10,19,14,12,17,7,12,11,7,4,2,10,5,4,2,2,3,2,2,1,1 0,0,2,0,4,2,2,1,6,7,10,7,9,13,8,8,15,10,10,7,17,4,4,7,6,15,6,4,9,11,3,5,6,3,3,4,2,3,2,1 0,1,1,3,3,1,3,5,2,4,4,7,6,5,3,10,8,10,6,17,9,14,9,7,13,9,12,6,7,7,9,6,3,2,2,4,2,0,1,1 We want to: load that data into memory, calculate the average inflammation per day across all patients, and plot the result. To do all that, we’ll have to learn a little bit about programming.

Material Type: Module

Authors: Diya Das, Katrin Leinweber, Rohit Goswami

Programming with MATLAB

(View Complete Item Description)

The best way to learn how to program is to do something useful, so this introduction to MATLAB is built around a common scientific task: data analysis. Our real goal isn’t to teach you MATLAB, but to teach you the basic concepts that all programming depends on. We use MATLAB in our lessons because: we have to use something for examples; it’s well-documented; it has a large (and growing) user base among scientists in academia and industry; and it has a large library of packages available for performing diverse tasks. But the two most important things are to use whatever language your colleagues are using, so that you can share your work with them easily, and to use that language well.

Material Type: Module

Author: Gerard Capes

Python for Humanities

(View Complete Item Description)

Python is a general purpose programming language that is useful for writing scripts to work effectively and reproducibly with data. This is an introduction to Python designed for participants with no programming experience. These lessons can be taught in a day (~ 6 hours). They start with some basic information about Python syntax, the Jupyter notebook interface, and move through how to import CSV files, using the pandas package to work with data frames, how to calculate summary information from a data frame, and a brief introduction to plotting. The last lesson demonstrates how to work with databases directly from Python.

Material Type: Module

Author: Iain Emsley

Introduction to web scraping

(View Complete Item Description)

Web scraping is the process of extracting data from websites. Some data that is available on the web is presented in a format that makes it easier to collect and use it, for example in the form of downloadable comma-separated values (CSV) datasets that can then be imported in a spreadsheet or loaded into a data analysis script. Often however, even though it is publicly available, data is not readily available for reuse. For example it can be contained in a PDF, or a table on a website, or spread across multiple web pages. There are a variety of ways to scrape a website to extract information for reuse. In its simplest form, this can be achieved by copying and pasting snippets from a web page, but this can be unpractical if there is a large amount of data to be extracted, or if it spread over a large number of pages. Instead, specialized tools and techniques can be used to automate this process, by defining what sites to visit, what information to look for, and whether data extraction should stop once the end of a page has been reached, or whether to follow hyperlinks and repeat the process recursively. Automating web scraping also allows to define whether the process should be run at regular intervals and capture changes in the data.

Material Type: Module

Authors: Belinda Weaver, Joshua Dull, Thomas Guignard

OSF in the Classroom

(View Complete Item Description)

This webinar will introduce how to use the Open Science Framework (OSF; https://osf.io) in a Classroom. The OSF is a free, open source web application built to help researchers manage their workflows. The OSF is part collaboration tool, part version control software, and part data archive. The OSF connects to popular tools researchers already use, like Dropbox, Box, Github and Mendeley, to streamline workflows and increase efficiency. This webinar will discuss how to introduce reproducible research practices to students, show ways of tracking student activity, and introduce the use of Templates and Forks on the OSF to allow students to easily make new class projects. The OSF is the flagship product of the Center for Open Science, a non-profit technology start-up dedicated to improving the alignment between scientific values and scientific practices. Learn more at cos.io and osf.io, or email contact@cos.io.

Material Type: Lecture

Author: Center for Open Science

Foster Open Science

(View Complete Item Description)

The FOSTER portal is an e-learning platform that brings together the best training resources addressed to those who need to know more about Open Science, or need to develop strategies and skills for implementing Open Science practices in their daily workflows. Here you will find a growing collection of training materials. Many different users - from early-career researchers, to data managers, librarians, research administrators, and graduate schools - can benefit from the portal. In order to meet their needs, the existing materials will be extended from basic to more advanced-level resources. In addition, discipline-specific resources will be created.

Material Type: Full Course

Author: FOSTER Open Science

Tools for Reproducible Research

(View Complete Item Description)

Course summary A minimal standard for data analysis and other scientific computations is that they be reproducible: that the code and data are assembled in a way so that another group can re-create all of the results (e.g., the figures in a paper). The importance of such reproducibility is now widely recognized, but it is still not so widely practiced as it should be, in large part because many computational scientists (and particularly statisticians) have not fully adopted the required tools for reproducible research. In this course, we will discuss general principles for reproducible research but will focus primarily on the use of relevant tools (particularly make, git, and knitr), with the goal that the students leave the course ready and willing to ensure that all aspects of their computational research (software, data analyses, papers, presentations, posters) are reproducible.

Material Type: Full Course

Author: Karl Broman

Reproducible Research Methods

(View Complete Item Description)

This is the website for the Autumn 2014 course “Reproducible Research Methods” taught by Eric C. Anderson at NOAA’s Southwest Fisheries Science Center. The course meets on Tuesdays and Thursdays from 3:30 to 4:30 PM in Room 188 of the Fisheries Ecology Division. It runs from Oct 7 to December 18. The goal of this course is for scientists, researchers, and students to learn: to write programs in the R language to manipulate and analyze data, to integrate data analysis with report generation and article preparation using knitr, to work fluently within the Rstudio integrated development environment for R, to use git version control software and GitHub to effectively manage source code, collaborate efficiently with other researchers, and neatly package their research.

Material Type: Full Course

Author: Eric C. Anderson

Reproducible Science Curriculum Lesson for Publication

(View Complete Item Description)

Workshop goals - Why are we teaching this - Why is this important - For future and current you - For research as a whole - Lack of reproducibility in research is a real problem Materials and how we'll use them - Workshop landing page, with - links to the Materials - schedule Structure oriented along the Four Facets of Reproducibility: - Documentation - Organization - Automation - Dissemination Will be available after the Workshop How this workshop is run - This is a Carpentries Workshop - that means friendly learning environment - Code of Conduct - active learning - work with the people next to you - ask for help

Material Type: Module

Authors: Dave Clements, Hilmar Lapp, Karen Cranston

Reproducible Science Curriculum Lesson for Automation

(View Complete Item Description)

Workshop goals - Why are we teaching this - Why is this important - For future and current you - For research as a whole - Lack of reproducibility in research is a real problem Materials and how we'll use them - Workshop landing page, with - links to the Materials - schedule Structure oriented along the Four Facets of Reproducibility: - Documentation - Organization - Automation - Dissemination Will be available after the Workshop How this workshop is run - This is a Carpentries Workshop - that means friendly learning environment - Code of Conduct - active learning - work with the people next to you - ask for help

Material Type: Module

Authors: François Michonneau, Kim Gilbert, Matt Pennell

Reproducible Science Curriculum Lesson for Version Control

(View Complete Item Description)

Workshop goals - Why are we teaching this - Why is this important - For future and current you - For research as a whole - Lack of reproducibility in research is a real problem Materials and how we'll use them - Workshop landing page, with - links to the Materials - schedule Structure oriented along the Four Facets of Reproducibility: - Documentation - Organization - Automation - Dissemination Will be available after the Workshop How this workshop is run - This is a Carpentries Workshop - that means friendly learning environment - Code of Conduct - active learning - work with the people next to you - ask for help

Material Type: Module

Authors: Ciera Martinez, Hilmar Lapp, Karen Cranston

Reproducible Science Curriculum Lesson for Literate Programming

(View Complete Item Description)

Workshop goals - Why are we teaching this - Why is this important - For future and current you - For research as a whole - Lack of reproducibility in research is a real problem Materials and how we'll use them - Workshop landing page, with - links to the Materials - schedule Structure oriented along the Four Facets of Reproducibility: - Documentation - Organization - Automation - Dissemination Will be available after the Workshop How this workshop is run - This is a Carpentries Workshop - that means friendly learning environment - Code of Conduct - active learning - work with the people next to you - ask for help

Material Type: Module

Authors: Ciera Martinez, Courtney Soderberg, Hilmar Lapp, Jennifer Bryan, Kristina Riemer, Naupaka Zimmerman

Reproducible Science Curriculum Lesson for Organization

(View Complete Item Description)

Workshop goals - Why are we teaching this - Why is this important - For future and current you - For research as a whole - Lack of reproducibility in research is a real problem Materials and how we'll use them - Workshop landing page, with - links to the Materials - schedule Structure oriented along the Four Facets of Reproducibility: - Documentation - Organization - Automation - Dissemination Will be available after the Workshop How this workshop is run - This is a Carpentries Workshop - that means friendly learning environment - Code of Conduct - active learning - work with the people next to you - ask for help

Material Type: Module

Authors: Ciera Martinez, Courtney Soderberg, Hilmar Lapp, Jennifer Bryan, Kristina Riemer, Naupaka Zimmerman

Introduction materials for Reproducible Research Curriculum

(View Complete Item Description)

Workshop goals - Why are we teaching this - Why is this important - For future and current you - For research as a whole - Lack of reproducibility in research is a real problem Materials and how we'll use them - Workshop landing page, with - links to the Materials - schedule Structure oriented along the Four Facets of Reproducibility: - Documentation - Organization - Automation - Dissemination Will be available after the Workshop How this workshop is run - This is a Carpentries Workshop - that means friendly learning environment - Code of Conduct - active learning - work with the people next to you - ask for help

Material Type: Module

Authors: Kristina Riemer, Mine Çetinkaya-Rundel, Pat Schloss, Paul Magwene