In this workshop, attendees will work at their own pace to learn basic data science tasks in Pandas. Pandas is a fantastic Python package which provides data structures and analysis tools for data science tasks. The workshop will cover the data structures, selection, mapping functions, reductions, statistics, input/output, pivot tables, grouping, and time-series data. Basic knowledge of Python is required. Attendees should be familiar with the syntax, using lists, and basic knowledge of lambdas.
What is a Data Management Plan? This session will answer that question, as well as describe the steps to creating a DMP, tools that can help with DMP development, and post-award management issues. University of Pittsburgh-specific guidelines and support resources will also be shared.
Many funders, publishers, and institutions require researchers to make their research data public, but practical challenges can act as a barrier to sharing data, especially in the health sciences. This hands-on workshop will guide participants through the data sharing process, from initial study design to data deposit. Exercises will prompt participants to think through issues of data documentation, reuse value, and promotion of their own research projects.
Microsoft Excel is a commonly used program to record and store datasets with headings, rows, and columns. In this class, we will explore data with sorting and filtering functions, and transform data into summary tables. You will work through data examples to create pivot tables, apply conditional formatting, and prepare your figures for use in other programs.
Learn how to keep your data safe AND preserve it for future use by following a few simple rules. File formats, file-naming conventions, repositories, storage options and more will be discussed.
In this hands-on workshop, learn how to manage your work with the version control system Git. Git helps keep your files safe from accidental deletion, tracks who made what change when, and lets multiple people work on the same project without overwriting each other's work. We'll cover using Git from the Unix shell and through Github online. No previous experience with the command line is necessary, although some basic knowledge is recommended.
This workshop will cover the basics of R programming for data analysis and graphics using R Studio. Upon completion participants will be able to:
In this follow up session we will review the hands-on exercise questions distributed in the previous week's Introduction to R class.
Registration is not required.
To attend, use the same Zoom link you received upon registration for the original class.
In this class, learn the fundamentals of keeping your data secure and organized through brief introductions to the core areas of data management: file storage and organization, file documentation, data preservation, and data publication and/or data sharing. This class is intended for graduate students and researchers who are working on long-term research projects, or for anyone who wants to make sure their personal files are safe for the long-term.
You've collected your data. Now what? In this class we will learn how to use Tableau to demonstrate the significance of your data.
Need to find a dataset to act as a control for your study? Or do you want to reuse open access data? This class will offer tips for locating and citing data and include hands-on exercises to explore directories of data repositories and data journals.
Do you have data that require bioinformatics analysis? Are you concerned about scientific rigor and reproducibility? Come learn about the “4 C’s” available to Pitt researchers: Core facilities, Collaboration with bioinformaticians, Coding, and Commercially-licensed tools. Make an informed decision on the best option(s) for your data needs.
Need to find a dataset to act as a control for your study? Or do you want to reuse open access data? This workshop offers tips for locating and citing data, and includes hands-on exercises to explore directories of data repositories and data journals.
Do you want to track and organize your projects more efficiently, especially in a remote or distributed environment? Are you writing code or manuscripts with others and need to know who did what, when? In this class, learn the basics of version control and how it helps keep your work safe and reliable. Then we'll dive into Github to see how it tracks the changes you or your collaborators make to uploaded files, and how that can help make your research more reproducible.
Did you know that for each minute of planning at the beginning of a project, you will save yourself roughly 10 minutes of headache later? This session will provide practical tips for organizing, naming, documenting, storing and preserving your data.