Our partner, the Earlham Institute, is offering a workshop for researchers and technical workers with a background in biology and a basic knowledge of Python who want to work with large, complex datasets, mine them for biological insights, and create visualizations to display the results.
Much of Python's popularity stems from the availability of high-quality libraries of existing code that we can use in our own projects. Libraries (“packages”, in Python terminology) are even more useful when they are designed to work together.
For scientific programming, we are lucky to have a collection of mature packages which work together to form a stack:
- numpy for numerical processing
- pandas for reading, cleaning and processing tabular data files
- matplotlib as a low-level charting library
- seaborn as a high-level charting library for rapid dataset exploration through visualization
In this course you will learn how to use these packages together to quickly explore large biological datasets, find meaningful patterns in the data, and present your results clearly. The focus will be on the high-level packages, pandas and seaborn, as these let you do the most work with the smallest amount of code. Because the entire course concentrates on just two packages, it can cover a large part of what these tools can do.
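To give a flavour of how the high-level packages fit together, here is a minimal sketch; it is not taken from the course material. The data are invented (random expression values for two made-up conditions), and the column names `condition` and `expression` are assumptions chosen purely for illustration.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend, so the sketch runs without a display

import numpy as np
import pandas as pd
import seaborn as sns

# Hypothetical data: random "expression" values for two invented conditions
rng = np.random.default_rng(0)
df = pd.DataFrame({
    "condition": ["control"] * 50 + ["treated"] * 50,
    "expression": np.concatenate([
        rng.normal(5.0, 1.0, 50),  # control group, centred on 5
        rng.normal(7.0, 1.0, 50),  # treated group, centred on 7
    ]),
})

# pandas: summarise each group in a single line
summary = df.groupby("condition")["expression"].agg(["mean", "std", "count"])
print(summary)

# seaborn: compare the two distributions with one call
ax = sns.boxplot(data=df, x="condition", y="expression")
ax.set_title("Expression by condition (made-up data)")
```

numpy generates the numbers, pandas holds and summarises the table, and seaborn draws the chart on a matplotlib axis, so calling `ax.figure.savefig(...)` with a filename of your choice would write the plot to disk.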
This event will be delivered virtually via Zoom and Slack, and the closing date for registration is 16 October 2022.
For more details, queries, and registration, please visit the Earlham Institute website!