Tip
Reproducibility: understand the principles and importance of reproducible data analysis.
Data manipulation: develop proficiency in R, including key data structures, packages, and functions to read, clean, transform, and organize datasets.
Data visualization: create informative and aesthetically pleasing visualizations of data.
Modelling and iteration: apply various algorithms and statistical models common to plant agriculture, and implement techniques to handle multiple datasets simultaneously.
Professional reporting: produce professional reports for sharing results.
Version control: manage the basics of Git and Github for collaborative projects.
.qmd) documentsNote
Expectations
- Use reproducible practices throughout (files, code, and narrative)
- Clear questions, clean data, and transparent methods
- Professional communication (figures, tables, interpretation)
Data Science: Extracting insights from data using algorithms and statistical methods.
Data Literacy: Skills to read, interpret, and analyze data.
Reproducibility: Ensuring analyses can be recreated by others.
Note
Why does reproducibility matter?
Trustworthy results,
transparency, &
collaboration in research.
It is the #1 skill-gap in the job market:

Is there a REPRODUCIBILITY CRISIS in science?
YES
A Nature survey with ~1,600 researchers found that
+70% failure rate to reproduce another scientistโs experiments
+50% have failed to reproduce their own experiments
Main causes: selective reporting, weak stats, code/data unavailability, etc.
Agriculture research relies heavily on environmental data, often variable and complex.
We have complex challenges ๐๏ธ
Opportunities โ
Limited capability to reproduce analyses & results
DATA are rarely shared, CODES even less
โBut it all starts with โฆโ
Ihaka, Gentleman
There are currently 23,052 of packages (on CRAN only).

dplyr, tidyr).
Tip
Note
ggplot2.
scikit-learn, TensorFlow.Note
| Feature | R | Python |
|---|---|---|
| Primary Strength | Statistics & Visualization | General-purpose, ML, AI |
| Performance | Moderate | Moderate |
| Licensing | GPL (core), MIT, BSD (some) | PSFL, highly permissive |
| Production Use | Limited by GPL | Very friendly for proprietary |
Tip




๐ฌ acorrend@uoguelph.ca
Assistant Professor
Pick Family Chair, Sustainable Cropping Systems
Rm 226, Crop Science Building

![]() |
||
![]() |
![]() |
