Detailed information about the course

[ Back ]

Advanced Data Management & Manipulation Using R


23 & 30 September 2021


Dr Pauline Fritsch, Coordinator, Doctoral program in Organismal Biology, UNINE

Dr Marta Bellone, Coordinator CUSO-DPEE


Dr. Jan Wunder, Wunder Consulting Wald (ZH)

Dr. Tina Cornioley, University of Zurich


The analysis of large data sets ("big data") is becoming increasingly important in science and elsewhere. In this course you will learn how to use R to manage and manipulate large data sets, i.e. to sort, merge, subset, aggregate and reshape data, including outlier detection and gap filling algorithms.

For advanced data manipulation, we are going to use novel developments such as dplyr ("A Grammar of Data Manipulation"), the pipe operator (%>%) for simpler R-coding and data.table for the fast aggregation of large data sets. Furthermore, we will have a closer look at R-data base connections, SQL queries and the creation of new data bases from R.

Depending on the course progress, there will be scope for individuals to work on small projects and/ or their own data sets.


Course outline:

  • Data structures
  • Data management (merge, sort, reshape,...)
  • "The data.table way" (data.table)
  • "The grammar of data manipulation" (dplyr)
  • Tidying up messy data (tidyr, NAs & outliers)
  • Databases (ODB)
  • Reporting (knitr)


The completion of an homework will be requested after the end of the course (deadline: TBA).


University of Neuchâtel


Full attendance and active participation The completion of a homework will be requested after the end of the course (deadline: TBA)


! Pre-requirements!

Familiarity with R before attending the workshop or previous attendance of an introductory course to R.

Bring your own laptop to the workshop with recent versions of R and R-Studio installed. Make sure that your laptop is properly connecting to the University of Neuchâtel or eduroam WLAN.


23 and 30 September 2021
9:00 - 17:00


In agreement with the organizers, this workshop will be held either onsite in Neuchâtel or online. You may then register without fear of cancellation!


Catherine Suarez



Travel expenses:

PhD students of the DPEE are eligible for the reimbursement of incurred travel expenses by train (half-fare card, and 2nd class).

The online reimbursement system is now in place, so once the course is over you will be able to request the reimbursement via MyCUSO, without sending the paperwork to me. More information: Reimbursement


Please contact the coordinator of the doctoral program (ecologie-evolution(at) BEFORE the beginning of the course. NO reimbursement of accommodation without the coordinator of the doctoral program agreement prior of the course. In case of overnight stay, please post the original tickets and original bills along with the reimbursement form to:

Catherine Suarez
Département de Génétique et évolution - Unité d'anthropologie
Sciences II, 30 quai Ernest-Ansermet, 1205 Genève

NO reimbursement of meal expenses


Registrations through the webpage of DPOB :

Deadline for registration 7.09.2021
Priority is given to PhD students of the DPEE and DPOB until 29 August 2021. After this deadline, first comes, first serves! 

8 places are reserved for DPEE participants

Registration Fees:

Free for participants enrolled in the CUSO Ecology & Evolution doctoral program.
In case of cancellations, before the deadline: free
Late cancellations or no-show: Administrative fee of 50 CHF will be demanded
Other participants: please contact the program coordinator at ecologie-evolution(at)



Deadline for registration
Joint activity joint
short-url short URL

short-url URL onepage