2020 - Remote

Slides, recordings and other resources for CorrelCon 2020.


Keynote (Johannes Müller

Our founder Johannes Müller will share insights on the current state and the future of Data4Good, CorrelAid e.V.'s vision for society, data and tech and how we strive to achieve it.

Data4Good Projects (Frie)

Frie Preu, Head of Operations, will introduce our mission, terms, conditions and timelines in Data4Good projects.

Data4Good Use Cases Sessions

Detecting Anomalies in Bee Hives with Machine Learning

Bees are crucial for the preservation of life on earth in its current form. We helped taking bee observation and research to the next step by making sensor data usable and building an alarm system that alerts beekeepers in case of emergencies. Diren, Alex and Alex will share how they trained machine learning models for anomaly detection and walk you through the process of conducting a Data4Good project.

Speakers: Diren (Bee Server BOB), Alex and Alex (CorrelAid e.V.)

Building interactive dashboards with R Shiny

RShiny is a technology enabling the visualisation of data in interactive dashboards. Based on the programming language R, it comes with server solutions and ready-to-use templates. Dennis and Antonia will present how the technology can be used to automate advocacy and marketing processes with Erlassjahr.de, an NPO fighting for debt release for countries in the Global South.

Speakers: Dennis and Antonia (CorrelAid e.V.)

Interest in the good life for the Elderly in rural areas

CorrelAid assisted Denk-Mal-Kultur e.V. in finding out who is interested in the good life for the elderly inrural areas by setting up an online survey and writing up a short report with key findings.

Speakers: Theresa and Sylvi (CorrelAid e.V.)

Data Dialogues - An Interview with the Berlin Chapter

Defining a Data4Good project scope is not easy. Liam Bailey presents the open data dialogue that can help social organizations to define their data challenges and crowdsources possible data strategies with a pool of data scientists.

Interview partners: Liam (CorrelAid e.V.), Nina (CorrelAid e.V.)

Solving Coding Challenges for Good

What our community can do became obvious in our 2020 coding challenge - join us to revisit the great submissions of our community members and get inspired by what data can do. Isabel Willmann, our Head of Community Management, will walk you through the different examples of using Data4Good.

Speakers: Isabel, Konrad, Steffen, Sarah, Drenizë, Pia, Jie, Benjamin, and Long (CorrelAid e.V.)


CorrelAid 2020 - Revisiting a Turbulent Year

Frie Preu, Head of Operations, will walk you through our journey through our 2020 milestones, achievements and challenges.

Speaker: Frie (CorrelAid e.V.)

Data Ethics: Gathering Data on Ethnicity, Gender and Sexuality

Topic Sessions

Data4Good: How do we leverage data to serve the common good?

Data in the social sector is often still synonymous with impact measurement. In this topic session, we will discuss how to make the social sector and policies more efficient and effective using data by both enabling data-driven decision-making and completely rethinking processes by using data and tech. We will also talk about dangers and pitfalls of using data.

Speakers: Johannes (CorrelAid e.V.), Igor (Phineo), Moderator: Nina (CorrelAid e.V.)

Education: What makes a good data scientist?

The field of data science is broad: Job descriptions refer to data analysts, data scientists and data engineers and the list of technologies is growing fast. For young professionals, it is difficult to navigate the field. In this topic session, we aim to answer questions related to building a career in data science: ‘Which career paths are open to young professionals and how do they differ? Which key characteristics does a professional working in data possess? Which programming languages should prospective applicants focus on? Which technologies are in high demand? What are the go-to sources to stay ahead of current developments in tech?’ Get insights to answer your own questions in the 30min moderated discussion starting from 2PM.’

Speakers: Joseph (DataQuest), Oleg (NextRound.cc), Moderator: Isabel (CorrelAid e.V.)

Careers: Real Careers in Data Science

Sometimes finding a career is done by speaking to the people who already built it: Listen to members of our community present the variety of careers that can be found in data and listen to how they decided and succeeded on their respective paths. Some but not all examples include: Data journalism, research, tech companies...

Speakers: Yannik, Sylvi, Damon (CorrelAid e.V.), Moderator: Manuel (CorrelAid e.V.)

Beginner session: Introduction to R

The R workshop gives an introduction to base R covering functions to handle and visualize different variable types (bar and pie plots, scatterplots, histograms). In the second part, we look at more advanced plots(different visualization for different groups of respondents) and briefly cover the estimation and visualizationof linear regression models. It would be nice if everyone had R and RStudio installed (both are freely available under http://www.cran.r-project.org/ and http://rstudio.org/download/desktop).

Speaker: Mirka Henninger (Postdoctoral Researcher at the University of Zurich)

Beginner session: Introduction to Python

In this workshop, Henry will give an introduction to Python and its ecosystem for data science. In this session he tries to present key features of python and hint to further resources. It won't be an extensive introduction into the syntax and data structures, but a broad overview which should give you an impression of 'python' - so that you are able to decide if you want to go further.

Speaker: Henry Webel (PhD fellow in the Copenhagen Bioscience PhD Program)

Beginner session: Talking Shiny

Shiny apps offer sophisticated tools to generate web applications for a variety of needs such as visualizing(research) results or making data sets easily accessible. The workshop will guide you through the necessary steps to set up your own ShinyApp and discuss lessons learned from building a ShinyApp using Shiny for R, shinydashboards, and echarts4r.

Speaker: Cosima Meyer (Researcher at the University of Mannheim

Advanced Session: Data Visualization

Building a modularized Shiny App with {golem} and htmlwidges (Cédric Scherer)

In this talk, Cédric introduces you to modularized Shiny apps and a library called {golem} that simplifies the process. He will also give an overview of html widgets that are useful for designing your app and allow for interactive charts and an overall nice user experience. He is going to demonstrate the possibilities using our Shiny app from the LC Berlin that recently won the CorrelAidX challenge.

Reproducible Data Availability Visualization with Github Pages (Lisa Reiber)

An unfamiliar dataset or a 500-piece puzzle have (at least) one thing in common: in the beginning it is hard to find the things you are looking for. While codebooks with metadata usually provide a first overview on available sample sizes, they also tend to be dry and boring matters. Let’s make metadata easier on the eye. In her talk, Lisa will show you how she used RMarkdown and GithubPages to visualize the data availability for some of the Socio-Economic Panel data. The website is reproducible and helped her collaborators who were not familiar with the data to get started with the research design quickly, because it was easy to find out which variables would be suitable for analysis to begin with.

Visual narratives to connect: data visualization for and with people (Federica Fragapane)

As information designer, I constantly explore the possibilities that designing a piece opens in terms of creation of a connection with the people. A constant visual experimentation has a significant role in such research.During the talk I’ll show the process behind a set of data visualizations with a common ground of interest for the relationship between designer and people: not only as readers, but also as narrators. I’ll share the design phases and the lessons learnt throughout the processes.

Advanced Session: Data Engineering

Data Engineering in Google Spreadsheets with Google Script (Ilja Sperling)

Spreadsheets are an entry drug to data but can become messy real quick - especially when at scale. For robust data outputs we need clean and structured data. But how to get there without a 500$ Tableau team subscription? JavaScript (and Metabase) to the rescue! Ilja will walk us through his journey of low-Budget data engineering with Google Apps Script, R, and the Google Spreadsheet API.

Building Individual Data Products for Social Good with Python, PostgresQL and Vue.js (Jan Dix)

Jan from cause&effect will share how they use open source technologies to build individual data products. His motto: Don’ t be MEAN, use PPV.

AutoML in the wild (Gesa Müller)

Gesa will share insights into her job at Gpredictive where she uses technologies like Spark and Scala to build automated data pipelines.

Advanced Session: Modelling

DISH-O-TRON: A fun experiment to build your own AI product (Marcel Mikl & Oli Moser)

Solving real-world problems with AI often times starts with the realization that there is no data available.In this talk we will show how we tackled the problem of 'mysteriously appearing dirty dishes in the community kitchen sink' by building an AI-system from scratch. You can read more about the DISH-O-TRON here.

Using machine learning to detect anomalies in beehives (Alexander Goncharskiy, Diren Senger, Alexandros Melemenidis)

Bees are crucial for the preservation of life on earth in its current form. We helped taking bee observation and research to the next step by making sensor data usable and building an alarm system that alerts beekeepers in case of emergencies. Diren and Alexander will share how they trained machine learning models for anomaly detection.

The political discourse on discrimination: analysing 70 years of the Bundestag (Alexandra Wörner)

As the German constitution states, the sittings of the Bundestag need to be public which is why the plenary protocols are openly accessible. These massive amounts of text which have accumulated over70 years are not manageable anymore without machine assistance. Taking the example of discrimination, we are going to look into which insights we can derive using several techniques from Natural Language Processing.

Advanced Session: Geospatial Data

Introduction to Geodata in R (Michael Matiu)

Michael will give an introduction into fundamentals of spatial data and introduction into basic R packages for spatial data.

#30daymapchallenge: (a little less than) 30 different ways to create interactive maps with R (Alexandra Kapp)

Alex will share an exclusive preview of her results for the #30daymapchallenge. Each day is dedicated to an idea or a package about maps: Using Deck.gl in R, viz of most used pathways, getting OSM data straight into R with a single line of code, creating 3D maps, draggable polygons in leaflet, origin destination flows, voronoi maps, etc

Introduction to PostGIS: PostgreSQL going spatial (Malte Kyhos)

Spatial data is often encountered in the form of flat files, such as shapefiles. However, the integration of such data into relational databases may offer several advantages in complex scenarios. In this talk,Malte will provide a brief introduction and overview to PostGis, a spatial extension for PostgreSQL.

CorrelAidX Pecha Kucha

Meet our local chapters through this engaging presentation format and see where you could find a chapter that might work for you.

Speakers: Sylvi (Berlin), Lukas (Bremen), Sarah & Drenizë (Hamburg), Florian & Pia (München), Liubov (Paris), Jacob (Netherlands),Phil& Tilman (Konstanz), Konrad & Johannes & Alex & Sophie (Rhein-Main), Joshua (Mannheim)

LC Cologne

LC Hamburg

LC Bremen

Last updated

The CorrelAid Docs are licensed under CC-BY 4.0.