Meet up

Causal inference and the data-fusion problem

Tuesday, 18th June at CodeNode, London

This meetup was organised by London Data Science Journal Club in June 2019

The proliferation of Big Data systems means that there is an increasing amount of data available to data scientists, but relatively little of it is collected in a controlled fashion; most of it is purely observational.

Pearl’s do-calculus offers a way, given a causal model, to get the benefits of randomised controlled trials from purely observational data. This paper proposes a theoretical solution to the problem of combining data from heterogeneous sources collected under different selection criteria, and outlines how to correct for confounding bias and selection bias.
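
As a rough illustration of the idea (not taken from the paper), the sketch below applies the back-door adjustment formula, P(y | do(x)) = sum over z of P(y | x, z) P(z), which is one well-known consequence of the do-calculus when an observed variable Z blocks every back-door path from X to Y. The toy model (binary confounder Z, treatment X, outcome Y), all of its probabilities, and the helper names are assumptions made up for this example.

```python
from itertools import product

# Hypothetical toy model, invented for illustration: Z -> X, Z -> Y, X -> Y.
P_Z1 = 0.5                       # P(Z = 1)
P_X1_GIVEN_Z = {0: 0.2, 1: 0.8}  # P(X = 1 | Z = z)


def p_y1_given_xz(x, z):
    """P(Y = 1 | X = x, Z = z); the true effect of X on Y is 0.3."""
    return 0.1 + 0.3 * x + 0.4 * z


def bern(p, value):
    """Probability that a Bernoulli(p) variable takes the given value."""
    return p if value == 1 else 1.0 - p


# The observational joint distribution P(z, x, y), i.e. what passive data
# collection would reveal in the limit of infinite samples.
joint = {
    (z, x, y): bern(P_Z1, z)
    * bern(P_X1_GIVEN_Z[z], x)
    * bern(p_y1_given_xz(x, z), y)
    for z, x, y in product((0, 1), repeat=3)
}


def prob(**fixed):
    """Probability of the event described by keyword values for z, x, y."""
    total = 0.0
    for (z, x, y), p in joint.items():
        values = {"z": z, "x": x, "y": y}
        if all(values[name] == v for name, v in fixed.items()):
            total += p
    return total


def naive(x):
    """P(Y = 1 | X = x): conditioning only, confounded by Z."""
    return prob(x=x, y=1) / prob(x=x)


def backdoor(x):
    """P(Y = 1 | do(X = x)) via back-door adjustment over Z."""
    return sum(
        prob(z=z, x=x, y=1) / prob(z=z, x=x) * prob(z=z) for z in (0, 1)
    )


naive_effect = naive(1) - naive(0)
adjusted_effect = backdoor(1) - backdoor(0)
print(round(naive_effect, 2), round(adjusted_effect, 2))
```

In this toy model the naive contrast comes out at 0.54, while the adjusted contrast recovers the 0.3 built into the model. The adjustment only works here because Z is observed and the assumed causal graph is correct; the paper deals with the harder settings where selection bias and multiple data sources are also involved.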

Additional materials:

Background reading (for wider context, not specifically covered in the session):

A note about the Journal Club format:

  1. The sessions usually start with a 5-10 minute introduction to the paper by the topic volunteer, followed by splitting into smaller groups to discuss the paper and other materials. We finish the session by coming together for about 15 minutes to discuss what we have learned as a group and ask questions around the room.
  2. There is no speaker at Journal Club. A member of the community has volunteered their time to suggest the topic and start the session, but most of the discussion comes from within the groups.
  3. You will get more benefit from the session if you read the paper or other materials in advance. We try to provide (where we can find them) accompanying blog posts, relevant code and other summaries of the topic to serve as entry points.
  4. If you don't have time to do much preparation, please come anyway. You will probably have something to contribute, and even if you just end up following the other discussions, you can still learn a lot.
  5. It's OK just to read the blog post or watch the video :)
  6. We don't have spare copies of the paper during the session, so please print out your own if you want a hard copy for discussion. For digital copies, you are welcome to use your laptops/tablets/phones during the session.
