You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 4 Next »

For the outreach programme LHCb Masterclass students from secondary schools are invited to analyze a sample of D→ K− pidecays as collected from the LHCb experiment to measure the lifetime of the Dmeson.

The data used for this exercise are public and can be obtained from the Open Data portal of CERN.

In this tutorial we repeat the analysis designed for the LHCb Masterclass, using Python and ROOT in order to show how the most common operations in data analysis can be performed within such a framework.

We will take the opportunity to apply some machine learning, this is not part of the original excercise, but it is worth to include an example on how to use Keras and Tensorflow to separate signal and background. This is not a lecture on machine learning: several basic and important aspect of a machine learning problem are ignored here, for example we do not split the data in training and test samples. From a software perspective, it should be trivial to extend the example to include a more careful treatment of the neural network training and application.

The website of the LHCb International Masterclass, where the excercise is shortly explained can be found at this link.


Static jupyer notebook: https://github.com/landerlini/MLINFN-TutorialNotebooks/blob/master/LHCbMasterclassExplained.ipynb


Requirements 

To run this exercise you will need python3, tensorflow 1.x and PyROOT for python3. 

Contents

With this tutorial, we will introduce the following topics:

  1. Download data with jupyter via http 
  2. Exploring a dataset with pandas 
  3. Exploring a dataset with matplotlib
  4. Obtaining high quality plots with ROOT
  5. Modelling the data with RooFit 
  6. Perform a per-event subtraction of the background using sPlot
  7. Training a simple neural network on nTuple data with keras
  8. Evaluate the performance of the trained algorithm
  9. Apply the neural network to data
  10. Studying systematic effects induced by the neural network


  • No labels