Running Art Tutorial: Difference between revisions

Revision as of 17:38, 4 June 2019

Tutorial Session Goal

In this Tutorial you will learn how to run the Mu2e 'art' framework executable (mu2e), both interactively and on the grid.

Session Prerequisites and Advance Preparation

Perform the Tutorial on setting up the Mu2e Offline

Session Introduction

Art is a software framework for processing events with modular code with lots of run-time configurability. Art is controlled by scripts in a dedicated configuration language called fhicl (.fcl suffix). Art uses rootIO to store events.

This tutorial will cover how to build and run several different kinds of art jobs, and how to use the mu2e job tools to divide large projects into many separate jobs, and how to run those jobs in parallel on Fermigrid or the OSG (open science grid).

Exercises

Exercise 1: Running a simple module (Hello, Tutorial!) and basic FHiCL

In this exercise, we will run a simple module that will print a welcoming message.

First set up to run Offline
The executable we use to run Offline is "mu2e." Use the --help option to display all the command line options
FHiCL files (*.fcl) tell Offline what to do. We specify the fcl file we want to use every time we run Offline using the "-c" option. We will now run a simple job that prints a hello message using a premade fcl file.
We can now explore the hello.fcl file that configured this Offline job to see how it works.

In FHiCL, we make definitions using the syntax "variable : value". A group of definitions can be combined into a table by surrounding with braces {}.
Like in C++, we can refer to FHiCL code in other files by using "#include" statements
After defining the process name, you will see three main tables: source, services, physics
"source" configures the inputs to the job. If we are making new events from scratch, we use "EmptyEvent". If we are building on top of old files, we might use "RootInput." You can also see that this job is configured to run 3 events by default
"services" configures parts of art that are common to all modules, like the geometry, detector conditions, or ability to print out to files
"physics" is where we configure the modules that do all the work. There are "producer" modules that creates data, and "analyzer" modules that read data and might make things like analysis TTrees. There are a couple different sections to the physics table. First we declare our producer and analyzer modules, then we define our "paths" (see below), and then we tell Art which paths we want to run.
Just adding a module to physics doesn't mean art will run it. It is like defining a function in c++ without calling it. To make the module run, we must tell art the list of modules and the order we want to run them in. We do this by defining a variable called a path to be this list of module names. Here there are two paths, p1 (which is empty), and e1. We then tell Art which paths to run using the definitions of "trigger_paths" and "end_paths". Producers (and filters) go in trigger paths, analyzers go in end paths.

You can see more detail about FHiCL at https://mu2ewiki.fnal.gov/wiki/FclIntro or check out the Art workbook and user guide chapter 9 (https://art.fnal.gov/wp-content/uploads/2016/03/art-workbook-v0_91.pdf)

Exercise 2: Module configuration with FHiCL

We will now see how to modify FHiCL to run different modules and even configure those modules at runtime

We have a new fcl file, hello2.fcl, try running that.
We can look at the source code for HelloWorld2 to see how we change Magic number.
Configure fcl to set Magic number to 5 by adding a line "magicNumber : 5" under module_type. Run the fcl again to check that it changed
You can also add this configuration to the end of the fcl file by using the full parameter location, i.e.
Finally, try running both this module and the original HelloWorld module by adding the module declaration from hello.fcl and adding it to your end_path. If you need help, check $TUTORIAL_BASE/solutions/hello2.fcl

Exercise 3: Using a more realistic Mu2e fcl to simulate an event

To fully simulate an event in Mu2e, we will need to run many more modules and services. Modules can become dependent on output from previous modules and may require certain services to be set up, so the final FHiCL for a functioning Mu2e Offline job ends up being somewhat complex. To help make things easier, we use a few FHiCL tricks.

Lets look at an example script to produce conversion electron events
At the top you will see a #include line. Lets look at the file it is including
You can see that this file includes several more files, and then starts with BEGIN_PROLOG. Prolog files are just a bunch of FHiCL definitions that then can be used later. You can see for example it defines a table called Primary.producers

Most directories in Offline have a prolog.fcl file that provide standard definitions for their modules and folders

In JobConfig/fcl/prolog.fcl (and in fcl/CeEndpoint.fcl) you will see several definitions using "@local" or "@table". This is how you reference a previously defined value (for example something defined in a prolog).
- @local references a standard definition (for example line 16 the definition of a module)
- @table references a table of several definitions but without the curly braces (for example line 18 adds several more module definition name:value pairs to the producers table)
- @sequence references a list of values separated by commas like for a path (for example line 62, each sequence adds the name of several modules to this path)
- For more details read https://mu2ewiki.fnal.gov/wiki/FclIntro
Back in fcl/CeEndpoint.fcl, you should see @table::Primary.producers, which we found in JobConfig/fcl/prolog.fcl. See if you can find out where the generate module in this fcl comes from and what it is running.
You can debug a complicated FHiCL with lots of #includes using the Offline option --debug-config. This will fully process the script, and print out to a file the results with all the @local etc. references made explicit. Lets try with our CeEndpoint.fcl
Finally, run fcl/CeEndpoint.fcl and generate 10 events

Exercise 4: Exploring Offline outputs

The above exercise should produce two files, dig.owner.CeEndpoint.version.sequencer.art and nts.owner.CeEndpoint.version.sequencer.root (also located in $TUTORIAL_BASE/RunningArt/data). Both are actually root files, but they contain different information. The .root files produced by Offline are used for diagnostic histograms and TTrees, and analysis output like TrkAna that can be used in a normal root analysis. The .art files contain the actual c++ objects Offline uses to describe the event (both simulation information and reconstructed information), and so are in general meant to be processed by other Offline jobs.

Open both files in a root TBrowser to see their contents
We can use Offline modules to better understand the .art file contents.

Exercise 5: Create your own primary production job

The JobConfig directory has the base scripts for all the production jobs, and so has examples of how to correctly configure most kinds of Offline jobs. If you need to do anything different, you can usually start with a JobConfig script and modify it slightly.

JobConfig/primary has scripts to generate primary only (no backgrounds) events without doing reconstruction
JobConfig/mixing has scripts to generate primary plus background events
JobConfig/reco has scripts to take output of the previous two and run reconstruction
These scripts are designed to be used with grid production scripts, and so don't include configuration of the random seed. To run the scripts as is, you will need to add two lines:

Lets try to make our own fcl script now. See if you can make and run a script to run primary only events with 100 MeV photons. Start by looking at $MU2E_BASE_RELEASE/EventGenerator/fcl/prolog.fcl, which has the default configuration for all of the event generators. There should be one called photonGun.

Tip: start by copying $MU2E_BASE_RELEASE/JobConfig/primary/CeEndpoint.fcl and replace the generator.
Don't forget to add the seed configuration!

Now see if you can turn on the StrawDigi diagnostic output. Looking at $MU2E_BASE_RELEASE/TrackerMC/src/StrawDigisFromStepPointMCs_module.cc, you will see a FHiCL parameter called diagLevel. Try increasing it in your script from 0 to 2.
Run your script and check the output .root file, you should see a new TDirectory with the StrawDigi diagnostics.

If you need help, check solutions/photongun.fcl

Exercise 6: Running event reconstruction

FIXME need non mixing script
Use output of exercise 4 or $TUTORIAL_BASE/RunningArt/data/dig.owner.CeEndpoint.version.sequencer.art

mu2e -c JobConfig/fcl/mcdigis.fcl --source $TUTORIAL_BASE/RunningArt/data/dig.owner.CeEndpoint.version.sequencer.art

Run dumpDataProducts.fcl on the dig.*.art and the mcs.*.art files and compare

Exercise 7: Running TrkDiag to create TrkAna TTrees

mu2e -c TrkDiag/fcl/TrkAnaReco.fcl --source-list files.txt

Exercise 8: Using generate_fcl to prepare to run jobs on the grid

When you want to simulate a large number of events, you will need to split your work up over multiple Offline jobs, and you will often want to run them on the Fermilab grid. There is a series of scripts to help you set up your fcl files and to start and track your jobs. The first problem is creating the fcl scripts. If you want to generate 10000 conversion electron events for example but only want to run 500 in a single job, you will need 20 fcl files that do the same thing - but each will need to have a different random seed, and each will need to output to different file names! The script generate_fcl takes a base fcl and will build these 20 files for it.

The generate_fcl tool is in mu2etools so you will first need to setup that UPS product
In the $MU2E_BASE_RELEASE/JobConfig/examples directory, there are scripts to call generate_fcl for each of the fcl files in JobConfig/primary,JobConfig/mixing, and JobConfig/reco. Lets look at $MU2E_BASE_RELEASE/JobConfig/examples/generate_CeEndpoint.sh

You can see from the --include option that this script is using JobConfig/primary/CeEndpoint.fcl as the base fcl, and will be used to produce 2000000 events (200 jobs of 10000 events each)
The description, dsconf, and dsowner options configure the output filenames. If you are simulating a large production that will be widely used by others, it is important that the output filenames follow specific patterns, which is what generate_fcl tries to enforce. You can read more about the Mu2e filename patterns at https://mu2ewiki.fnal.gov/wiki/FileNames
generate_fcl will by default create directories named 000,001,002, etc. and put 1000 fcl files in each. In this script you will see that the 000 folder is automatically renamed to CeEndpoint
More detailed documentation for generate_fcl is located at https://mu2ewiki.fnal.gov/wiki/GenerateFcl

Run this script to produce 200 jobs

There should now be a CeEndpoint directory. If you ls inside it, you will see many .fcl files and .fcl.json files
The .json files are used during official productions to keep track of the provenance of every file produced, you can ignore them for now
If you look at one of the .fcl files, you will see it just #includes our base fcl, and the only change from file to file is a reconfiguration of the seed and output filenames.

Look at scripts/generate_reco-CeEndpoint-mix.sh (This is a slightly modified version of JobConfig/examples/generate_reco-CeEndpoint-mix.sh). Instead of njobs and events-per-job we now just provide a --inputs argument with the name of a file containing filenames of art files to read from, and a --merge-factor parameter. Normally reconstruction is fast enough that in one job you can process the output of many simulation jobs. The merge-factor configuration tells generate_fcl how many art files each reco job will use.
Run this script and explore the output

Since data/CeEndpoint-mix.txt had 60 entries and we have a merge-factor of 20, we will produce 3 output fcl files.

Exercise 9: Adding backgrounds with event mixing

So far when we've been simulating Mu2e events, we've only included the primary signal. But in the real experiment any signal event will sit on top of a lot of backgrounds from the beam etc. Simulating backgrounds from the beam takes a very long time, so they are done in separate jobs. The outputs of those dedicated background simulations can then be put on top of a signal only simulation to create something more realistic - this is called event "mixing".

First lets look at a production mixing script: open up JobConfig/mixing/CeEndpointMix.fcl and JobConfig/primary/CeEndpoint.fcl side by side and compare. You can see that by using prolog we've kept the scripts looking basically identical.
We want to randomize which background events are mixed on top of each signal event - we can do this using the generate_fcl script. If you ls data/*.txt you will see files for several different types of backgrounds. Each is a list of filenames of background only simulation art files. Open scripts/generate_CeEndpointMix.sh. It looks similar to the non-mixing version but now has several --aux-input values which point to these text files.
Build a set of mixing jobs:

Find out more about mixing here https://mu2ewiki.fnal.gov/wiki/Mixing

Exercise 10: Submitting grid jobs with mu2eprodsys

setup mu2egrid

wfproject
setup vs code tarball
fcl on pnfs vs tarball

mu2eprodsys --dry-run

Exercise 11: Running the event display

Reference Materials

https://mu2ewiki.fnal.gov/wiki/Code
art workbook
various DocDBs that reference production, satellite release, partial checkout, etc.

@@ Line 185: / Line 185: @@
 I have not put the actual background .art files in the docker so you will not be able to run a mixing job here. When you do run a mixing job on the Fermilab machines, one thing to note is that they will take much much longer than a normal simulation job. Simulating 500 conversion electrons for example can take hours to a full day. By looking at the JobConfig/example scripts you can get a sense of how many events to run in each kind of job so they dont end up taking too long.
 </ol>
+* Find out more about mixing here https://mu2ewiki.fnal.gov/wiki/Mixing
 === Exercise 10: Submitting grid jobs with mu2eprodsys ===

Running Art Tutorial: Difference between revisions

Revision as of 17:38, 4 June 2019

Contents

Tutorial Session Goal

Session Prerequisites and Advance Preparation

Session Introduction

Exercises

Exercise 1: Running a simple module (Hello, Tutorial!) and basic FHiCL

Exercise 2: Module configuration with FHiCL

Exercise 3: Using a more realistic Mu2e fcl to simulate an event

Exercise 4: Exploring Offline outputs

Exercise 5: Create your own primary production job

Exercise 6: Running event reconstruction

Exercise 7: Running TrkDiag to create TrkAna TTrees

Exercise 8: Using generate_fcl to prepare to run jobs on the grid

Exercise 9: Adding backgrounds with event mixing

Exercise 10: Submitting grid jobs with mu2eprodsys

Exercise 11: Running the event display

Reference Materials

Navigation menu

Running Art Tutorial: Difference between revisions

Revision as of 17:38, 4 June 2019

Tutorial Session Goal

Session Prerequisites and Advance Preparation

Session Introduction

Exercises

Exercise 1: Running a simple module (Hello, Tutorial!) and basic FHiCL

Exercise 2: Module configuration with FHiCL

Exercise 3: Using a more realistic Mu2e fcl to simulate an event

Exercise 4: Exploring Offline outputs

Exercise 5: Create your own primary production job

Exercise 6: Running event reconstruction

Exercise 7: Running TrkDiag to create TrkAna TTrees

Exercise 8: Using generate_fcl to prepare to run jobs on the grid

Exercise 9: Adding backgrounds with event mixing

Exercise 10: Submitting grid jobs with mu2eprodsys

Exercise 11: Running the event display

Reference Materials

Navigation menu

Search