Datasets-SU2020-Naming-Conventions

Latest revision as of 20:00, 2 March 2022

Dataset description on GitHub

  • https://github.com/Mu2e/su2020/blob/master/doc/dataset_summary.org (Note: Mu2e/su2020 is a private repository, so you must be logged in to GitHub and be a member of the GitHub Mu2e organization to see it. Otherwise GitHub reports that the repository does not exist.)

(work in progress)

Naming Conventions

Mu2e file names are required to have six dot-separated fields; see https://mu2ewiki.fnal.gov/wiki/FileNames for more details. Example:

 cnf.mu2e.dddddddddd.su2020.rrrrrr_ssssssss.art
  • Use MC run number 1000 for all datasets.
  • Fix the 4th field to su2020 (lowercase); this makes it easy to find all su2020 datasets in the SAM catalog.
  • Together, the first two items fix 5 fields out of 6.
  • It makes sense to treat the remaining, 3rd, field as a 10-character dataset ID. A fixed-length dataset ID is handy
    in many respects: we will need to propagate it through the whole analysis chain, from the MC files through the output
    of reconstruction to ntuples and histogram files.
  • In addition to the dataset ID, a dataset has to have a long descriptive name, but it is the dataset ID (dsid) that is supposed
    to be used for all practical purposes in the software. The dataset ID has the form:
 DDDDVsSObB  (lowercase characters are literal; uppercase characters are placeholders, interpreted as follows)
 DDDD: 4-character name stub (4 characters appear sufficient to give meaningful names to all datasets we have dealt with so far)
 V   : dataset version, e.g. 0, 1, 2, ... (in principle, it could be anything and be used for any purpose)
 S   : stage index; some datasets require multi-stage generation. To mark a clean and visible cut, S=a can be assigned to reconstruction
 O   : output stream index; for stages with several output streams, just number them: 0, 1, 2, 3, 4, ...
 B   : beam intensity: 0 = signal only, 1 = 1-batch mode, 2 = 2-batch mode; the rest are reserved for future use
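
The DDDDVsSObB layout can be checked mechanically. The following is only a minimal Python sketch, not part of any Mu2e package: the names DSID_RE, DatasetId and parse_dsid are hypothetical, and the allowed character sets (lowercase letters for the stub, a digit or lowercase letter for V and S, a single digit for O and B) are assumptions inferred from the description and the examples on this page.

```python
import re
from typing import NamedTuple

# Hypothetical pattern for the 10-character dataset ID DDDDVsSObB.
# The character classes are assumptions, not an official specification.
DSID_RE = re.compile(
    r"(?P<stub>[a-z]{4})"     # DDDD: 4-character name stub
    r"(?P<version>[0-9a-z])"  # V: dataset version
    r"s(?P<stage>[0-9a-z])"   # literal 's', then S: stage index (e.g. 1, a, r)
    r"(?P<stream>[0-9])"      # O: output stream number
    r"b(?P<beam>[0-9])"       # literal 'b', then B: beam intensity
)


class DatasetId(NamedTuple):
    stub: str
    version: str
    stage: str
    stream: int
    beam: int


def parse_dsid(dsid: str) -> DatasetId:
    """Split a dataset ID into its fields, or raise ValueError."""
    m = DSID_RE.fullmatch(dsid)
    if m is None:
        raise ValueError(f"not a DDDDVsSObB dataset ID: {dsid!r}")
    return DatasetId(m["stub"], m["version"], m["stage"],
                     int(m["stream"]), int(m["beam"]))
```

For instance, parse_dsid("bmum1s10b0") yields stub bmum, version 1, stage 1, output stream 0 and beam intensity 0. Version and stage are kept as strings because the text allows non-numeric values such as S=a.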
  
 examples:
|------------------------+---------------------------------------------------|
| 4 char dataset ID stub | comments, examples                                |
|------------------------+---------------------------------------------------|
| beam                   | beam, trace everything                            |
| bmum                   | bmum : beam, trace mu-, but not e+,e-, gamma      |
|                        | bmum1s10b0 : s1, 0=first output stream - into TS  |
|                        | bmum1s20b0 :                                      |
|                        | bmum1s30b0 :                                      |
| bpim                   | beam pi- tracing                                  |
| mdio                   | DIO electrons,                                    |
|                        | examples: mdio0s10b0: leading order               |
|                        |           mdio1s10b1: leading log,+ 1 batch       |
| mumi                   | mu-, example : mgun0sr1b0 : muon particle gun     |
| mupl                   | mu+                                               |
| elec                   | e- (i.e. single particle gun)                     |
| posi                   | e+                                                |
| pipl                   | pi+                                               |
| pimi                   | pi-                                               |
| pbar                   | pbars example: pbar0s01b0 (reconstructed)         |
| rpce                   | RPC external                                      |
| rpci                   | RPC internal                                      |
| rmce                   | RMC external                                      |
| rmci                   | RMC internal                                      |
| cosm                   | cosmics                                           |
| prot                   | ejected protons                                   |
| deut                   | ejected deuterons                                 |
| phot                   | ejected photons                                   |
|------------------------+---------------------------------------------------|
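
Putting the conventions together, a full file name could be assembled from a dataset ID as sketched below. This is a hedged illustration only: make_file_name and dsid_from_file_name are hypothetical helpers, the reading of rrrrrr_ssssssss as a zero-padded run and subrun number is an assumption based on the field widths in the example, and the first, second and last fields are simply copied from the example name.

```python
def make_file_name(dsid: str, subrun: int, run: int = 1000,
                   tier: str = "cnf", ext: str = "art") -> str:
    """Build tier.mu2e.<dsid>.su2020.<run>_<subrun>.ext (sketch only)."""
    if len(dsid) != 10:
        raise ValueError(f"dataset ID must be 10 characters long: {dsid!r}")
    # Assumption: run and subrun are zero-padded to 6 and 8 digits.
    return f"{tier}.mu2e.{dsid}.su2020.{run:06d}_{subrun:08d}.{ext}"


def dsid_from_file_name(name: str) -> str:
    """Return the 3rd field of a six-field su2020 file name."""
    fields = name.split(".")
    if len(fields) != 6:
        raise ValueError(f"expected 6 dot-separated fields: {name!r}")
    if fields[3] != "su2020":
        raise ValueError(f"4th field is not su2020: {name!r}")
    return fields[2]


if __name__ == "__main__":
    name = make_file_name("bmum1s10b0", subrun=1)
    print(name)
    print(dsid_from_file_name(name))
```

Because the dataset ID always sits in the 3rd field and has a fixed length, it can be recovered from any file name without consulting a catalog.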
