Datasets-SU2020-Naming-Conventions: Difference between revisions

From Mu2eWiki
Jump to navigation Jump to search
No edit summary
(Add note about how to access the private su2020 repo on github.)
 
(3 intermediate revisions by one other user not shown)
Line 1: Line 1:
(work in progress)  
(work in progress)  
=== Naming Conventions ===


Mu2e files are  required to have 6 field-long names, see https://mu2ewiki.fnal.gov/wiki/FileNames for more details, example:
Mu2e files are  required to have 6 field-long names, see https://mu2ewiki.fnal.gov/wiki/FileNames for more details, example:


   cnf.mu2e.su2020.dddddddd.rrrrrr_ssssssss.art
   cnf.mu2e.dddddddddd.su2020.rrrrrr_ssssssss.art


* Let's use MC run number 1000 only for all datasets
* Let's use MC run number 1000 only for all datasets
* lets fix the third field to su2020 (lowcased) - that would allow to find handily for all su2020 datasets  in SAM catalog
* lets fix the 4-th field to su2020 (lowcased) - that would allow to find handily for all su2020 datasets  in SAM catalog
* 1+2 fix 5 fields out of 6.  
* 1+2 fix 5 fields out of 6.  
* It makes sense to treat the remaining field as a 8 char long dataset ID. Having fixed length dataset ID is handy  
* It makes sense to treat the remaining, 3-rd, field as a 10 char long dataset ID. Having fixed length dataset ID is handy  
: in many respects - we will need to propagate it through the whole analysis chain, from the MC files through the output  
: in many respects - we will need to propagate it through the whole analysis chain, from the MC files through the output  
: of reconstruction to ntuples and histogram files.
: of reconstruction to ntuples and histogram files.
Line 14: Line 16:
: to be used for all practical purposes in the software. Ok, so
: to be used for all practical purposes in the software. Ok, so


   dddddddd = DDDDVSOB
   DDDDVsSObB  (lowercased characters stay as is, uppercase - interpreted)


   DDDD: 4-character-long name stub (it looks that 4 chars would allow us to name meaningfully  all datasets we have so far dealt with
   DDDD: 4-character-long name stub (it looks that 4 chars would allow us to name meaningfully  all datasets we have so far dealt with
Line 35: Line 37:
| bpim                  | beam pi- tracing                                  |
| bpim                  | beam pi- tracing                                  |
| mdio                  | DIO electrons,                                    |
| mdio                  | DIO electrons,                                    |
|                        | examples: dioe01s5: leading order                 |
|                        | examples: mdio0s10b0: leading order               |
|                        | dioe02s5: leading log                             |
|                        |           mdio1s10b1: leading log,+ 1 batch      |
| mumi                  | mu-, example : mumi01s5 single particle gun muons |
| mumi                  | mu-, example : mgun0sr1b0 : muon particle gun     |
| mupl                  | mu+                                              |
| mupl                  | mu+                                              |
| elec                  | e- (i.e. single particle gun)                    |
| elec                  | e- (i.e. single particle gun)                    |
Line 43: Line 45:
| pipl                  | pi+                                              |
| pipl                  | pi+                                              |
| pimi                  | pi-                                              |
| pimi                  | pi-                                              |
| pbar                  | pbars example: pbar0110 (reconstructed)           |
| pbar                  | pbars example: pbar0s01b0 (reconstructed)         |
| rpce                  | RPC external                                      |
| rpce                  | RPC external                                      |
| rpci                  | RPC internal                                      |
| rpci                  | RPC internal                                      |
Line 55: Line 57:
</pre>
</pre>


=== Dataset description on GitHub ===
* https://github.com/Mu2e/su2020/blob/master/doc/dataset_summary.org  (Note: Mu2e/su2020 is a private repository, so you must be logged in to GitHub and be a member of the GitHub Mu2e organization in order to see it. Otherwise GitHub pretends the repository doesn't exist.)
=== Links ===
*
-----
-----
[[Category:Datasets]] [[Category:SU2020]]
[[Category:Datasets]] [[Category:SU2020]]

Latest revision as of 20:00, 2 March 2022

(work in progress)

Naming Conventions

Mu2e files are required to have 6 field-long names, see https://mu2ewiki.fnal.gov/wiki/FileNames for more details, example:

 cnf.mu2e.dddddddddd.su2020.rrrrrr_ssssssss.art
  • Let's use MC run number 1000 only for all datasets
  • lets fix the 4-th field to su2020 (lowcased) - that would allow to find handily for all su2020 datasets in SAM catalog
  • 1+2 fix 5 fields out of 6.
  • It makes sense to treat the remaining, 3-rd, field as a 10 char long dataset ID. Having fixed length dataset ID is handy
in many respects - we will need to propagate it through the whole analysis chain, from the MC files through the output
of reconstruction to ntuples and histogram files.
  • in addition to the dataset ID, a dataset has to have a long descriptive name, but it is the dsid which is supposed
to be used for all practical purposes in the software. Ok, so
 DDDDVsSObB  (lowercased characters stay as is, uppercase - interpreted)
 DDDD: 4-character-long name stub (it looks that 4 chars would allow us to name meaningfully  all datasets we have so far dealt with
 V   : dataset version, i.e. 0,1,2 ...  (in principle, could be anything and be used for any purpose)
 S   : stage index: some datasets require a multi-stage generation, to make a clean and visible cut, can assign S=a to reconstruction
 O   : for stages with several output streams, just number them - 0,1,2,3,4...
 B   : beam intensity : 0 - signal only, 1 : 1 batch mode, 2 : 2 batch mode , the rest -reserved for future 
  
 examples:
|------------------------+---------------------------------------------------|
| 4 char dataset ID stub | comments, examples                                |
|------------------------+---------------------------------------------------|
| beam                   | beam, trace everything                            |
| bmum                   | bmum : beam, trace mu-, but not e+,e-, gamma      |
|                        | bmum1s10b0 : s1, 0=first output stream - into TS  |
|                        | bmum1s20b0 :                                      |
|                        | bmum1s30b0 :                                      |
| bpim                   | beam pi- tracing                                  |
| mdio                   | DIO electrons,                                    |
|                        | examples: mdio0s10b0: leading order               |
|                        |           mdio1s10b1: leading log,+ 1 batch       |
| mumi                   | mu-, example : mgun0sr1b0 : muon particle gun     |
| mupl                   | mu+                                               |
| elec                   | e- (i.e. single particle gun)                     |
| posi                   | e+                                                |
| pipl                   | pi+                                               |
| pimi                   | pi-                                               |
| pbar                   | pbars example: pbar0s01b0 (reconstructed)         |
| rpce                   | RPC external                                      |
| rpci                   | RPC internal                                      |
| rmce                   | RMC external                                      |
| rmci                   | RMC internal                                      |
| cosm                   | cosmics                                           |
| prot                   | ejected protons                                   |
| deut                   | ejected deutrons                                  |
| phot                   | ejected photons                                   |
|------------------------+---------------------------------------------------|

Dataset description on GitHub

Links