Difference between revisions of "Workflows"

From Mu2eWiki
Jump to navigation Jump to search
 
(40 intermediate revisions by 2 users not shown)
Line 2: Line 2:
  
 
== Workflow for jobs ==
 
== Workflow for jobs ==
* [[Grids|Grids]] - overview of the computing farms
+
* [[Grids|Grids]] - overview of the computing farms and job submission
*[[DataTransfer|Data transfer]] - how to move larger datasets, necessary for grid jobs
+
* [[POMS]] - grid campaign organization, submission and recovery tool
* [http://mu2e.fnal.gov/atwork/computing/grid_workflow/index.shtml Workflow] for running grid jobs (convert to wiki)
+
* [[HPC]] - high performance computing resources
 +
* [[Docker]] software containers for submitting to special farms
 +
* [[AnalysisWorkflow| analysis workflow]] - how to read existing datasets
 +
* [[MCProdWorkflow| production simulation workflow]] - how to run simulation, concatenate and upload the files
 +
* [[MARSWorkflow| MARS and G4Beamline workflow]] - these are not in the art framework
 +
* [[JobPlan|Job planning]] - considerations of project CPU, disk,  memory and file size
 +
* [[Prestage]] - make sure files have been retrieved off tape and are ready to use
 +
* [[SimulationFCL]] - an overview how MDC2020 fcl works
 +
* [[GenerateFcl]] - generate fcl for simulation jobs
 +
* [[JustInTimeFcl]] - a technique for simplifying fcl generation
 +
* [[SubmitJobs]] - submit, monitor and recover jobs in the workflow
 +
* [[Concatenate]] - merging root and art files
 +
* [[Upload]] - upload files to tape
 +
* [[ErrorRecovery]] - how to recover from common errors
 +
* [[Validation]] - check how code is performing, using MC
 +
* [[DQM]] - Data Quality Monitoring
 +
* [[Datasets]] - production datasets
 +
* [[RunNumbers]] - run number ranges and uses
 +
* [[ProductionLogic]] - production job logic
  
 
== Data Handling ==
 
== Data Handling ==
 +
* [[Dcache|dCache]] - the large aggregated data disk system
 +
* [[Enstore|enstore]] - the mass storage tape system
 +
* [[FileTransferService|File Transfer Service (FTS)]] - service to create SAM records and copy files to tape
 +
* [[RawDataMover|Raw Data Mover (RDM)]] - Mu2e service to copy teststand data files to tape
 +
* [[DataTransfer|Data transfer]] - how to move larger datasets, necessary for grid jobs
 
* [[FileNames|File names]] - how to name files for upload or for production
 
* [[FileNames|File names]] - how to name files for upload or for production
* [[FileFamilies|File families]] - logical grouping of data for upload to tape
+
* [[FileFamilies|File families]] - the logical grouping of data on tapes
* [[Dcache|dCache]] - the large aggregated disk system
 
* [[Enstore|enstore]] - the tape system
 
 
* [[SAM|SAM]] - file metadata and data handling management system
 
* [[SAM|SAM]] - file metadata and data handling management system
* [http://mu2e.fnal.gov/public/hep/computing/grid_workflow/ file upload] - run MC and write to tape
+
** [[SamMetadata|SAM metadata]] - the metadata stored for each file
* [[UploadFTS|FTS Upload]] (deprecated upload to tape)
+
** [[SamExpert|SAM expert]] - some notes for the future, not of general interest
 +
* [[FileTools|File tools]] - tools for manipulating large file datasets and metadata
 +
* [[UploadFTS|FTS Upload]] - deprecated method to upload by the FTS area
 +
* [[Rucio]] - next-gen data handling
  
 
==Operations==
 
==Operations==
* [http://mu2e.fnal.gov/atwork/computing/ops/ops.shtml operations links]
+
* [[OfflineOps|operations links]]
 
* [[Mu2epro|Mu2epro account]]
 
* [[Mu2epro|Mu2epro account]]
  
 
[[Category:Computing]]
 
[[Category:Computing]]
[[Category:Computing/Workflow]]
+
[[Category:Workflows]]

Latest revision as of 15:32, 18 April 2024


Workflow for jobs

Data Handling

  • dCache - the large aggregated data disk system
  • enstore - the mass storage tape system
  • File Transfer Service (FTS) - service to create SAM records and copy files to tape
  • Raw Data Mover (RDM) - Mu2e service to copy teststand data files to tape
  • Data transfer - how to move larger datasets, necessary for grid jobs
  • File names - how to name files for upload or for production
  • File families - the logical grouping of data on tapes
  • SAM - file metadata and data handling management system
    • SAM metadata - the metadata stored for each file
    • SAM expert - some notes for the future, not of general interest
  • File tools - tools for manipulating large file datasets and metadata
  • FTS Upload - deprecated method to upload by the FTS area
  • Rucio - next-gen data handling

Operations