LoadBatch: Difference between revisions

From CoGepedia
Jump to navigation Jump to search
Line 13: Line 13:
Data files can be given individually or together as a compressed tar archive file (ending in .tar.gz, also known as a "tarball").
Data files can be given individually or together as a compressed tar archive file (ending in .tar.gz, also known as a "tarball").


Valid combinations of input files include:
'''Valid combinations of input files include:'''
* tarball of metadata file and data file(s)
* tarball of metadata file and data file(s)
* metadata file and tarball of data file(s)
* metadata file and tarball of data file(s)
Line 20: Line 20:
''Note:  tarballs must not contain subdirectories.''
''Note:  tarballs must not contain subdirectories.''


You can select and retrieve data files located at:  
'''You can select and retrieve data files located at:'''


*The iPlant Data Store  
*The iPlant Data Store  

Revision as of 18:03, 3 October 2014

LoadBatch provides the ability to conveinently load a set of genomes or experiments in a single operation. To load a set of genomes or experiments using LoadGenome and LoadExperiment would require running the tool for each genome/experiment individually and is very time consuming for large data sets.

Inputs

Metadata File

A single metadata file that describes the data files contained is required for the load. See the metadata section: Metadata

Data File(s)

Data files can be given individually or together as a compressed tar archive file (ending in .tar.gz, also known as a "tarball").

Valid combinations of input files include:

  • tarball of metadata file and data file(s)
  • metadata file and tarball of data file(s)
  • separate metadata file and data files

Note: tarballs must not contain subdirectories.

You can select and retrieve data files located at:

  • The iPlant Data Store
  • An FTP server
  • Your computer (Upload)

Data Formats

For supported genome data file formats, see LoadGenome.

For supported experiment data file formats, see LoadExperiment.