LoadBatch: Difference between revisions

From CoGepedia
Jump to navigation Jump to search
No edit summary
mNo edit summary
 
(17 intermediate revisions by the same user not shown)
Line 1: Line 1:
<div style="color:red">
LoadBatch provides the ability to conveinently load a set of genomes or experiments in a single operation.  To load a set of genomes or experiments using [[LoadGenome]] and [[LoadExperiment]] would require running the tool for each genome/experiment individually and is very time consuming for large data sets.  [[File:LoadBatch.png|thumb|400px]]
UNDER CONSTRUCTION
</div>


LoadBatch provides the ability to conveinently load a set of genomes or experiments in a single operation. To load a set of genomes using [[LoadGenome]] would require running the tool for each genome individually.
== Inputs ==
 
=== Metadata File ===


[[File:LoadBatch.png|thumb|400px]]
A single metadata file that describes the data files contained is required for the load.  See the metadata section: '''[[Metadata]]'''


== Inputs  ==
=== Data File(s)===


=== Metadata File ===
Data files can be given individually or together as a compressed tar archive file (ending in .tar.gz, also known as a "tarball").


See the metadata section: [[Metadata]]
'''Valid combinations of input files include:'''
* tarball of metadata file and data file(s)
* metadata file and tarball of data file(s)
* separate metadata file and data files


=== Data File(s) ===
<span style="color:red">''Note: tarballs must not contain subdirectories.''</span>


You can select and retrieve data file located at:  
'''The interface allows you to select and retrieve data files located at:'''


*The iPlant Data Store  
*The iPlant Data Store  
*An FTP server  
*An FTP server  
*Your computer (Upload)<br>
*Your computer (Upload)


=== Data Formats ===
=== Data Formats ===
For supported '''genome''' data file formats, see '''[[LoadGenome]]'''.
For supported '''experiment''' data file formats, see '''[[LoadExperiment]]'''.

Latest revision as of 18:58, 30 March 2015

LoadBatch provides the ability to conveinently load a set of genomes or experiments in a single operation. To load a set of genomes or experiments using LoadGenome and LoadExperiment would require running the tool for each genome/experiment individually and is very time consuming for large data sets.

Inputs

Metadata File

A single metadata file that describes the data files contained is required for the load. See the metadata section: Metadata

Data File(s)

Data files can be given individually or together as a compressed tar archive file (ending in .tar.gz, also known as a "tarball").

Valid combinations of input files include:

  • tarball of metadata file and data file(s)
  • metadata file and tarball of data file(s)
  • separate metadata file and data files

Note: tarballs must not contain subdirectories.

The interface allows you to select and retrieve data files located at:

  • The iPlant Data Store
  • An FTP server
  • Your computer (Upload)

Data Formats

For supported genome data file formats, see LoadGenome.

For supported experiment data file formats, see LoadExperiment.