Changelog
This project adheres to Semantic Versioning and Keep a Changelog format.
[0.13.0] - unreleased
Added
--pipeline-kwargsarguemnt torefgenie buildcommand, which allows you to pass keyword arguments to the pypiper pipeline.
[0.12.1] - 2021-11-04
Fixed
- A bug with setuptools no longer allowing use_2to3.
[0.12.0] - 2021-06-28
Added
- MapReduce framework support in
refgenie buildthat supports automating asset builds at scale (--mapand--reduceoptions) - an option to automatically pull parent assets when building a derived asset (
--pull-parentsoption inrefgenie build)
Fixed
- misleading exit codes in
refgenie build; #258
[0.11.0] - 2021-04-27
Added
- remote commands, which work without genome config file:
refgenie seekrrefgenie populaterrefgenie listrrefgenie populatecommand for refgenie registry paths populating with local paths--flag-meaningsargument torefgenie comparecommand
[0.10.0] - 2021-03-11
Version 0.10.0 requires an upgrade to the configuration file and asset structure. Please refer to the upgrade tutorial for instructions on how to migrate your config upon upgrade to 0.10.0.
Changed
- instead of using human-readable names as genome identifiers refgenie uses sequence-derived digests in the config
- asset data moved to
datadirectory - asset files are now named after genome digests
- refgenieserver API v3 is now used for remote assets retrieval
- improved visual interface in
list,listrandpullsubcommands
Added
dataandaliasdirectories in genome directory that are used to store asset and aliases data, respectivelyrefgenie aliascommand for genome aliases managementrefgenie upgradecommand for config format upgradesrefgenie comparecommand for genome compatibility determination
[0.9.3] - 2020-07-29
Changed
- short option string for
--no-overwritefrom-nto-o
Added
- option to handle large asset archives pulling from the CLI (
-l/--no-largeflag) - option to set the maximum archive size to
pullwith no confirmation required (--size-cutoffargument) -s/--seek-keysargument torefgenie addto specify seek keys for added assets
Fixed
refgenie addissues -- added assets are no longer imported to thegenome_folder; #180
[0.9.2] - 2020-07-01
Changed
- in
refgenie buildreduced the config file locking time to prevent problems in multi-build context - dropped Python 2 support
Added
- parametrized
kmerin salmon recipes - support for all genome configuration file parameter values initialization in
refgenie init
[0.9.1] - 2020-05-01
Added
- added option (
-f/--force) to confirm assets overwriting upfront inrefgenie addaddrefgenie pull
Changed
- fixed bug in hisat2_index that pointed to the parent folder. The seek key now points to the folder/{genome}, as expected by the tool
- fixed bug in bwa_index that pointed to the parent folder. The seek key now points to the folder/{genome}.fa, as expected by the tool
[0.9.0] - 2020-03-17
Changed
- fixed a bug in bowtie2_index recipe that pointed to the parent folder. The seek key now points to the folder/{genome}, as expected by bowtie2
- in
refgenie seekfile existence check is not performed by default
Added
- possibility to execute library module as a script:
python -m refgenie ... - support for repeated recipe inputs on CLI (arguments:
--files,--assetsand--params) - a possibility to perform file existence check (
-e/--check-exists) inrefgenie seek
[0.8.2] - 2020-01-08
Fixed
SyntaxErrorin Python 2.7; #155
[0.8.1] - 2019-12-13
Fixed
salmon_partial_sa_indexrecipe
Changed
refgenie removeremoves the asset relatives linksrefgenie initusesinitialize_config_filemethod fromrefgenconf- default input assets for
salmon_sa_indexandsalmon_partial_sa_indexrecipes to transcriptomes within the namespace
Added
threadsparameter to the following recipes:dbnsfp,salmon_index,star_index
Removed
- documentation regarding
-r/--recipeoption inrefgenie build. It will be removed in the future
[0.8.0] - 2019-12-06
Changed
refgenie buildcommand arguments naming scheme:--{input_name} <path>to--files {input_name}=<path>-r/--requirementsinrefgenie buildcommand to-q/--requirements- recipe format: requirements (both assets and inputs) are lists of dicts rather that lists of strings
refgenie listdisplays current server subscriptions
Added
refgenie idcommand for asset digest retrieval- cross-namespace asset relationships support
--assetsargument inrefgenie buildcommand to provide parent assets, if required-r/--recipeinrefgenie buildcommand argument to provide the recipe for the buildsubscribeandunsubscribesubcommands to enable server list manipulation in the config file (genome_serversentry in the refgenie configuration file)- new recipes:
salmon_sa_indexsalmon_partial_sa_indexsuffixerator_indextallymer_index
Removed
-t/--taginrefgenie build. Use more flexible--assetsinstead.
[0.7.2] - 2019-11-06
Added
dbsnprecipe- distribute the license file with the package
[0.7.1] - 2019-10-29
Changed
--genome-servercan now be called multiple times to add additional refgenieserverslistrwill check each available refgenieserver and display assetspullwill check each available refgenieserver and take the first matching asset found
Added
- possibility to list multiple selected genomes in
refgenie list/listr -g
[0.7.0] - 2019-10-21
Added
import_igenomecommand line tool for iGenomes integration with Refgenie--genome/tag-descriptionarguments to therefgenie buildcommand-r/--requirementsargument to therefgenie buildcommand recipe requirements to display required inputs and required assets for a particular recipe- config manipulation support in multi-user contexts, it's racefree, uses file locks
dbNSFPasset recipe- assets tagging
refgenie tagcommand that assigns a tag to an assets (re-tags it)refgenie getseqcommand that retrieves sequence ranges from a genomeseek_keys, which provide control over files within an assetasset_digests, which are calculated after asset building and used to assure asset provenance- asset relationships recording (
asset_children,asset_parentsfields)
Changed
- assets can be referred to by registry paths:
genome/asset.seek_key:tag - config v0.3 is required
refgenie pullusesrefgenieserverAPI v2
[0.6.0] - 2019-08-05
Added
listandlistrsubcommand results can be restricted to a specific genome with-g/--genomeoptionsremovesubcommand will remove an asset from disk and config- Added recipes for new assets:
ensembl_gtfandfeat_annotation buildnow populates theasset_descriptionfield in the config with corresponding value from the recipe
Changed
- changed some asset locations;
tss_annotationis now namedrefgene_tssorensembl_tssand is built by therefgene_annoorensembl_gtfassets. Renamedgene_annotorefgene_anno.
[0.5.0] - 2019-07-11
Changed
refgenie builduses dict-like recipes for build instructions-
Major genome configuration file format changes
- Added
config_versionentry - Added
assetssection ingenomessection
- Added
-
recipes can now include container images
Added
genomescan have attributes, like description- Added recipes for new assets
salmon,bwa,star,gene_anno, andtss_annotation.
[0.4.4] - 2019-07-01
Added
addsubcommand
[0.4.3] - 2019-06-21
Changed
- Build process now builds individual assets
[0.4.2] - 2019-06-18
Added
seeksubcommand
Changed
- Require config file arg for
init.
Fixed
- Pick up env var for
initconfig.
[0.4.1] -- 2019-06-14
Fixed
- Use newer
yacmanand regaininitfunctionality.
[0.4.0] -- 2019-06-14
Added
- Added new commands
init,pull,listandlistr - Added connectivity option with remote data sources
[0.3.2] -- 2019-05-14
Fixed
- Fixed a bug with packaging
[0.3.0] -- 2019-05-10
Added
- Implemented installable CLI
- Packaged for release on PyPI
- Wrote comprehensive docs
Fixed
- Fixed naming of
.fq.gzfiles
[0.2.0] -- 2017-03-08
Added
- Transition release as a functional script
[0.1.0] -- 2016-11-11
- Project started