|
|
|
|
Changelog for bcftools-1.19-15.28.x86_64.rpm :
* Wed Jan 03 2024 Stefan BrĂ¼ns - Update to version 1.19 Too many changes to list, for details see https://github.com/samtools/bcftools/releases/tag/1.19- Update to version 1.18 Too many changes to list, for details see https://github.com/samtools/bcftools/releases/tag/1.18- Update to version 1.17 Too many changes to list, for details see https://github.com/samtools/bcftools/releases/tag/1.17 * Fri Aug 26 2022 Jan Engelhardt - Update to release 1.16 * New plugin `bcftools +variant-distance` to annotate records with distance to the nearest variant. * The -i/-e filtering expression gained support for multiple filters, e.g. `-i FILTER=\"A;B\"`.- Resolve \"file packaged twice\" rpmlint warnings * Thu Apr 21 2022 Ferdinand Thiessen - Update to version 1.15.1 * bcftools annotate: New -H, --header-line convenience option to pass a header line on command line * bcftools csq: A list of consequence types supported by bcftools csq has been added to the manual page. * bcftools +fill-tags: * Extend generalized functions so that FORMAT tags can be filled as well * Allow multiple custom functions in a single run. * bcftools norm: * Fix an assertion failure triggered when a faulty VCF file with a \'-\' character in the REF allele was used with bcftools norm --atomize. * Fix the loss of phasing in half-missing genotypes in variant atomization * bcftools roh: Fix a bug that could result in an endless loop or incorrect AF estimate when missing genotypes are present and the --estimate-AF - option was used * bcftools +split-vep: VEP fields with characters disallowed in VCF tag names by the specification couldn\'t be queried.- Update to version 1.15 * New bcftools head subcommand for conveniently displaying the headers of a VCF or BCF file. * The -T, --targets-file option had the following bug originating in HTSlib code * bcftools annotate: * In addition to --rename-annots, which requires a file with name mappings, it is now possible to do the same on the command line -c NEW_TAG:=OLD_TAG * Add new option --min-overlap which allows to specify the minimum required overlap of intersecting regions * Allow to transfer ALT from VCF with or without replacement * bcftools convert: * Revamp of --gensample, --hapsample and --haplegendsample family of options * New --3N6 option to output/input the new version of the .gen file format * Deprecate the --chrom option in favor of --3N6. * The CHROM:POS_REF_ALT IDs which are used to detect strand swaps are required and must appear either in the \"SNP ID\" column or the \"rsID\" column. * bcftools csq: Allow GFF files with phase column unset * bcftools filter: New --mask, --mask-file and --mask-overlap options to soft filter variants in regions * bcftools +fixref * The -m id option now works also for non-dbSNP ids * New -m flip-all mode for flipping all sites * bcftools isec: Prevent segfault on sites filtered with -i/-e in all files * bcftools mpileup: More flexible read filtering using the options * bcftools query: Make the --samples and --samples-file options work also in the --list-samples mode. * bcftools +setGT: Fix a bug in -t q -e EXPR logic applied on FORMAT fields, sites with all samples failing the expression EXPR were incorrectly skipped. * bcftools sort: make use of the TMPDIR environment variable when defined * bcftools +trio-dnm2: The --use-NAIVE mode now also adds the de novo allele in FORMAT/VA- Update to version 1.14 * New --regions-overlap and --targets-overlap options which address a long-standing design problem with subsetting VCF files by region. * The --output-type option can be used to override the default compression level * bcftools annotate: * when --set-id and --remove are combined, --set-id cannot use tags deleted by --remove. * while non-symbolic variation are uniquely identified by POS,REF,ALT, symbolic alleles starting at the same position were indistinguishable. * add a new . modifier to control whether missing values should be carried over from a tab-delimited file or not. * bcftools +check-ploidy: by default missing genotypes are not used when determining ploidy. * bcftools concat: new --ligate-force and --ligate-warn options for finer control of -l, --ligate behavior in imperfect overlaps. * bcftools consensus: Apply mask even when the VCF has no notion about the chromosome. * bcftools +contrast: support for chunking within map/reduce framework allowing to collect NASSOC counts even for empty case/control sample sets * bcftools csq: * bug fix, compound indels were not recognised in some cases * compound variants were incorrectly marked as \'inframe\' even when stop codon would occur before the frame was restored * bug fix, FORMAT/BCSQ bitmasks could have been assigned incorrectly to some samples at multiallelic sites, a superset of the correct consequences would have been set * bug fix, the upstream stop could be falsely assigned to all samples in a multi-sample VCF even if the stop was relevant for a single sample only * further improve the detection of mismatching chromosome naming (e.g. \"chrX\" vs \"X\") in the GFF, VCF and fasta files * bcftools merge: keep (sum) INFO/AN,AC values when merging VCFs with no samples * bcftools mpileup: new --indel-size option which allows to increase the maximum considered indel size considered, large deletions in long read data are otherwise lost. * bcftools norm: * atomization now supports Number=A,R string annotations * assign as many alternate alleles to genotypes at multiallelic sites in the-m + mode, disregarding the phase. * bcftools sort: increase accuracy of the --max-mem option limit, previously the limit could be exceeded by more than 20% * bcftools +trio-dnm: new --with-pAD option to allow processing of VCFs without FORMAT/QS. * bcftools view: the functionality of the option --compression-level lost in 1.12 has been restored- Update to version 1.13 * bcftools annotate: * Fix rare a bug when INFO/END is present, all INFO fields are removed with bcftools annotate -x INFO and BCF output is produced. * Support for matching annotation line by ID, in addition to CHROM,POS,REF, and ALT * bcftools csq: * When GFF and VCF/fasta use a different chromosome naming convention no consequences would be added. * Parametrize brief-predictions parameter to allow explicit number of amino acids to be printed. * bcftools +fill-tags: * Generalization and better support for custom functions that allow adding new INFO tags based on arbitrary -i, --include type of expressions. * When FORMAT/GT is not present, the INFO/AF tag will be newly calculated from INFO/AC and INFO/AN. * bcftools gtcheck: * Switch between FORMAT/GT or FORMAT/PL when one is (implicitly) requested but only the other is available * Improve diagnostics, printing warnings when a line cannot be matched and the number of lines skipped for various reasons * bcftools index: The program now accepts both data file name and the index file name. * bcftools isec: Always generate sites.txt with isec -p * bcftools +mendelian: Consider only complete trios, do not crash on sample name typos * bcftools mpileup: * New --seed option for reproducibility of subsampling code in HTSlib * The SCR annotation which shows the number of soft-clipped reads now correctly pools reads together regardless of the variant type. * Major revamp of BAQ. * Modified scale of Mann-Whitney U tests. Newly INFO/ *Z annotations will be printed * bcftools norm: * Fix Type=Flag output in norm --atomize * Atomization must not discard ALT=. records * Atomization of AD and QS tags now correctly updates occurrences of duplicate alleles within different haplotypes * Fix a bug in atomization of Number=A,R tags * bcftools reheader: Add -T, --temp-prefix option * bcftools +setGT: A wider range of genotypes can be set by the plugin by allowing specifying custom genotypes. * bcftools +split-vep: * New -u, --allow-undef-tags option * Better handling of ambiguous keys such as INFO/AF and CSQ/AD. * Some consequence field names may not constitute a valid tag name, such as \"pos(1-based)\". * bcftools +tag2tag: New --QR-QA-to-QS option to convert annotations generated by Freebayes to QS used by BCFtools * bcftools +trio-dnm: * Add support for sites with more than four alleles. * New --use-NAIVE option for a naive DNM calling based solely on FORMAT/GT and expected Mendelian inheritance. * Fix behaviour to match the documentation, the --dnm-tag DNG option now correctly outputs log scaled values by default, not phred scaled. * Fix bug in VAF calculation, homozygous de novo variants were incorrectly reported as having VAF=50% * Fix arithmetic underflow which could lead to imprecise scores and improve sensitivity in high coverage regions * Allow combining --pn and --pns to set the noise thresholds independently- Rebased use_python3.patch- Drop python3 and perl build requirements, not needed, shbang of executables can be patched anyway. * Fri May 14 2021 Ferdinand Thiessen - Update to version 1.12 * The output file type is determined from the output file name suffix, where available, so the -O/--output-type option is often no longer necessary. * Make F_MISSING in filtering expressions work for sites with multiple ALT alleles * Fix N_PASS and F_PASS to behave according to expectation when reverse logic is used (#1397). This fix has the side effect of query (or programs like +trio-stats) behaving differently with these expressions, operating now in site-oriented rather than sample-oriented mode. * bcftools annotate: * New --rename-annots option to help fix broken VCFs * New -C option allows to read a long list of options from a file to prevent very long command lines. * New append-missing logic allows annotations to be added for each ALT allele in the same order as they appear in the VCF. * bcftools concat: * Do not phase genotypes by mistake if they are not already phased with -l * bcftools consensus: * New --mask-with, --mark-del, --mark-ins, --mark-snv options * Symbolic should have only one REF base. If there are multiple, take POS+1 as the first deleted base. * Make consensus work when the first base of the reference genome is deleted. * bcftools +contrast: * The NOVELGT annotation was previously not added when requested. * bcftools convert: * Make the --hapsample and --hapsample2vcf options consistent with each other and with the documentation. * bcftools call: * Revamp of call -G, previously sample grouping by population was not truly independent and could still be influenced by the presence of other sample groups. * Optional addition of INFO/PV4 annotation with call -a INFO/PV4 * Remove generation of useless HOB and ICB annotation; use +fill-tags -- -t HWE,ExcHet instead * The call -f option was renamed to -a to (1) make it consistent with mpileup and (2) to indicate that it includes both INFO and FORMAT annotations * bcftools csq: * Fix a bug wich caused incorrect FORMAT/BCSQ formatting at sites with too many per-sample consequences * Fix a bug which incorrectly handled the --ncsq parameter and could clash with reserved BCF values, consequently producing truncated or even incorrect output of the %TBCSQ formatting expression in bcftools query. * bcftools +fill-tags: * MAF definition revised for multiallelic sites, the second most common allele is considered to be the minor allele * New FORMAT/VAF, VAF1 annotations to set the fraction of alternate reads provided FORMAT/AD is present * bcftools gtcheck: * support matching of a single sample against all other samples in the file with -s qry:sample -s gt:-. * bcftools merge: * Make merge -R behavior consistent with other commands and pull in overlapping records with POS outside of the regions * Bug fix * bcftools mpileup: * Add new optional tag mpileup -a FORMAT/QS * bcftools norm: * New -a, --atomize functionality to decompose complex variants, for example MNVs into consecutive SNVs * New option --old-rec-tag to indicate the original variant * bcftools query: * Incorrect fields were printed in the per-sample output when subset of samples was requested via -s/-S and the order of samples in the header was different from the requested -s/-S order * bcftools +prune: * New options --random-seed and --nsites-per-win-mode * bcftools +split-vep: * Transcript selection now works also on the raw CSQ/BCSQ annotation. * Bug fix, samples were dropped on VCF input and VCF/BCF output * bcftools stats: * Changes to QUAL and ts/tv plotting stats: avoid capping QUAL to predefined bins, use an open-range logarithmic binning instead * plot dual ts/tv stats: per quality bin and cumulative as if threshold applied on the whole dataset * bcftools +trio-dnm2: * Major revamp of +trio-dnm plugin, which is now deprecated and replaced by +trio-dnm2. * The original trio-dnm calling model used genotype likelihoods (PLs) as the input for calling. * This new version also implements the DeNovoGear model. * For more details see http://samtools.github.io/bcftools/trio-dnm.pdf- Update use_python3.patch * Thu May 13 2021 Ferdinand Thiessen - Update to version 1.11 * Breaking change in -i/-e expressions on the FILTER column. The new behaviour is: Expression Result FILTER=\"A\" Exact match, for example \"A;B\" does not pass FILTER!=\"A\" Exact match, for example \"A;B\" does pass FILTER~\"A\" Both \"A\" and \"A;B\" pass FILTER!~\"A\" Neither \"A\" nor \"A;B\" pass * Fix in commutative comparison operators, in some cases reversing sides would produce incorrect results * Better support for filtering on sample subsests * bcftools annotate: * Previously it was not possible to use --columns =TAG with INFO tags and the --merge-logic feature was restricted to tab files with BEG,END columns, now extended to work also with REF,ALT. * Make annotate -TAG/+TAG work also with FORMAT fields. * ID and FILTER can be transferred to INFO and ID can be populated from INFO. * bcftools consensus: * Fix in handling symbolic deletions and overlapping variants. * Fix --iupac-codes crash on REF-only positions with ALT=\".\". * Fix --chain crash * Preserve the case of the genome reference. * Add new -a, --absent option which allows to set positions with no supporting evidence to \"N\" (or any other character). * bcftools convert: * The option --vcf-ids now works also with -haplegendsample2vcf. * New option --keep-duplicates * bcftools csq: * Add misc/gff2gff.py script for conversion between various flavors of GFF files. The initial commit supports only one type * Add missing consequence types. * Allow overlapping CDS to support ribosomal slippage. * bcftools +fill-tags: * Added new annotations: INFO/END, TYPE, F_MISSING. * bcftools filter: * Make --SnpGap optionally filter also SNPs close to other variant types. * bcftools gtcheck: * Complete revamp of the command. The new version is faster and allows N:M sample comparisons, not just 1:N or NxN comparisons. Some functionality was lost (plotting and clustering) but may be added back on popular demand. * bcftools +mendelian: * Revamp of user options, output VCFs with mendelian errors annotation, read PED files * bcftools merge: * Update headers when appropriate with the \'--info-rules *:join\' INFO rule. * Local alleles merging that produce LAA and LPL when requested, a draft implementation of samtools/hts-specs#434 * New --no-index which allows to merge unindexed files. * Fixes in gVCF merging. * bcftools norm: * Fixes in --check-ref s reference setting features with non-ACGT bases. * New --keep-sum switch to keep vector sum constant when splitting multiallelics. * bcftools +prune: * Extend to allow annotating with various LD metrics: r^2, Lewontin\'s D\' * bcftools query: * New %N_PASS() formatting expression to output the number of samples that pass the filtering expression. * bcftools reheader: * Improved error reporting to prevent user mistakes. * bcftools roh: * The --AF-file description incorrectly suggested \"REF\\tALT\" instead of the correct \"REF,ALT\". * RG lines could have negative length. * new --include-noalt option to allow also ALT=. records. * bcftools scatter: * New plugin intended as a convenient inverse to concat * bcftools +split: * New --groups-file option for more flexibility of defining desired output * New --hts-opts option to reduce required memory by reusing one output header and allow overriding the default hFile\'s block size * Add support for multisample output and sample renaming * bcftools +split-vep: * Add default types (Integer, Float, String) for VEP subfields and make --columns - extract all subfields into INFO tags in one go. * Tue Feb 25 2020 Pierre Bonamy - Changed python dependencies from python3 to python3-base and python3-matplotlib * Wed Feb 12 2020 Todd R - Add use_python3.patch to switch from python2 to python3 * Wed Feb 05 2020 Todd R - Update to 1.10.2 * This release fixes crashes reported on files including integer INFO tags with values outside the range officially supported by VCF. It also fixes a bug where invalid BCF files would be created if such values were present.- Update to 1.10.0 + Numerous bug fixes, usability improvements and sanity checks were added to prevent common user errors. + The -r, --regions (and -R, --regions-file) option should never create unsorted VCFs or duplicates records again. This also fixes rare cases where a spanning deletion makes a subsequent record invisible to bcftools isec and other commands. + Additions to filtering and formatting expressions * support for the spanning deletion alternate allele (ALT= *) * new ILEN filtering expression to be able to filter by indel length * new MEAN, MEDIAN, MODE, STDEV, phred filtering functions * new formatting expression %PBINOM (phred-scaled binomial probability), %INFO (the whole INFO column), %FORMAT (the whole FORMAT column), %END (end position of the REF allele), %END0 (0-based end position of the REF allele), %MASK (with multiple files indicates the presence of the site in other files) + New plugins * +gvcfz: compress gVCF file by resizing gVCF blocks according to specified criteria * +indel-stats: collect various indel-specific statistics * +parental-origin: determine parental origin of a CNV region * +remove-overlaps: remove overlapping variants. * +split-vep: query structured annotations such INFO/CSQ created by bcftools/csq or VEP * +trio-dnm: screen variants for possible de-novo mutations in trios + annotate * new -l, --merge-logic option for combining multiple overlapping regions + call * new bcftools call -G, --group-samples option which allows grouping samples into populations and applying the HWE assumption within but not across the groups. + csq * significant reduction of memory usage in the local -l mode for VCFs with thousands of samples and 20% reduction in the non-local haplotype-aware mode. * fixes a small memory leak and formatting issue in FORMAT/BCSQ at sites with many consequences * do not print protein sequence of start_lost events * support for \"start_retained\" consequence * support for symbolic insertions (ALT=\"\"), \"feature_elongation\" consequence * new -b, --brief-predictions option to output abbreviated protein predictions. + concat * the --naive command now checks header compatibility when concatenating multiple files. + consensus * add a new -H, --haplotype 1pIu/2pIu feature to output first/second allele for phased genotypes and the IUPAC code for unphased genotypes * new -p, --prefix option to add a prefix to sequence names on output + +contrast * added support for Fisher\'s test probability and other annotations + +fill-from-fasta * new -N, --replace-non-ACGTN option + +dosage * fix some serious bugs in dosage calculation + +fill-tags * extended to perform simple on-the-fly calculations such as calculating INFO/DP from FORMAT/DP. + merge * add support for merging FORMAT strings * bug fixed in gVCF merging + mpileup * a new optional SCR annotation for the number of soft-clipped reads + reheader * new -f, --fai option for updating contig lines in the VCF header + +trio-stats * extend output to include DNM homs and recurrent DNMs + VariantKey support * Thu Sep 06 2018 flyosAATTmailoo.org- Update to 1.9 * `annotate` - REF and ALT columns can be now transferred from the annotation file. - fixed bug when setting vector_end values. * `consensus` - new -M option to control output at missing genotypes - variants immediately following insersions should not be skipped. Note however, that the current fix requires normalized VCF and may still falsely skip variants adjacent to multiallelic indels. - bug fixed in -H selection handling * `convert` - the --tsv2vcf option now makes the missing genotypes diploid, \"./.\" instead of \".\" - the behavior of -i/-e with --gvcf2vcf changed. Previously only sites with FILTER set to \"PASS\" or \".\" were expanded and the -i/-e options dropped sites completely. The new behavior is to let the -i/-e options control which records will be expanded. In order to drop records completely, one can stream through \"bcftools view\" first. * `csq` - since the real consequence of start/splice events are not known, the aminoacid positions at subsequent variants should stay unchanged - add `--force` option to skip malformatted transcripts in GFFs with out-of-phase CDS exons. * `+dosage`: output all alleles and all their dosages at multiallelic sites * `+fixref`: fix serious bug in -m top conversion * `-i/-e` filtering expressions: - add two-tailed binomial test - add functions N_PASS() and F_PASS() - add support for lists of samples in filtering expressions, with many samples it was impractical to list them all on the command line. Samples can be now in a file as, e.g., GT[AATTsamples.txt]=\"het\" - allow multiple perl functions in the expressions and some bug fixes - fix a parsing problem, \'AATT\' was not removed from \'AATTfilename\' expressions * `mpileup`: fixed bug where, if samples were renamed using the `-G` (`--read-groups`) option, some samples could be omitted from the output file. * `norm`: update INFO/END when normalizing indels * `+split`: new -S option to subset samples and to use custom file names instead of the defaults * `+smpl-stats`: new plugin * `+trio-stats`: new plugin * Fixed build problems with non-functional configure script produced on some platforms * Thu Jul 12 2018 flyosAATTmailoo.org- Cleaned spec file using spec-cleaner- Update to 1.8 * `-i, -e` filtering: Support for custom perl scripts * `+contrast`: New plugin to annotate genotype differences between groups of samples * `+fixploidy`: New options for simpler ploidy usage * `+setGT`: Target genotypes can be set to phased by giving `--new-gt p` * `run-roh.pl`: Allow to pass options directly to `bcftools roh` * Number of bug fixes * `-i, -e` filtering: Major revamp, improved filtering by FORMAT fields and missing values. New GT=ref,alt,mis etc keywords, check the documenation for details. * `query`: Only matching expression are printed when both the -f and -i/-e expressions contain genotype fields. Note that this changes the original behavior. Previously all samples were output when one matching sample was found. This functionality can be achieved by pre-filtering with view and then streaming to query. Compare bcftools query -f\'[%CHROM:%POS %SAMPLE %GT\ ]\' -i\'GT=\"alt\"\' file.bcf and bcftools view -i\'GT=\"alt\"\' file.bcf -Ou | bcftools query -f\'[%CHROM:%POS %SAMPLE %GT\ ]\' * `annotate`: New -k, --keep-sites option * `consensus`: Fix --iupac-codes output * `csq`: Homs always considered phased and other fixes * `norm`: Make `-c none` work and remove `query -c` * `roh`: Fix errors in the RG output * `stats`: Allow IUPAC ambiguity codes in the reference file; report the number of missing genotypes * `+fill-tags`: Add ExcHet annotation * `+setGt`: Fix bug in binom.test calculation, previously it worked only for nAlt* `+split`: New plugin to split a multi-sample file into single-sample files in one go * Improve python3 compatibility in plotting scripts * New `sort` command. * New options added to the `consensus` command. Note that the `-i, --iupac` option has been renamed to `-I, --iupac`, in favor of the standard `-i, --include`. * Filtering expressions (`-i/-e`): support for `GT=` expressions and for lists and ranges (#639) - see the man page for details. * `csq`: relax some GFF3 parsing restrictions to enable using Ensembl GFF3 files for plants (#667) * `stats`: add further documentation to output stats files (#316) and include haploid counts in per-sample output (#671). * `plot-vcfstats`: further fixes for Python3 (AATTnsoranzo, #645, #666). * `query` bugfix (#632) * `+setGT` plugin: new option to set genotypes based on a two-tailed binomial distribution test. Also, allow combining `-i/-e` with `-t q`. * `mpileup`: fix typo (#636) * `convert --gvcf2vcf` bugfix (#641) * `+mendelian`: recognize some mendelian inconsistencies that were being missed (AATToronnavon, #660), also add support for multiallelic sites and sex chromosomes.
|
|
|