[ -l NUM ], [ --lower-length-bound NUM ]
Specifies the minimum length of a TFO, TTS or triplex (TTS-TFO pair)
[ -u NUM ], [ --upper-length-bound NUM ]
Specifies the maximum length of a TFO, TTS or triplex (TTS-TFO pair),
-1 = unrestricted (default -1)
[ -m ], [ --triplex-motifs MOTIF1,MOTIF2,... ]
Specifies the motifs from the canonical triplex-formation rules to be
used when searching for TFOs in the third strand:
- R - the purine motif that permit guanines (G) and adenines (A)
- Y - the pyrimidine motif that permit cytosines (C) and thymines (T)
- M - the mixed motif, purine-pyrimdine, that permit guanines (G) and
thymines (T)
- P - parallel binding, i.e. motifs facilitating Hoogsten bonds;
this covers the pyrimidine motif and the purine-pyrimidine motif
in parallel configuration
- A - anti-parallel binding, i.e. motifs facilitating reverse Hoogsten bonds;
this covers the purine motif and the purine-pyrimidine motif
in anti-parallel configuration
By default all motifs are used.
[ -mpmg NUM ], [ --mixed-parallel-max-guanine NUM ]
Specifies the maximum guanine proportion (in %) in a mixed-motif triplexes
(GT) to consider this feature for parallel binding (Hoogsteen bonds)
(default 100).
As GT-TFOs can bind in either orientation this parameter can be used
to specify at which guanine content a GT-TFO should not be able to
bind in parallel orientation
(because anti-parallel binding will always dominate due to high G content).
Works in conjunction with --mixed-antiparallel-min-guanine but
parameters are keep separate as there may be a smooth transition
between the binding modes.
[ -mamg NUM ], [ --mixed-antiparallel-min-guanine NUM ]
Specifies the minimum guanine proportion (in %) in a mixed-motif triplexes
(GT) to consider this feature for anti-parallel binding (reverse Hoogsteen
bonds) (default 0).
As GT-TFOs can bind in either orientation this parameter can be used
to specify at which guanine content a GT-TFO should not be able to
bind in anti-parallel orientation.
(because parallel binding will always dominate due to the low G content).
Works in conjunction with --mixed-parallel-max-guanine but
parameters are keep separate as there may be a smooth transition
between the binding modes.
[ -e NUM ], [ --error-rate NUM ]
Set the maximal error-rate in % tolerated (default 5).
Triplexator searches for matches with an error-rate percent
of at most NUM. A match of a feature R with E errors has
error-rate of 100*(E/|R|), whereby |R| is the feature length.
In other words, a feature is allowed to have not more than
|R|*ceil(NUM)/100 errors.
[ -E NUM ], [ --maximal-error NUM ]
Set the maximal overall error tolerated, disable with -1 (default -1).
The maximal overall error is a hard threshold that can be used in
conjunction with the error-rate (see above).
For example, in a scenario using an error-rate of 10% and a
maximal error of 3, the error-rate will be the limiting factor up to
features of length 30, after which the maximal error takes over.
[ -c NUM ], [ --consecutive-errors NUM ]
Sets the tolerated number of consecutive errors with respect to the
canonical triplex rules as such were found to greatly destabilize
triplexes in vitro. The maximum permitted number is 3.
[ -g NUM ], [ --min-guanine NUM ]
Set the minimum guanine conten in triplex features. NUM must be a value
between 0 and 100 (default is 10).
The minimum guanine rate controls the ratio of guanines required in the
any triplex target site. For triplex-forming oligonucleotide this
constraint will be applied to their respective target.
[ -G NUM ], [ --max-guanine NUM ]
Set the maximum guanine conten in triplex features. NUM must be a value
between 0 and 100 (default is 100).
The maximum guanine rate controls the ratio of guanines required in the
any triplex target site. For triplex-forming oligonucleotide this
constraint will be applied to their respective target.
[ -b NUM ], [ --minimum-block-run NUM ]
Sets the number of consecutive matches required in a feature
discarding any feature that violates this constrait (default 1).
The rational behind this parameter is that a seed of consecutive matching
positions of a given length is required to initiate triplex formation.
Given the observation that central errors are more disruptive than
errors at the flanks of a triplex, this parameter will be especially
effective for short features.
Example:
feature valid -b discarded with -b
AGGAGAGtGAGAAAGA <= 8 >= 9
AGGAGAGGAGAAAtGA <= 13 >= 14
[ -a ], [--all-matches ]
Flag indicates that all qualifying sub-matches should be processed and
reported in addition to the longest match.
Careful! This can result in hugh output files when searching for
TFO-TTS pairs (i.e. providing single-stranded and doubles-stranded input).
[ -mf NUM], [ --merge-features NUM ]
merge overlapping features into a cluster and report the spanning region
Only supported for TFO and TTS detection, respectively.
For TFO-TTS pairs (triplexes) features are merged in the TFO and TTS
detection phase on default.
Any merge is performed before duplicate detection (-dd).
[ -dd NUM], [ --detect-duplicates NUM ]
Indiates whether and how duplicates should be detected (default 0).
Choices are:
- 0 = off do not detect any duplicates
- 1 = permissive detect duplicates in feature space,
e.g. AGGGAcGAGGA != AGGGAtGAGGA
- 2 = strict detect duplicates in target space,
e.g. AGGGAcGAGGA == AGGGAtGAGGA == AGGGAYGAGGA
Detection of duplicates requires all input sequence to be present in
memory at the same time, which will increase memory consumption
particularly when whole genomes are under investigation.
It is further advised to enable filtering of repeat and low complexity
regions to minimize the workload during duplicate detection.
[ -ssd [on|off] ], [--same-sequence-duplicates [on|off] ]
Whether to count a feature copy in the same sequence as duplicates
or not. (default off)
[ -v ], [ --verbose ]
Verbose. Print extra information and running times.
[ -vv ], [ --vverbose ]
Very verbose. Like -v, but also print filtering statistics like true and
false positives (TP/FP).
[ -V ], [ --version ]
Print version information.