Genbank Class
Genbank
Genbank(
gbk_source: str | Path | TextIOWrapper,
name: str | None = None,
reverse: bool = False,
min_range: int | None = None,
max_range: int | None = None,
)
Genbank Parser Class
PARAMETER | DESCRIPTION |
---|---|
gbk_source |
Genbank file or source
(
TYPE:
|
name |
name (If None,
TYPE:
|
reverse |
If True, reverse complement genome is used
TYPE:
|
min_range |
Min range to be extracted (Default:
TYPE:
|
max_range |
Max range to be extracted (Default:
TYPE:
|
calc_gc_skew
calc_gc_skew(
window_size: int | None = None,
step_size: int | None = None,
*,
seq: str | None = None
) -> tuple[np.ndarray, np.ndarray]
Calculate GC skew in sliding window
PARAMETER | DESCRIPTION |
---|---|
window_size |
Window size (Default:
TYPE:
|
step_size |
Step size (Default:
TYPE:
|
seq |
Sequence for GCskew calculation (Default:
TYPE:
|
RETURNS | DESCRIPTION |
---|---|
gc_skew_result_tuple
|
Position list & GC skew list
TYPE:
|
calc_gc_content
calc_gc_content(
window_size: int | None = None,
step_size: int | None = None,
*,
seq: str | None = None
) -> tuple[np.ndarray, np.ndarray]
Calculate GC content in sliding window
PARAMETER | DESCRIPTION |
---|---|
window_size |
Window size (Default:
TYPE:
|
step_size |
Step size (Default:
TYPE:
|
seq |
Sequence for GCskew calculation (Default:
TYPE:
|
RETURNS | DESCRIPTION |
---|---|
gc_content_result_tuple
|
Position list & GC content list
TYPE:
|
get_seqid2seq
Get seqid & complete/contig/scaffold genome sequence dict
RETURNS | DESCRIPTION |
---|---|
seqid2seq
|
seqid & genome sequence dict
TYPE:
|
get_seqid2size
Get seqid & complete/contig/scaffold genome size dict
RETURNS | DESCRIPTION |
---|---|
seqid2size
|
seqid & genome size dict
TYPE:
|
get_seqid2features
get_seqid2features(
feature_type: str | None = "CDS",
target_strand: int | None = None,
pseudogene: bool | None = False,
) -> dict[str, list[SeqFeature]]
Get seqid & features in target seqid genome dict
PARAMETER | DESCRIPTION |
---|---|
feature_type |
Feature type (
TYPE:
|
target_strand |
Extract target strand. If None, extract regardless of strand.
TYPE:
|
pseudogene |
If True,
TYPE:
|
RETURNS | DESCRIPTION |
---|---|
seqid2features
|
seqid & features dict
TYPE:
|
extract_features
extract_features(
feature_type: str = "CDS",
target_strand: int | None = None,
fix_position: bool = False,
allow_partial: bool = False,
pseudogene: bool = False,
) -> list[SeqFeature]
Extract features within min-max range
PARAMETER | DESCRIPTION |
---|---|
feature_type |
Extract feature type
TYPE:
|
target_strand |
Extract target strand
TYPE:
|
fix_position |
If True, fix feature start & end position by specified min_range parameter (fixed_start = start - min_range, fixed_end = end - min_range)
TYPE:
|
allow_partial |
If True, allow extraction of features that are partially included in range
TYPE:
|
pseudogene |
If True and
TYPE:
|
RETURNS | DESCRIPTION |
---|---|
features
|
Extracted features
TYPE:
|
write_cds_fasta
write_cds_fasta(
fasta_outfile: str | Path,
seqtype: str = "protein",
fix_position: bool = False,
allow_partial: bool = False,
)
Write CDS protein features fasta file
PARAMETER | DESCRIPTION |
---|---|
fasta_outfile |
CDS fasta file
TYPE:
|
seqtype |
Sequence type (
TYPE:
|
fix_position |
If True, fix feature start & end position by specified min_range parameter (fixed_start = start - min_range, fixed_end = end - min_range)
TYPE:
|
allow_partial |
If True, features that are partially included in range are also extracted
TYPE:
|