Home • Key Features • List of files • Dependencies • Installing • How To Use • Citation
Descriptors calculated by MathFeature for DNA, RNA, and Protein sequences.
| Descriptor groups | Descriptor | Dimension | Sequence | Example (Study with Application or Theory) |
|---|---|---|---|---|
| Binary | L * 4 | DNA/RNA | Ref 1 - Ref 2 | |
| Z-curve | L * 3 | DNA/RNA | Ref 1 - Ref 2 | |
| Real | L | DNA/RNA | Ref 1 - Ref 2 | |
| Numerical Mapping | Integer | L | DNA/RNA/Protein | Ref 1 - Ref 2 |
| EIIP | L | DNA/RNA/Protein | Ref 1 - Ref 2 | |
| Complex Number | L | DNA/RNA | Ref 1 - Ref 2 | |
| Atomic Number | L | DNA/RNA | Ref 1 - Ref 2 | |
| Chaos Game Representation | L * 2 | DNA/RNA | Ref 1 | |
| Chaos Game | Frequency Chaos Game Representation | L - k + 1 | DNA/RNA | |
| Chaos Game Signal (with Fourier) | 19 | DNA/RNA | Ref 1 | |
| Fourier Transform | Numerical Mapping + Fourier | 19 | DNA/RNA/Protein | Ref 1 |
| Entropy | Shannon | k | DNA/RNA/Protein | Ref 1 |
| Tsallis | k | DNA/RNA/Protein | Ref 1 | |
| Graphs | Complex Networks (with threshold) | 12 * t | DNA/RNA/Protein | Ref 1 - Ref 2 |
| Complex Networks (without threshold - v2) | 27 * k | DNA/RNA/Protein | Ref 1 - Ref 2 | |
| Basic k-mer | 4^k | DNA/RNA | Ref 1 | |
| Customizable k-mer | 4^k | DNA/RNA | ||
| Nucleic acid composition (NAC) | 4 | DNA/RNA | Ref 1 | |
| Di-nucleotide composition (DNC) | 16 | DNA/RNA | Ref 1 | |
| Tri-nucleotide composition (TNC) | 64 | DNA/RNA | Ref 1 | |
| ORF Features or Coding Features | 10 | DNA/RNA | Ref 1 - Ref 2 | |
| Fickett score | 2 | DNA/RNA | Ref 1 | |
| Pseudo K-tuple nucleotide composition | - | DNA/RNA | Ref 1 | |
| Other techniques | Accumulated Nucleotide Frequency-ANF | L | DNA/RNA/Protein | Ref 1 |
| ANF with Fourier | 19 | DNA/RNA/Protein | ||
| Xmer k-Spaced Ymer Composition Frequency (kGap) | 4^X * 4^Y or 20^X * 20^Y | DNA/RNA/Protein | Ref 1 - Ref 2 | |
| Amino acid composition (AAC) | 20 | Protein | Ref 1 | |
| Dipeptide composition (DPC) | 400 | Protein | Ref 1 | |
| Tripeptide composition (TPC) | 8000 | Protein | Ref 1 | |
| Basic k-mer | 20^k | Protein | ||
| Customizable k-mer | 20^k | Protein | ||
| Kmer Frequency Mapping | L - k + 1 | Protein | ||
| Kmer Frequency Mapping with Fourier | 19 | Protein |
To use any descriptor, see our documentation.
Note 1: L = length of the longest sequence.
Note 2: k = frequencies of k-mer.
Note 3: t = threshold: number of subgraphs.
Note 4: The reference column represents some studies that apply the descriptor (Similar approach). Other references are cited in our article.
