bioin.motif.profile_probable_motifs

bioin.motif.profile_probable_motifs(motif_profile, dna)[source]

Find the profile-most probable k-mers in each string from dna. Without pseudocount.

Parameters:
  • motif_profile (dict) – a profile matrix, Profile.
  • dna (list) – a list of strings DNA.
Returns:

List, a list of the Profile-most probable k-mers in each string from DNA.

Examples

>>> motif_profile = {'A': [0.8, 0.0, 0.0, 0.2], 'C': [0.0, 0.6, 0.2, 0.0], 'G': [0.2, 0.2, 0.8, 0.0], 'T': [0.2, 0.2, 0.0, 0.8]}
>>> dna = ['TTACCTTAAC', 'GATGTCTGTC', 'ACGGCGTTAG', 'CCCTAACGAG', 'CGTCAGAGGT']
>>> kmers = profile_probable_motifs(motif_profile, dna)
>>> kmers
    ['ACCT', 'ATGT', 'GCGT', 'ACGA', 'AGGT']