greatpy.tl.len_regdom

greatpy.tl.len_regdom(regdom)

Calculate, for each gene name, the size of its regulatory region in the genome.

Parameters:
regdom : pd.DataFrame

DataFrame of regulatory domains with the columns ["chr", "chr_start", "chr_end", "name", "tss", "strand"].

Returns:

Dictionary in which each key is a gene name from regdom and each value is the size of the regulatory region for that gene.

Return type:

dict

Examples

>>> import pandas as pd
>>> from greatpy.tl import len_regdom
>>> regdom = pd.DataFrame(
...     {
...         "chr": ["chr1", "chr1"],
...         "chr_start": [1034992, 1079306],
...         "chr_end": [1115089, 1132016],
...         "name": ["RNF223", "C1orf159"],
...         "tss": [1074306, 1116089],
...         "strand": ["-", "-"],
...     }
... )
>>> len_regdom(regdom)
{'RNF223': 80097, 'C1orf159': 52710}
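The values in the example output match chr_end minus chr_start for each row (1115089 - 1034992 = 80097 and 1132016 - 1079306 = 52710). The sketch below reproduces that arithmetic with plain pandas. It is an illustration only, not greatpy's implementation: the helper name `regdom_sizes` is hypothetical, and the assumption that the size is always chr_end - chr_start is inferred from this one example.

```python
import pandas as pd


def regdom_sizes(regdom: pd.DataFrame) -> dict:
    """Hypothetical helper: map each gene name to chr_end - chr_start.

    Assumption: the size of a regulatory region is the difference between
    its end and start coordinates, as the documented example suggests.
    """
    sizes = regdom["chr_end"] - regdom["chr_start"]
    # .tolist() converts NumPy integers to plain Python ints.
    return dict(zip(regdom["name"], sizes.tolist()))


regdom = pd.DataFrame(
    {
        "chr": ["chr1", "chr1"],
        "chr_start": [1034992, 1079306],
        "chr_end": [1115089, 1132016],
        "name": ["RNF223", "C1orf159"],
        "tss": [1074306, 1116089],
        "strand": ["-", "-"],
    }
)

print(regdom_sizes(regdom))  # {'RNF223': 80097, 'C1orf159': 52710}
```

If a gene name appeared in more than one row, this sketch would keep only the last row's size; how len_regdom treats such duplicates is not stated here.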