greatpy.tl.number_of_hits

greatpy.tl.number_of_hits(test, regdom)

Calculate the number of hits between a set of test genomic regions and the table describing the regulatory domains.

Parameters:
test : pd.DataFrame

DataFrame of the test peaks, with columns ["chr", "chr_start", "chr_end"].

regdom : pd.DataFrame

DataFrame of the regulatory domains, with columns ["chr", "chr_start", "chr_end", "name", "tss", "strand"].

Returns:

nb – number of hits

Return type:

int

Examples

>>> import pandas as pd
>>> from greatpy.tl import number_of_hits
>>> test = pd.DataFrame(
...     {
...         "chr": ["chr1"],
...         "chr_start": [1052028],
...         "chr_end": [1052049],
...     }
... )
>>> regdom = pd.DataFrame(
...     {
...         "chr": ["chr1", "chr1"],
...         "chr_start": [1034992, 1079306],
...         "chr_end": [1115089, 1132016],
...         "name": ["RNF223", "C1orf159"],
...         "tss": [1074306, 1116089],
...         "strand": ["-", "-"],
...     }
... )
>>> number_of_hits(test, regdom)
1
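
As a rough illustration of what a hit count of this kind involves, the sketch below counts the test regions that lie entirely within at least one regulatory domain on the same chromosome. The containment rule is an assumption made for this example, and count_hits_sketch is a hypothetical helper, not greatpy's implementation; the criterion actually used by number_of_hits may differ.

```python
import pandas as pd


def count_hits_sketch(test: pd.DataFrame, regdom: pd.DataFrame) -> int:
    """Hypothetical helper (not part of greatpy).

    Assumption: a test region is a "hit" when it is fully contained in
    at least one regulatory domain on the same chromosome.
    """
    hits = 0
    for region in test.itertuples(index=False):
        # Restrict the search to domains on the region's chromosome.
        same_chr = regdom[regdom["chr"] == region.chr]
        # A domain contains the region if it starts at or before the
        # region's start and ends at or after the region's end.
        contained = (same_chr["chr_start"] <= region.chr_start) & (
            same_chr["chr_end"] >= region.chr_end
        )
        # Each region counts once, however many domains contain it.
        if contained.any():
            hits += 1
    return hits


# Three test regions: one inside the first domain, one on a chromosome
# with no domains, and one on chr1 but outside both domains.
test = pd.DataFrame(
    {
        "chr": ["chr1", "chr2", "chr1"],
        "chr_start": [1052028, 1052028, 2000000],
        "chr_end": [1052049, 1052049, 2000100],
    }
)
regdom = pd.DataFrame(
    {
        "chr": ["chr1", "chr1"],
        "chr_start": [1034992, 1079306],
        "chr_end": [1115089, 1132016],
        "name": ["RNF223", "C1orf159"],
        "tss": [1074306, 1116089],
        "strand": ["-", "-"],
    }
)

print(count_hits_sketch(test, regdom))  # prints 1
```

Under this assumed rule only the first region counts: the second has no domain on its chromosome, and the third starts beyond the end of both chr1 domains.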