pybedtools.contrib.venn_maker.cleaned_intersect¶
- pybedtools.contrib.venn_maker.cleaned_intersect(items)[source]¶
Perform interval intersections such that the end products have identical features for overlapping intervals.
The VennDiagram package does set intersection, not interval intersection. So the goal here is to represent intersecting intervals as intersecting sets of strings.
Doing a simple BEDTools intersectBed call doesn’t do the trick (even with the -u argument). As a concrete example, what would the string be for an intersection of the feature “chr1:1-100” in file
xand “chr1:50-200” in filey?The method used here is to substitute the intervals in
ythat overlapxwith the corresponding elements inx. This means that in the resulting sets, the overlapping features are identical. To follow up with the example, bothxandywould have an item “chr1:50-200” in their sets, simply indicating that one interval overlapped.Venn diagrams are not well suited for nested overlaps or multi-overlaps. To illustrate, try drawing the 2-way Venn diagram of the following two files. Specifically, what number goes in the middle – the number of features in
xthat intersecty(1) or the number of features inythat intersectx(2)?:x: chr1 1 100 chr1 500 6000 y: chr1 50 100 chr1 80 200 chr9 777 888
In this case, this function will return the following sets:
x: chr1:1-100 chr1:500-6000 y: chr1:1-100 chr9:777-888
This means that while
xdoes not change in length,ycan. For example, if there are 2 features inxthat overlap one feature iny, thenywill gain those two features in place of its single original feature.This strategy is extended for multiple intersections – see the source for details.