Features used in machine learning to predict rice protein-protein interactions
Feature | Protein pair | Coverage(1) | Predicted protein interaction | Coverage(2) |
---|---|---|---|---|
Structural similarity | 121,410,081 | 14.72% | 486,238 | 68.60% |
Structural distance | 121,410,081 | 14.72% | 486,238 | 68.60% |
Preserved interface size | 97,755,074 | 11.85% | 441,450 | 62.28% |
Fraction of preserved interface | 97,755,074 | 11.85% | 441,450 | 62.28% |
Biological process ontology | 129,468,231 | 15.70% | 543,838 | 76.72% |
Molecular function ontology | 188,363,222 | 22.84% | 533,153 | 75.22% |
Cellular component ontology | 71,682,505 | 8.69% | 388,407 | 54.80% |
Gene coexpression | 797,861,431 | 96.75% | 703,595 | 99.26% |
Phylogenetic profile | 824,646,966 | 100.00% | 708,819 | 100.00% |
Interolog | 881,560 | 0.11% | 443,542 | 62.57% |
Rosetta stone | 1,503,600 | 0.18% | 3,627 | 0.51% |
Coverage(2): the number of predicted protein-protein interactions with an available given feature was divided by the total number of predicted interactions.