Features used in machine learning to predict Arabidopsis protein-protein interactions
| Feature | Protein pair | Coverage(1) | Predicted protein interaction | Coverage(2) |
|---|---|---|---|---|
| Structural similarity | 39,439,520 | 11.48% | 116,077 | 36.65% |
| Structural distance | 39,439,520 | 11.48% | 116,077 | 36.65% |
| Preserved interface size | 31,625,234 | 9.21% | 100,964 | 31.88% |
| Fraction of preserved interface | 31,625,234 | 9.21% | 100,964 | 31.88% |
| Biological process ontology | 210,586,502 | 61.32% | 299,301 | 94.49% |
| Molecular function ontology | 211,428,752 | 61.57% | 294,459 | 92.96% |
| Cellular component ontology | 39,867,985 | 11.61% | 182,806 | 57.71% |
| Gene coexpression | 204,353,436 | 59.51% | 266,063 | 84.00% |
| Phylogenetic profile | 343,416,528 | 100.00% | 316,747 | 100.00% |
| Interolog | 679,718 | 0.20% | 45,621 | 14.40% |
| Rosetta stone | 979,649 | 0.29% | 1,205 | 0.38% |
Coverage(2): the number of predicted protein-protein interactions with an available given feature was divided by the total number of predicted interactions.
AraPPINet - Protein-Protein Interaction Network for Arabidopsis
