Abstract
This paper used an amino acid location-based sequence encoding as a feature extraction techniques to identify single chains antibody molecules that bind to B-lymphocyte stimulator (BLyS) antigen. The data were manually derived from the European patent (EP2275449B1) text. The dataset was cleaned and made suitable for the machine learning models. The accuracy, precision and recall achieved across individual descriptors (Membrane and Soluble) for Logistic regression, KNN, KSVM, and Random Forest Tree was above 80%. However, it was much lower for the Naïve Bayes except for the precision score. The promising accuracy value achieved from such a minimal dataset has significant implications for the drug discovery process – this includes considerable savings in time and resources.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of ICICM 2020 - 2020 10th International Conference on Information Communication and Management, Worshop |
| Subtitle of host publication | ICKET 2020 - 2020 9th International Conference on Knowledge and Education Technology |
| Place of Publication | Paris, France |
| Publisher | Association for Computing Machinery |
| Pages | 20-24 |
| Number of pages | 5 |
| ISBN (Electronic) | 978-1-4503-8770-5 |
| ISBN (Print) | 978-1-4503-8770-5 |
| DOIs | |
| Publication status | Published - 28 Jun 2020 |
Publication series
| Name | ACM International Conference Proceeding Series |
|---|
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 3 Good Health and Well-being
Keywords
- Amino acid sequence
- Antibody
- Antigen
- Infectious disease
- Machine learning
Fingerprint
Dive into the research topics of 'A machine learning approach for predicting Antibody Properties'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver