papers
Cyclohexanol (c1ccccc1O) is the organic compound with the formula (CH2)5CHOH. The molecule is related to cyclohexane ring by replacement of one hydrogen atom by a hydroxyl group.[4] This compound exists as a deliquescent colorless solid with a camphor-like odor, which, when very pure, melts near room temperature. Billions of kilograms are produced annually, mainly as a precursor to nylon.
Source wiki
paper list
综述
-
Drug discovery effectiveness from the standpoint of therapeutic mechanisms and indications,Hsin-Pei Shih1, Xiaodan Zhang2 and Alex M. Aronov1, 2017
-
Opportunities and obstacles for deep learning in biology and medicine. https://greenelab.github.io/deep-review/
小分子矢量化
-
Convolutional Networks on Graphs for Learning Molecular Fingerprints. David Duvenaud, Dougal Maclaurin, Jorge Aguilera-Iparraguirre Rafael G ́omez-Bombarelli, Timothy Hirzel, Al ́an Aspuru-Guzik, Ryan P. Adams. Harvard University (2015)
-
Convolutional Embedding of Attributed Molecular Graphs for Physical Property Prediction (Connor W. Coley, Regina Barzilay, William H. Green, Tommi S. Jaakkola, and Klavs F. Jensen, 2017)
-
Learning Graph Representations with Embedding Propagation, Alberto García-Durán, Mathias Niepert, (NIPS 2017)
分子相似性 Molecular Similarity
- Computer-Assisted Retrosynthesis Based on Molecular Similarity, Connor W. Coley, Luke Rogers, William H. Green, and Klavs F. Jensen, 2017
ADMET
- XenoSite: Accurately Predicting CYP-Mediated Sites of Metabolism with Neural Networks, Jed Zaretzki, Matthew Matlock, and S. Joshua Swamidass, 2013 (代谢位点,氘代)
分子能预测 (energy, potentials)
- Atom-centered symmetry functions for constructing high-dimensional neural network potentials, Jörg Behler, 2011
PESs (potential energy surface, atomic positions –> symmetric function)
-
Quantum-chemical insights from deep tensor neural networks. Kristof T. Schütt, Farhad Arbabzadah, Stefan Chmiela, Klaus R. Müller & Alexandre Tkatchenko. (2016 DTNN)
- SchNet: A continuous-filter convolutional neural network for modeling quantum interactions. K. T. Schütt, P.-J. Kindermans 1 , H. E. Sauceda, S. Chmiela, A. Tkatchenko, K.-R. Müller. (2017)
分子生成 (generating molecules)
- Generating Focussed Molecule Libraries for Drug Discovery with Recurrent Neural Networks, Marwin HS Segler, Thierry Kogej, Christian Tyrchan and Mark P Waller, 2017
In de novo drug design, computational strategies are used to generate novel molecules with good affinity to the desired biological target. In this work, we show that recurrent neural networks can be trained as generative models for molecular structures, similar to statistical language models in natural language processing. We demonstrate that the properties of the generated molecules correlate very well with the properties of the molecules used to train the model. In order to enrich libraries with molecules active towards a given biological target, we propose to fine-tune the model with small sets of molecules, which are known to be active against that target. Against Staphylococcus aureus, the model reproduced 14% of 6051 hold-out test molecules that medicinal chemists designed, whereas against Plasmodium falciparum (Malaria) it reproduced 28% of 1240 test molecules. When coupled with a scoring function, our model can perform the complete de novo drug design cycle to generate large sets of novel molecules for drug discovery.
-
ChemTS: An Efficient Python Library for de novo Molecular Generation, Xiufeng Yang, Jinzhe Zhang, Kazuki Yoshizoe, Kei Terayama, and Koji Tsuda, 2017 (RNN+MCTS)
- MDTS: automatic complex materials design using Monte Carlo tree search, Thaer M. Dieb, Shenghong Ju, Kazuki Yoshizoe, Zhufeng Hou, Junichiro Shiomi, and Koji Tsuda, 2017
蛋白质二级结构预测
-
Protein structure link
一篇介绍蛋白质结构的短篇(仅两页)
-
Predicting Backbone Calpha angles and dihedrals from protein sequences by stacked sparse auto-encoder deep neural network (2014)
早期的一篇利用神经网络模型来预测 theta 和 tau,(之前的模型都是预测 helix, sheet, coil 和 phi, psi)
-
Improving prediction of secondary structure, local backbone angles, and solvent accessible surface area of proteins by iterative deep learning (2015)
利用迭代的结构来预测二级结构
-
Capturing non-local interactions by long short term memory bidirectional recurrent neural networks for improving prediction of protein secondary structure, backbone angles, contact numbers, and solvent accessibility (2017)
利用循环神经网络来预测二级结构
- Analysis of deep learning methods for blind protein contact prediction in CASP12, Sheng Wang, Siqi Sun and Jinbo Xu, 2017
Following the CASP definition, we say two residues form a contact if in the native structure, the distance of their Cβ atoms is less than 8Å
- Prediction of amino acid side chain conformation using a deep neural network. Ke Liu , Xiangyan Sun , Jun Ma , Zhenyu Zhou , Qilin Dong , Shengwen Peng ,Junqiu Wu, Suocheng Tan, Günter Blobel, and Jie Fan. (2017, Accutar)
蛋白质矢量化
-
Predicting protein-protein interactions based only on sequences information. Shen JW, et al. Proc Natl Acad Sci U S A. 2007;104(11):4337–41. (Cojoint traid method)
将20种氨基酸分为7组,然后对序列采用宽度为3的滑窗计算频率
-
Predicting protein-protein interactions from primary protein sequences using a novel multi-scale local representation scheme and the random forest. You ZH, et al. PLoS One. 2015;10(5):e0125811
计算序列中不同位置之间的相关性 (Autocovariance method)
Continuous Distributed Representation of Biological Sequences for Deep Proteomics and Genomics. Ehsaneddin Asgari, Mohammad R. K. Mofrad. (2015) ProtVec
蛋白质+小分子亲和力 (Binding affinity)
-
Protein-Ligand Scoring with Convolutional Neural Networks (Matthew Ragoza, Joshua Hochuli, Elisa Idrobo, Jocelyn Sunseri, and David Ryan Koes, 2016)
-
Protein–protein binding affinity prediction on a diverse set of structures (Iain H. Moal, Rudi Agius and Paul A. Bates, 2011)
-
NNScore: A Neural-Network-Based Scoring Function for the Characterization of Protein-Ligand Complexes, Jacob D. Durrant and J. Andrew McCammon, 2010
-
Performance of machine-learning scoring functions in structure-based virtual screening, Maciej Wójcikowski, Pedro J. Ballester & Pawel Siedlecki, 2016
-
Deep learning with feature embedding for compound-protein interaction prediction. Fangping Wan, Jianyang (Michael) Zeng. (2016)
预测 compound-protein interaction
蛋白结合位点 (binding-site)
-
DeepSite: Protein binding site predictor using 3D-convolutional neural networks. J. Jime ́nez, S. Doerr, G. Martınez-Rosell, A. S. Rose and G. De Fabritiis
将蛋白质的 3D 数据进行体素化,从而将其看作 3D 的 image(并有多个channels);以此为输入预测每个体素是 binding-site 的概率。类似于图片的 segmentaion 模型
-
A simple method for finding a protein’s ligand-binding pockets. Seyed Majid Saberi Fathi and Jack A Tuszynski, 2014
非监督式学习
-
Unsupervised representation learning with deep convolutional generative adversarial networks
-
Seq2seq Fingerprint: An Unsupervised Deep Molecular Embedding for Drug Discovery, Zheng Xu etc. (2017)
may be usefull to chem molecules/protein datasets
-
Visualizing Data using t-SNE. Laurens van der Maaten, Geoffrey Hinton. (2008) (blog)
多任务学习/迁移学习 (Multitask Learning/transfer learning)
-
Consistent Multitask Learning with Nonlinear Output Relations, Carlo Ciliberto, Alessandro Rudi, Lorenzo Rosasco, Massimiliano Pontil, (NIPS 2017)
-
Learning Multiple Tasks with Multilinear Relationship Networks, Mingsheng Long, Zhangjie Cao, Jianmin Wang, Philip S. Yu, (NIPS 2017)
-
Label Efficient Learning of Transferable Representations across Domains and Tasks, Zelun Luo, Yuliang Zou, Judy Hoffman, Li Fei-Fei, (NIPS 2017)
-
Low Data Drug Discovery with One-shot Learning, Han Altae-Tran, Bharath Ramsundar, Aneesh S. Pappu, and Vijay Pande, (2016)
-
Active One-shot Learning, Mark Woodward, Chelsea Finn, (2017)
Graphs
-
Spectral Networks and Deep Locally Connected Networks on Graphs. Joan Bruna, Wojciech Zaremba, Arthur Szlam, Yann LeCun. (2014)
-
Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering. Michaël Defferrard Xavier Bresson Pierre Vandergheynst. (2017)
-
Learning Graph Representations with Embedding Propagation, Alberto García-Durán, Mathias Niepert, (NIPS 2017)
-
Adaptive Graph Convolutional Neural Networks. Ruoyu Li, Sheng Wang, Feiyun Zhu, Junzhou Huang. (2018)
-
Convolutional Networks on Graphs for Learning Molecular Fingerprints. David Duvenaud, Dougal Maclaurin†, Jorge Aguilera-Iparraguirre, Rafael Gomez-Bombarelli, Timothy Hirzel, Alan Aspuru-Guzik, Ryan P. Adams. (2015)
-
Semi-supervised classification with graph convolutional networks. Thomas N. Kipf, Max Welling. (2017)
-
Graph Partition
http://glaros.dtc.umn.edu/gkhome/metis/metis/publications
-
Covariant compositional networks for learning graphs. Risi Kondor, Hy Truong Son, Horace Pan, Brandon Anderson, Shubhendu Trivedi. (2018) paper