top of page



Ying Liu
Jan 31, 20243 min read
Artificial Intelligence regulations in the UK, US and EU
The recent international AI safety Summit was hosted in UK, Nov 2023. There are governments from 28 countries, 42 industries and 50...
11 views
0 comments

Ying Liu
Jan 22, 20243 min read
Structured Probabilistic Models, Sec 3
This is a summary of Chapter 16.2 in "Deep Learning" by Ian Goodfellow, Yoshua Bengio, Aaron Courvilley. In a graphical model, the edges...
10 views
0 comments


Ying Liu
Jan 15, 20244 min read
A summary of research paper about Protein generation using EvoDiff
The original paper can be found here. All figures are from this paper as well. In this paper, the authors introduced a new protein...
8 views
0 comments


Ying Liu
Jan 4, 20242 min read
Structured Probabilistic Models, Sec 2
This is a summary of Chapter 16.2 in "Deep Learning" by Ian Goodfellow, Yoshua Bengio, Aaron Courvilley. Undirected models Undirected...
15 views
0 comments


Ying Liu
Jan 2, 20243 min read
Structured Probabilistic Models, Sec 1
This is a summary of Chapter 16.1 and part of 16.2 in "Deep Learning" by Ian Goodfellow, Yoshua Bengio, Aaron Courvilley. What is a...
16 views
0 comments

Ying Liu
Dec 28, 20233 min read
NLP 6: Natural Language Processing (NLP) - History and State-of-the-Art
The Natural Language Processing (NLP) might have a long history which trace back to the start of Machine learning which is early 20th...
65 views
0 comments
Ying Liu
Dec 26, 20232 min read
NLP 5: Pretraining and Transfer Learning
This is a summary of chapter 25.5 in "Artificial intelligence: A Modern Approach". It would be very difficult to build a robust language...
4 views
0 comments


Ying Liu
Dec 21, 20235 min read
Solution to Kaggle competition - Pfam seed random split
The detailed python notebook can be found here. Problem Setup Classify sample of protein sequence to a domain family_accession label....
15 views
0 comments


Ying Liu
Dec 21, 20232 min read
What is Pfam database?
Pfam is a protein domain family database. In the database, each sample is a segmentation of protein, domain. Based on the...
3 views
0 comments


Ying Liu
Dec 14, 20233 min read
NLP 4: Transformer and self-attention
This post is a summary of chapter 25.4 in this book. The figures are from the book. Self-attention Self-attention is part...
10 views
0 comments


Ying Liu
Dec 14, 20231 min read
A summary of research paper in Unified Representation (UniRep) of Protein
The original paper can be found here. All figures and tables are from paper. What is UniRep? In this paper, the author applied deep...
4 views
0 comments


Ying Liu
Dec 12, 20231 min read
A summary on research paper of AlphaFold protein model structure search
The original paper can be found here. All figures and tables are directly from the paper. AlphaFold AlphaFold is a structure prediction...
2 views
0 comments


Ying Liu
Dec 11, 20232 min read
A summary of research paper on protein sequences segmentation
The research paper can be found here. In this paper, the authors introduced a protein sequences segmentation methods called peptide-pair...
2 views
0 comments


Ying Liu
Dec 9, 20232 min read
A summary of research paper on membrane protein classification
Original paper can be found here. Input features There are two main features about protein which were used for the classification. One is...
3 views
0 comments


Ying Liu
Dec 8, 20232 min read
A summary of review paper Biological Sequence Classification
The original paper can be found here. All figures and tables are from paper. Introduction Sequence-based prediction and analysis are the...
2 views
0 comments

Ying Liu
Dec 7, 20233 min read
NLP 3: Sequence-to-Sequence Model and Attention in Natural Language Processing (NLP)
What is sequence-to-sequence model One of the application of Natural Language Processing (NLP) is to generate one sentence based on...
14 views
0 comments


Ying Liu
Dec 6, 20233 min read
NLP 2: Recurrent Neural Networks (RNNs) for Natural Language Processing (NLP)
Advantage of RNN over fixed window In a simple task like Part-of-Speech (POC) tagging, a fixed-size window such as n-gram would offer...
25 views
0 comments


Ying Liu
Dec 4, 20233 min read
NLP 1: Word Embedding in Natural Language Processing (NLP)
Feedforward part of speech tagging model using n-grams word embedding.
27 views
0 comments
bottom of page