2023: Notable innovations: a large language model (ProGen) that could generate functional protein sequences with predictable function, taking as input tags that specify protein properties.
In linguistics, a corpus is a collection of linguistic data used for research, scholarship, and teaching. Even though different corpora exist to train protein language models, the correct interpretation of the generated sequences remains a challenge. Protein evolution differs from language evolution: it is subject to randomness and environmental pressure, so its grammar unavoidably contains many irregularities. Finally, the language of proteins is vast, because it needs to cover millions of species on Earth, which necessitates studying the general properties of proteins rather than the proteins of a particular species.
While the dissimilarities between human and protein languages present significant challenges for applying natural language processing (NLP) to protein design, the apparent connections between the two fields offer a new perspective in protein research, opening the way to adapting NLP models to protein modeling and design. Machine learning (ML) methods have a long-standing history in NLP, and given the similarities between natural and protein languages (Ofer et al., 2021), NLP methods have been transferred and adapted to protein design and modeling.
Indeed, as far back as the 1990s, “shallow” ML methods such as hidden Markov models and support vector machines were applied in both NLP and computational biology (Krogh et al., 1994; Zhou and Su, 2002). A hidden Markov model (HMM) is a Markov model in which the observations depend on a latent (or "hidden") Markov process, referred to as X. An HMM requires an observable process Y whose outcomes depend on the outcomes of X in a known way. Since X cannot be observed directly, the goal is to learn about the state of X by observing Y. A support vector machine (SVM) is a powerful machine learning algorithm used for linear or nonlinear classification, regression (modeling the relationship between variables), and even outlier detection. SVMs can be used for a variety of tasks, such as text classification, image classification, spam detection, handwriting identification, gene expression analysis, face detection, and anomaly detection.
Then the application of shallow neural networks to word representation learning (Mikolov et al., 2013) and, more importantly, the advent of deep learning methods brought significant advances in NLP and in protein modeling (Collobert and Weston, 2008; Manning, 2015; Hou et al., 2017). In particular, recurrent neural networks (RNNs) displayed excellent performance because of their ability to learn long-range relationships between words as well as between amino acids, which proved essential both for global text comprehension and for detecting long-range distal contacts in proteins (Socher et al., 2011; Krause et al., 2017). In medicine, distal refers to a part of the body that is farther from the center of the body than another part.
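To make the hidden Markov model described above more concrete, here is a minimal Python sketch. It is an illustration only: the two hidden states ("helix" and "coil"), the three-letter amino acid alphabet, and every probability are invented for the example and do not come from any of the cited works. The hidden process X is the sequence of states, the observable process Y is the sequence of letters, and the two functions show two common ways of learning about X from Y.

```python
# Toy hidden Markov model. All states, symbols, and probabilities are
# made-up illustration values (assumptions), not data from the cited papers.

states = ("helix", "coil")                      # hidden process X
start_p = {"helix": 0.5, "coil": 0.5}           # P(first hidden state)
trans_p = {                                     # P(next state | current state)
    "helix": {"helix": 0.8, "coil": 0.2},
    "coil": {"helix": 0.3, "coil": 0.7},
}
emit_p = {                                      # P(observed letter | hidden state)
    "helix": {"A": 0.6, "G": 0.1, "L": 0.3},
    "coil": {"A": 0.2, "G": 0.6, "L": 0.2},
}


def forward(observations):
    """Forward algorithm: return P(observations) and the probability of
    each hidden state at the last position, given all observations."""
    alpha = {s: start_p[s] * emit_p[s][observations[0]] for s in states}
    for symbol in observations[1:]:
        alpha = {
            s: emit_p[s][symbol]
            * sum(alpha[prev] * trans_p[prev][s] for prev in states)
            for s in states
        }
    likelihood = sum(alpha.values())
    posterior = {s: alpha[s] / likelihood for s in states}
    return likelihood, posterior


def viterbi(observations):
    """Viterbi algorithm: return the single most probable hidden path."""
    best = {s: (start_p[s] * emit_p[s][observations[0]], [s]) for s in states}
    for symbol in observations[1:]:
        new_best = {}
        for s in states:
            # Pick the best predecessor state for reaching state s.
            prob, path = max(
                ((best[prev][0] * trans_p[prev][s], best[prev][1]) for prev in states),
                key=lambda item: item[0],
            )
            new_best[s] = (prob * emit_p[s][symbol], path + [s])
        best = new_best
    return max(best.values(), key=lambda item: item[0])[1]


if __name__ == "__main__":
    sequence = "AALGG"                          # observable process Y
    likelihood, posterior = forward(sequence)
    print("P(sequence):", likelihood)
    print("Last-state probabilities:", posterior)
    print("Most probable hidden path:", viterbi(sequence))
```

The forward pass sums over all hidden paths, while the Viterbi pass keeps only the best one. Real profile HMMs for protein families use many more states and the full 20-letter amino acid alphabet, with parameters estimated from sequence alignments rather than chosen by hand.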
Resources APP Composition
[Appstore Playstore]
Video Maker
PowerDirector
HD Screen Recorder
RECX: Screen Recorder/ Pk master
Picture Maker
Social Media Post Maker stylish app world Art & Design
In-text voice
[aiReader: AI Text to Speech]
[TTS Reader - Text To Speech withtheflow01]
MP3 volume-increase conversion
[MP3 Audio Gain and Equalizer]
[Super Sound Editor: Music Audio Editor, MP3 Cutter]
Music Sources and Titles: Pixabay
[The sound and music used in the content of the “In-Brief Archives Facebook Page” and of my blogger page “www.ilovemytimeoranothertimeofyours.blogspot.com” do not represent the picture, video, and text contents.] [Where the music volume differs from the original files, it has been increased.]
[energetic-upbeat-stylish-pop-fashion-136514]
[epic-cinematic-trailer-113981]
Picture sources: Peakpx.com and Pexels, Pixabay in PowerDirector and other websites:
1:https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210728081846913-0360:9781316884485:18550fig5_2.png?pub-status=live
2:https://www.researchgate.net/publication/385012837/figure/fig1/AS:11431281284348153@1729233092741/A-timeline-of-representative-MLLMs.jpg
3:https://medium.com/yogsblog/supercomputer-c11bbb804bf8
4:https://www.livescience.com/technology/computing/top-most-powerful-supercomputers
5:http://www.m-s-c.co.uk/protectthespecies/img/demo/world1.jpg
6:https://www.psychologs.com/wp-content/uploads/2024/02/Comparative-Psychology-Exploring-the-Behaviour-Across-all-Species-768x415.jpg
7:https://www.pinterest.com/pin/536632111835192643/
8:https://www.pinterest.com/pin/4151824651906151/
9:https://m.media-amazon.com/images/I/61eib1KQrtL._AC_UF1000,1000_QL80_FMwebp_.jpg
10:https://m.media-amazon.com/images/I/61HIU801lSL._AC_UF1000,1000_QL80_FMwebp_.jpg
11:https://static.wixstatic.com/media/f147a7_c2c4e887bb07435a8231672ea4c004a1~mv2.webp/v1/fill/w_1000,h_571,al_c,q_85,usm_0.66_1.00_0.01/f147a7_c2c4e887bb07435a8231672ea4c004a1~mv2.webp
12:https://www.techleagues.com/wp-content/uploads/2024/12/31919639-3428-4017-8a19-f1961ffc0275-768x768.webp
13:https://www.geeksforgeeks.org/nlp-techniques/
14:https://cdn.slidesharecdn.com/ss_thumbnails/applying-hidden-markov-models-to-bioinformatics2018-thumbnail.jpg?width=640&height=640&fit=bounds
15:https://miro.medium.com/v2/resize:fit:720/format:webp/0*-AuHL_OpNJgREmu4
16:https://link.springer.com/book/10.1007/978-3-030-99142-5
17:https://link.springer.com/book/10.1007/978-981-19-6553-1
18:https://m.media-amazon.com/images/I/71a6fB67hCL._AC_UF1000,1000_QL80_FMwebp_.jpg
19:https://www.packtpub.com/en-us/product/machine-learning-a-z-support-vector-machine-with-python-9781801071833
20:https://media.springernature.com/lw1200/springer-static/image/art%3A10.1007%2Fs11042-022-13428-4/MediaObjects/11042_2022_13428_Fig3_HTML.png
21:https://miro.medium.com/v2/resize:fit:1073/1*zbvw14XaCpyXIwfBYZooxw.jpeg
22:https://www.researchgate.net/profile/Dongkwon-Han/publication/346219836/figure/fig1/AS:980115399913472@1610689129209/Schematic-of-shallow-neural-network-and-deep-neural-network.ppm
23:https://mriquestions.com/shallow-v-deep-ml.html
24:https://images.saymedia-content.com/.image/c_limit%2Ccs_srgb%2Cq_auto:eco%2Cw_700/MTc0NDc5Mzc5NTcxNjgwNjE2/deep-learning-vs-machine-learning.webp
25:https://www.researchgate.net/profile/Tomer-Toledo/publication/245563174/figure/fig1/AS:669081116094471@1536532777801/State-transition-diagram-of-a-hidden-Markov-model.png
26:https://media.springernature.com/lw685/springer-static/image/art%3A10.1007%2Fs00362-024-01608-3/MediaObjects/362_2024_1608_Fig1_HTML.png
27:https://d3o9vfi90r1966.cloudfront.net/seminar-image/seminar-5695-1523970407.jpg
28:https://i0.wp.com/copyassignment.com/wp-content/uploads/2022/08/Support-Vector-MachineSVM-in-Machine-Learning.jpg?fit=1600%2C1200&ssl=1
29:https://miro.medium.com/v2/resize:fit:720/format:webp/1*myJAQOLNWdeNCJvTWnNY5Q.png
30:https://www.theknowledgeacademy.com/blog/correlation-vs-regression/
31:https://cdn.prod.website-files.com/670cbf146221ee06c3cdd761/670cbf146221ee06c3cde759_Correlation%20vs%20Regression.webp
32:https://media.geeksforgeeks.org/wp-content/uploads/20240507122514/what-is-Outlier-Detection-768.webp
33:https://academichelp.net/wp-content/uploads/2024/03/Photomath-Integral.jpg
34:https://www.researchgate.net/publication/364349498/figure/fig2/AS:11431281107484502@1671073089804/A-schematic-of-neural-network-architecture-for-a-a-shallow-neural-network-with-just-one.png
35:https://www.pond5.com/stock-footage/item/233792169-deep-learning-concept-over-glitch-neural-network-background
36:https://www.pond5.com/stock-footage/item/165598077-nlp-animated-word-cloudanimation-text-design-kinetic-typogra
37:https://pythongeeks.org/wp-content/uploads/2022/02/ml-rnn-1200x675.webp
Video Sources: Pexels and Pixabay in PowerDirector and other websites:
38:https://www.salesforce.com/blog/wp-content/uploads/sites/2/2021/07/lysozyme_demo_lowres.gif
39:https://www.pond5.com/stock-footage/item/252949377-augmented-reality-strand-dna-infographics
40:https://www.pond5.com/stock-footage/item/266976034-animation-data-processing-over-dna-strand-purple-background
41:https://www.pond5.com/stock-footage/item/305726001-dna-strands-and-world-map-data-points-scientific-data-proces
42:https://www.pond5.com/stock-footage/item/250221411-augmented-reality-strand-dna-infographics
43:https://www.pond5.com/stock-footage/item/304643259-genetic-research-dna-strands-evolutionary-genome-code-mappin
44:https://www.pond5.com/stock-footage/item/150716643-medical-research-analyzing-rotating-dna-strand-data-processi
45:https://www.pond5.com/stock-footage/item/157848513-digital-shield-goes-through-dna-strand-and-gathers-data-mode
46:https://www.pond5.com/stock-footage/item/65773239-blue-dna-strand-rotating-screen-forensic-dna-analysis-geneti
47:https://www.pond5.com/stock-footage/item/73532819-technology-interface-computer-data-digital-screen
48:https://www.pond5.com/stock-footage/item/73551190-technology-interface-computer-data-digital-screen
49:https://www.pond5.com/stock-footage/item/129701419-technology-interface-computer-data-digital-screen
50:https://www.pond5.com/stock-footage/item/150005469-tv-broadcast-news-studio-video-control-room-screens-alpha-ch
51:https://www.pond5.com/stock-footage/item/80123448-global-computer-network-software-source-code-and-program-dat
52:https://www.pond5.com/stock-footage/item/60966082-video-big-data-network
53:https://www.pond5.com/stock-footage/item/161745193-moving-shot-dark-interior-big-data-center-working-equipment
54:https://www.pond5.com/stock-footage/item/161745204-closeup-view-working-equipment-and-technical-systems-dark-in
55:https://www.pond5.com/stock-footage/item/156906725-network-and-data-powerful-servers-behind-glass-panels-server
56:https://www.pond5.com/stock-footage/item/129450766-network-and-data-powerful-servers-behind-glass-panels-server
57:https://www.pond5.com/stock-footage/item/164104417-modern-interior-server-room-data-center-cloud-computing-data
58:https://www.pond5.com/stock-footage/item/164104427-modern-interior-server-room-data-center-cloud-computing-data
59:https://www.pond5.com/stock-footage/item/164104447-modern-interior-server-room-data-center-cloud-computing-data
60:https://www.pond5.com/stock-footage/item/91577038-moving-slowly-through-server-room-datacenter
61:https://www.pond5.com/stock-footage/item/92169716-moving-slow-between-server-racks-datacenter
62:https://www.pond5.com/stock-footage/item/90341054-concept-cloud-data-center-hosting-scheme-loop
63:https://www.pond5.com/stock-footage/item/100290747-walking-through-server-room
64:https://www.pond5.com/stock-footage/item/97876826-server-room-data-network-center-ethernet-cable-server-room
65:https://www.pond5.com/stock-footage/item/94812015-server-room-interior-datacenter
66:https://www.pond5.com/stock-footage/item/67613014-servers-close-8k-uhd-loop-modern-datacenter-cloud-computing
67:https://www.pond5.com/stock-footage/item/81396884-modern-datacenter-cloud-computing-concept-servers-racks-data
68:https://www.pond5.com/stock-footage/item/280975622-nlp-natural-language-processing-ai-artificial-intelligence
69:https://www.pond5.com/stock-footage/item/280965289-nlp-natural-language-processing-ai-artificial-intelligence
70:https://www.pond5.com/stock-footage/item/280512986-mlp-futuristic-robot-artificial-intelligence-enlightening-ai
71:https://www.pond5.com/stock-footage/item/246000111-neural-network-model-concept
72:https://www.pond5.com/stock-footage/item/276203640-llms-large-language-models-and-neural-networks-ai-deep-learn
73:https://www.pond5.com/stock-footage/item/276203825-deep-learning-neural-networks-machine-learning-ai-llms
74:https://www.pond5.com/stock-footage/item/268416546-simulation-animation-neural-network-large-language-artificia
75:https://www.pond5.com/stock-footage/item/275317636-llms-large-language-models-natural-language-processing-ai
76:https://www.pond5.com/stock-footage/item/277864313-llms-large-language-models-text-speech-natural-language-proc
77:https://miro.medium.com/v2/resize:fit:1358/0*ABgQRoSOxyHksOEo.gif
78:https://www.pond5.com/stock-footage/item/275316292-neural-networks-deep-learning-natural-language-processing-ai
79:https://www.pond5.com/stock-footage/item/268416445-simulation-animation-partial-neural-network-large-language
80:https://www.pond5.com/stock-footage/item/258573734-artificial-intelligence-deep-learning-simulation-zoom-out
81:https://www.pond5.com/stock-footage/item/258649414-ai-neural-network-concept-chatbot-artificial-intelligence-de
82:https://www.pond5.com/stock-footage/item/259008550-artificial-intelligence-and-machine-vision-deep-learning-lar
83:https://www.pond5.com/stock-footage/item/259180618-3d-animation-neural-network-concept-chatbot-deep-learning-ar
84:https://www.pond5.com/stock-footage/item/84730407-neural-networks-loop
85:https://www.pond5.com/stock-footage/item/247431758-neural-network-concept
86:https://www.pond5.com/stock-footage/item/246907209-neural-network-model-concept
87:https://www.pond5.com/stock-footage/item/288906002-llm-artificial-intelligence-text-word-data-training-algorith
88:https://www.pond5.com/stock-footage/item/275316933-llms-large-language-models-and-neural-networks-ai-deep-learn
89:https://www.pond5.com/stock-footage/item/255098276-chatgpt-revolutionizing-conversational-ai-and-natural-langua
90:https://www.pond5.com/stock-footage/item/132507475-hud-elements-display-fingerprint-scanning-and-person-identif
91:https://www.pond5.com/stock-footage/item/105174359-analyzing-dna-core-data-genetic-engineering-forensic-disorde
92:https://www.pond5.com/stock-footage/item/247524414-visualization-dna-interface-molecular-analysis-software-rese
93:https://www.pond5.com/stock-footage/item/296115402-anomaly-detection-3d-title-animation-planet-earth-background
94:https://www.pond5.com/stock-footage/item/271357328-surveillance-camera-artificial-intelligence-recognizes-and-i
95:https://www.pond5.com/stock-footage/item/268678357-neural-network-exploring-artificial-intelligence-ai-innovati
96:https://www.pond5.com/stock-footage/item/171094265-artificial-neural-network-nodes-solving-ai-artificial-intell
97:https://www.pond5.com/stock-footage/item/171092610-neural-networks-simulated-machine-deep-learning-ai-artificia
98:https://www.pond5.com/stock-footage/item/270424559-natural-language-processing-algorithm-and-neural-network-dee
99:https://www.pond5.com/stock-footage/item/245326623-amino-acids-serve-catalysts-signaling-molecules-and-gene-exp
100:https://www.pond5.com/stock-footage/item/257175781-animation-computer-language-and-connected-dots-multiple-medi
101:https://www.pond5.com/stock-footage/item/233849246-animation-computer-language-over-multiple-medical-interface
102:https://www.pond5.com/stock-footage/item/233650065-animation-multiple-medical-interfaces-over-connected-dots-ag
Screen recorded on 23 April 2025 from websites:
103:https://www.english-corpora.org//coca/
104:https://www.thoughtco.com/what-is-corpus-language-1689806#:~:text=In%20linguistics%2C%20a%20corpus%20is,Plural%3A%20corpora.
Consulted References:
Refer to Part 3 for all consolidated references for all parts.