
        Coursework writing service: EMATM0050 DSMP MSc in Data Science

        Date: 2024-04-21  Source: Hefei Net (合肥網, hfw.cc)  Author: hfw.cc



         University of Bristol MSc in Data Science; DSMP (Data Science Mini Project; EMATM0050)
        Predicting T-Cell Receptor Specificity
        T cells (T lymphocytes) are among the most important cells of the immune system and play a vital role in adaptive immunity. T cells recognise cells in the body that are infected by viruses or bacteria, or that have undergone cancerous transformation. After recognising the infected or cancerous cells, T cells eliminate them from the body, thereby preventing the spread of infection or cancer.
        T cells recognise their targets through the T Cell Receptors (TCRs) expressed on their cell membrane. A T Cell Receptor consists of an alpha and a beta subunit. The evolutionary arms race between pathogens and the immune system has produced a mechanism for generating a huge number of unique TCRs, which is essential for a proper immune response against infections and cancer. Although TCR genes are encoded in the genome, their diversity is massively enhanced in several ways: (i) each TCR is composed of a pair of proteins (either alpha + beta chains or gamma + delta chains); (ii) rather than being encoded as a single gene, the DNA encoding the variable region of each of these chains is formed by joining 3 or 4 different stretches of DNA (gene segments) in a process called VDJ recombination. Each alpha subunit contains a single V and a single J segment, and each beta subunit contains a single V, D and J segment. Diversity arises because the genome encodes multiple V, D and J segments; (iii) the joining of these segments involves mechanisms that insert and delete nucleotides in a pseudorandom fashion, maximising diversity in the joining region (the CDR3), the region of the TCR chain that contacts the peptide antigen (ref 1).
        T Cell Receptors (TCRs) constitute one of the most promising classes of emerging therapeutics. Whilst TCRs are amongst the most complex facets of immune biology, engineering an optimum TCR can transform immunotherapies and personalised medicines. The TCR repertoire at any time point reflects the person’s health and contains a memory of past immune exposures. However, TCRs are highly variable and their specificities are not easily predictable with traditional empirical methods.
        In this project you will analyse TCR repertoire data from VDJdb (link) and use machine learning to predict which TCRs bind to specific epitopes.
         
         Tasks
        1. Data Download and Preprocessing
        1.1 Download the zip file from GitHub and focus on the VDJdb.txt file.
        1.2 Preprocess the dataset. Figure out what each column represents and keep the columns that will help you complete the project.
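A minimal sketch of what the preprocessing in task 1 might look like. It assumes VDJdb.txt is tab-separated and has columns named `complex.id`, `gene`, `cdr3`, `v.segm`, `j.segm`, `species`, `antigen.epitope` and `vdjdb.score`, and that a `complex.id` of `0` marks a chain with no recorded partner. These are assumptions to check against the header of the real file. The sample rows and the helper names `load_vdjdb` and `pair_chains` are hypothetical, and the filtering thresholds are illustrative choices rather than requirements of the brief.

```python
import csv
import io

# Hypothetical sample rows mimicking the assumed VDJdb.txt layout (tab-separated).
SAMPLE = (
    "complex.id\tgene\tcdr3\tv.segm\tj.segm\tspecies\tantigen.epitope\tvdjdb.score\n"
    "1\tTRA\tCAVRDSNYQLIW\tTRAV3\tTRAJ33\tHomoSapiens\tGILGFVFTL\t2\n"
    "1\tTRB\tCASSIRSSYEQYF\tTRBV19\tTRBJ2-7\tHomoSapiens\tGILGFVFTL\t2\n"
    "0\tTRB\tCASSLAPGATNEKLFF\tTRBV7-6\tTRBJ1-4\tHomoSapiens\tNLVPMVATV\t0\n"
    "2\tTRB\tCASS*GQYF\tTRBV5-1\tTRBJ2-7\tMusMusculus\tSSYRRPVGI\t1\n"
)

# Columns assumed to be useful for the later tasks.
KEEP = ["complex.id", "gene", "cdr3", "v.segm", "j.segm",
        "species", "antigen.epitope", "vdjdb.score"]
VALID_AA = set("ACDEFGHIKLMNPQRSTVWY")


def load_vdjdb(handle, species="HomoSapiens", min_score=1):
    """Read tab-separated records, keep selected columns, drop unusable rows."""
    rows = []
    for rec in csv.DictReader(handle, delimiter="\t"):
        cdr3 = rec["cdr3"].strip().upper()
        if rec["species"] != species:
            continue  # restrict to one species
        if int(rec["vdjdb.score"]) < min_score:
            continue  # drop low-confidence records (assumed meaning of the score)
        if not cdr3 or not set(cdr3) <= VALID_AA:
            continue  # drop empty CDR3s or ones with non-standard symbols
        row = {col: rec[col] for col in KEEP}
        row["cdr3"] = cdr3
        rows.append(row)
    return rows


def pair_chains(rows):
    """Group alpha and beta records that share a non-zero complex.id."""
    paired = {}
    for row in rows:
        if row["complex.id"] != "0":
            paired.setdefault(row["complex.id"], {})[row["gene"]] = row
    # Keep only complexes where both chains survived filtering.
    return {cid: ch for cid, ch in paired.items() if {"TRA", "TRB"} <= set(ch)}


rows = load_vdjdb(io.StringIO(SAMPLE))
pairs = pair_chains(rows)
print(len(rows), "records kept;", len(pairs), "paired alpha-beta complexes")
```

For the real file, the `io.StringIO(SAMPLE)` handle would be replaced by `open("VDJdb.txt", newline="")`, assuming the file sits in the working directory.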
        Predicting TCR specificity from sequence alone is the holy grail of immunotherapy. TCRs that are specific to the same target often have very similar sequences, so patterns linking TCR sequence to target emerge in the data.
        A crude approach would be to represent the amino acids of the TCR, or of its key regions, with a one-hot encoding.
        2. What are the limitations of this approach in downstream analysis? Can you describe a way to overcome them? (Hint: consider the CDR3 length distribution. We are looking for a high-level description of the limitation and of an approach that would overcome it. No algorithm development is required.)
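To make the one-hot representation concrete, the sketch below encodes a few CDR3 sequences and prints the resulting vector lengths. It shows that sequences of different lengths give vectors of different sizes, and that right-padding with zeros is one possible (not the only) way to equalise them. The example sequences and the function names `one_hot` and `one_hot_padded` are hypothetical.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
AA_INDEX = {aa: i for i, aa in enumerate(AMINO_ACIDS)}


def one_hot(seq):
    """Encode a sequence as a flat list with 20 indicator values per residue."""
    vec = []
    for aa in seq:
        row = [0] * len(AMINO_ACIDS)
        row[AA_INDEX[aa]] = 1
        vec.extend(row)
    return vec


def one_hot_padded(seq, max_len):
    """Right-pad with all-zero positions so every sequence has the same length."""
    if len(seq) > max_len:
        raise ValueError("sequence longer than max_len")
    padding = [0] * (len(AMINO_ACIDS) * (max_len - len(seq)))
    return one_hot(seq) + padding


# Hypothetical CDR3 sequences of different lengths.
cdr3s = ["CASSIRSSYEQYF", "CASSLAPGATNEKLFF", "CAVRDSNYQLIW"]

raw_lengths = [len(one_hot(s)) for s in cdr3s]
max_len = max(len(s) for s in cdr3s)
padded_lengths = [len(one_hot_padded(s, max_len)) for s in cdr3s]

print(raw_lengths)     # one length per sequence, 20 values per residue
print(padded_lengths)  # identical lengths after padding
```

Padding fixes the shape mismatch but still compares residues position by position from the left, so an insertion near the start of a CDR3 shifts every later position. That is one reason alignment-aware distances are often preferred.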
        A common method for predicting specificity from sequence is described in Vujovic et al. (1). It builds a distance or similarity matrix over TCR sequences and uses that representation to train models that classify TCRs by specificity (Fig. 1).
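As an illustration of the distance-matrix idea, the sketch below uses plain Levenshtein edit distance between CDR3 sequences as a simple self-defined metric. It is a stand-in and not an implementation of TCRDist, GLIPH or GIANA. The sequences are hypothetical, and summing the alpha and beta distances to get a combined distance is an assumption made for illustration.

```python
def levenshtein(a, b):
    """Edit distance between two strings (insertions, deletions, substitutions)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def distance_matrix(seqs):
    """Symmetric matrix of pairwise edit distances."""
    n = len(seqs)
    dist = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            dist[i][j] = dist[j][i] = levenshtein(seqs[i], seqs[j])
    return dist


# Hypothetical paired CDR3 sequences: entry k of each list belongs to TCR k.
alpha = ["CAVRDSNYQLIW", "CAVRDSNYQLIW", "CAGAGSQGNLIF"]
beta = ["CASSIRSSYEQYF", "CASSIRSAYEQYF", "CASSLAPGATNEKLFF"]

d_alpha = distance_matrix(alpha)
d_beta = distance_matrix(beta)
# Assumed combination rule: add the per-chain distances.
d_combined = [[a + b for a, b in zip(row_a, row_b)]
              for row_a, row_b in zip(d_alpha, d_beta)]

for row in d_combined:
    print(row)
```

The pairwise loop is quadratic in the number of sequences, so on the full dataset a subsample or a dedicated tool would probably be needed.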
         
          3. Estimate a distance/similarity matrix representation of the data. Calculate these metrics for the alpha and the beta chains separately, then calculate these for the combined alpha and beta chains too. (Hint: TCRDist, GLIPH or GIANA can be used for this. Alternatively, you can define your own similarity metric.)
        4. Plot the TCRs in 2 dimensions and colour them based on specificity. Compare the plots for the alpha, the beta and the combined alpha-beta chains. Comment on your findings. (Hint: scikit-learn has a plethora of dimensionality reduction tools, such as PCA and t-SNE; UMAP is available from the separate umap-learn package.)
        5. Write code to cluster TCRs. How well do TCRs cluster based on specificity? Can you explain why they do/don’t?
        6. Write an algorithm that predicts antigen specificity from sequence. You can use any supervised or unsupervised algorithm. Comment on the performance of the model and explain why it performs well or poorly. (Hint: any reasonable modelling approach is fine. However, keep in mind that simpler models sometimes provide more insight into the underlying problem.)
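Tasks 4 to 6 can all start from a precomputed distance matrix. The sketch below is one possible outline under stated assumptions, not a reference solution. It uses classical multidimensional scaling (principal coordinates analysis) with NumPy in place of the PCA, t-SNE or UMAP named in the hint, a single-linkage threshold clustering scored by cluster purity, and a leave-one-out nearest-neighbour classifier. The distance matrix, the epitope labels, the threshold and all function names are hypothetical.

```python
from collections import Counter

import numpy as np


def classical_mds(dist, n_components=2):
    """Embed points in n_components dimensions from a symmetric distance matrix."""
    d = np.asarray(dist, dtype=float)
    n = d.shape[0]
    centering = np.eye(n) - np.ones((n, n)) / n
    gram = -0.5 * centering @ (d ** 2) @ centering        # double-centred squared distances
    eigvals, eigvecs = np.linalg.eigh(gram)
    order = np.argsort(eigvals)[::-1][:n_components]      # largest eigenvalues first
    scale = np.sqrt(np.clip(eigvals[order], 0.0, None))   # guard against negative eigenvalues
    return eigvecs[:, order] * scale


def threshold_clusters(dist, threshold):
    """Single-linkage clustering: link any two TCRs whose distance is <= threshold."""
    n = len(dist)
    parent = list(range(n))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i in range(n):
        for j in range(i + 1, n):
            if dist[i][j] <= threshold:
                parent[find(i)] = find(j)
    roots = [find(i) for i in range(n)]
    relabel = {root: k for k, root in enumerate(dict.fromkeys(roots))}
    return [relabel[r] for r in roots]


def purity(clusters, labels):
    """Fraction of TCRs whose epitope matches the majority epitope of their cluster."""
    members = {}
    for c, lab in zip(clusters, labels):
        members.setdefault(c, []).append(lab)
    majority = sum(Counter(labs).most_common(1)[0][1] for labs in members.values())
    return majority / len(labels)


def loo_knn_accuracy(dist, labels, k=1):
    """Leave-one-out accuracy of a k-nearest-neighbour vote on the distance matrix."""
    hits = 0
    for i in range(len(labels)):
        others = [j for j in range(len(labels)) if j != i]
        nearest = sorted(others, key=lambda j: dist[i][j])[:k]
        vote = Counter(labels[j] for j in nearest).most_common(1)[0][0]
        hits += vote == labels[i]
    return hits / len(labels)


# Hypothetical distances between five TCRs and their (made-up) epitope labels.
dist = [
    [0, 1, 6, 7, 2],
    [1, 0, 6, 6, 3],
    [6, 6, 0, 2, 6],
    [7, 6, 2, 0, 7],
    [2, 3, 6, 7, 0],
]
epitopes = ["GILGFVFTL", "GILGFVFTL", "NLVPMVATV", "NLVPMVATV", "NLVPMVATV"]

coords = classical_mds(dist)                    # task 4: 2D coordinates to plot
clusters = threshold_clusters(dist, threshold=2)  # task 5: cluster assignments
print(coords.round(2))
print(clusters, purity(clusters, epitopes))
print(loo_knn_accuracy(dist, epitopes, k=1))    # task 6: simple baseline
```

The `coords` array can be passed to any plotting library, with one colour per epitope. In this toy matrix the fifth TCR sits close to the first two despite carrying a different label, so it is both clustered and classified with the wrong epitope. That is the kind of case the questions about clustering quality and model performance ask you to explain.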

         Bibliography/References
        1. Vujovic M, Degn KF, Marin FI, Schaap-Johansen AL, Chain B, Andresen TL, Kaplinsky J, Marcatili P. T cell receptor sequence clustering and antigen specificity. Comput Struct Biotechnol J (2020) 18:2166–21**. doi:10.1016/j.csbj.2020.06.041
        2. Mayer-Blackwell. TCR meta-clonotypes for biomarker discovery with tcrdist3: quantification of public, HLA-restricted TCR biomarkers of SARS-CoV-2 infection. bioRxiv (2020) 1:75–94.
        3. Huang H, Wang C, Rubelt F, Scriba TJ, Davis MM. Analyzing the Mycobacterium tuberculosis immune response by T-cell receptor clustering with GLIPH2 and genome-wide antigen screening. Nat Biotechnol (2020) 38:1194–1202. doi:10.1038/s41587-020-0505-4
        4. Zhang H, Zhan X, Li B. GIANA allows computationally-efficient TCR clustering and multi-disease repertoire classification by isometric transformation. Nat Commun (2021) 12:1–11. doi:10.1038/s41467-02**25006-
