Weekly Research Progress Report

Week 36, 2025 (September 1 - September 7)

๐Ÿ“Š This Week's Focus

Protein Language Models (PLM) for Mitochondrial Protein Analysis

Developed a computational pipeline to analyze mitochondrial protein targeting using ESM-2 protein language models.

This Week's Focus: Protein Language Models & Genome Editing Literature Review

1. Protein Language Model Project Development COMPLETED

Developed a complete analysis pipeline for studying mitochondrial protein sub-compartment targeting using ESM-2 protein language models.

Key Findings:
Sub-compartment Number of Proteins Mean Pairwise Distance Biological Interpretation
Intermembrane Space 4 2.48 Highly conserved electron transport
Matrix 6 3.29 Core metabolic enzymes
Outer Membrane 5 4.06 Diverse import receptors
Inner Membrane 5 4.35 Varied transport functions

๐Ÿ“„ Project Abstract Available

Mitochondrial protein targeting analysis using PLMs

View GitHub Repository

2. Technical Implementation Python 3.9

PyTorch Transformers ESM scikit-learn UMAP NumPy Pandas Matplotlib Seaborn
Pipeline Components:

3. Literature Review: Bridge RNA Systems COMPLETED

Studied recent advances in programmable genome editing technologies, particularly bridge RNA systems.

Bridge RNAs direct programmable recombination of target and donor DNA
Nature, June 2024

Reviewed the discovery of bridge RNAs in IS110 insertion sequences that enable programmable DNA rearrangements.

Structural mechanism of bridge RNA-guided recombination
Nature, June 2024

Studied cryo-EM structures revealing the molecular basis of bridge RNA-guided recombination.

Uncovering differential tolerance to deletions versus substitutions with a protein language model
Cell Systems, September 2025

Analyzed how protein language models predict differential effects of deletions vs substitutions.

4. Project Documentation & Reproducibility

Key Metrics for the Week

Proteins Analyzed: 20 mitochondrial proteins
Embeddings Generated: 640-dimensional vectors
Processing Time: ~17 seconds total
Computational Resources: 3GB disk space, CPU-only
Papers Reviewed: 3 high-impact publications
Code Files Created: 8 Python modules

Technical Achievements

Next Week's Goals (Week 37)