<?xml version="1.0" encoding="utf-8"?><feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en"><generator uri="https://jekyllrb.com/" version="4.3.4">Jekyll</generator><link href="https://chunde.github.io/feed.xml" rel="self" type="application/atom+xml"/><link href="https://chunde.github.io/" rel="alternate" type="text/html" hreflang="en"/><updated>2026-02-23T07:40:50+00:00</updated><id>https://chunde.github.io/feed.xml</id><title type="html">iℏ∮dͩ𝛑•💕</title><subtitle>Chunde&apos;s Personal Pages. </subtitle><entry><title type="html">Noisy Student Self-Distillation</title><link href="https://chunde.github.io/blog/2026/Noisy-Student-Self-Distillation/" rel="alternate" type="text/html" title="Noisy Student Self-Distillation"/><published>2026-02-22T18:09:47+00:00</published><updated>2026-02-22T18:09:47+00:00</updated><id>https://chunde.github.io/blog/2026/Noisy-Student-Self-Distillation</id><content type="html" xml:base="https://chunde.github.io/blog/2026/Noisy-Student-Self-Distillation/"><![CDATA[<p>在AlphaFold的上下文中，”noisy student self-distillation”（带噪声的学生自蒸馏）是一种<strong>半监督学习</strong>或<strong>自训练</strong>的技术。为了理解它，我们可以把这个术语拆解成几个核心概念，并结合AlphaFold的应用场景来解释。</p> <h3 id="1-核心概念拆解">1. 核心概念拆解</h3> <ul> <li><strong>自蒸馏 (Self-Distillation)：</strong> <ul> <li>通常的”知识蒸馏”是用一个大的、训练好的<strong>教师模型</strong>来指导一个小的<strong>学生模型</strong>学习。</li> <li>“自蒸馏”意味着<strong>教师和学生是同一个架构的模型</strong>。通常是先训练一个初始模型，然后用这个模型去预测无标签数据（产生伪标签），再把这些伪标签加入训练集，重新训练一个新的模型（学生）。这个新模型往往会比原来的教师模型表现更好。</li> </ul> </li> <li><strong>学生 (Student)：</strong> <ul> <li>指的是正在被训练的新模型。</li> </ul> </li> <li><strong>教师 (Teacher)：</strong> <ul> <li>指的是之前训练好的、用来生成伪标签的旧模型。</li> </ul> </li> <li><strong>Noisy（带噪声）：</strong> <ul> <li>这是该技术的点睛之笔。在训练<strong>学生</strong>模型时，<strong>刻意给输入数据或模型本身增加噪声</strong>（例如，随机遮挡部分输入序列、增加随机扰动、使用数据增强或Dropout等）。</li> <li><strong>目的：</strong> 强迫学生模型在输入不完美、有噪声的情况下，仍然尝试去拟合教师模型给出的干净的、高质量的伪标签。这能让学生模型学习到更鲁棒、更本质的蛋白质折叠规律，从而超越教师模型的性能。</li> </ul> </li> </ul> <h3 id="2-把这个概念应用到alphafold的场景中">2. 把这个概念应用到AlphaFold的场景中</h3> <p>结合你给出的句子，AlphaFold的训练流程大致是这样的：</p> <ol> <li><strong>第一阶段：监督学习（基线）</strong> <ul> <li>只使用实验确定的PDB（蛋白质数据银行）数据，通过比对（MSA）和模板，训练一个AlphaFold模型。这个模型已经能达到很高的精度。</li> </ul> </li> <li><strong>第二阶段：Noisy Student Self-Distillation（精度提升的关键）</strong> <ul> <li><strong>生成伪标签：</strong> 使用第一阶段训练好的模型（作为<strong>教师</strong>）去预测大量的、结构未知的蛋白质序列。这些预测出来的结构虽然未经实验验证，但置信度很高，被当作”伪标签”（Pseudo-labels）。</li> <li><strong>构建新数据集：</strong> 将原始的PDB数据（真实验数据）与这些带有伪标签的预测数据混合在一起，形成一个更大的训练集。</li> <li><strong>训练新模型（带噪声）：</strong> 训练一个新的AlphaFold模型（作为<strong>学生</strong>）。在训练这个学生模型时： <ul> <li>对于真实验数据（PDB），正常计算损失。</li> <li>对于伪标签数据，让学生模型去拟合教师模型预测的结构。</li> <li><strong>关键步骤：</strong> 在这个过程中，向学生模型的输入（例如，MSA序列、模板）添加噪声（如随机掩码）。由于目标是拟合教师模型的输出，学生模型必须学会”纠正”这些噪声干扰，从而对蛋白质结构有更深入的理解。</li> </ul> </li> </ul> </li> </ol> <h3 id="3-为什么这种方法是有效的">3. 为什么这种方法是有效的？</h3> <ul> <li><strong>克服过拟合：</strong> PDB中的数据虽然精确，但数量有限，且包含特定的偏好。通过自蒸馏，模型可以从海量的、多样化的预测数据中学习，覆盖更广阔的蛋白质空间。</li> <li><strong>一致性学习：</strong> 带噪声的训练迫使模型学习到的特征更加平滑和一致。模型不能仅仅记住结构，而必须真正理解”即使我看不清某些部分，整体的折叠结构也应该是那个样子”。</li> <li><strong>迭代提升：</strong> 这个过程往往可以迭代多次。新的学生模型可以变成下一轮的教师，生成更高质量的伪标签，周而复始，性能不断提升。</li> </ul> <h3 id="总结">总结</h3> <p><strong>“Noisy student self-distillation”在AlphaFold中的含义是：</strong></p> <blockquote> <p>利用已经训练好的AlphaFold模型（教师）去预测大量未知蛋白的结构作为伪标签，然后在训练新的AlphaFold模型（学生）时，<strong>故意给输入数据添加噪声</strong>，强迫学生模型去学习拟合那些干净的伪标签（以及真实的PDB数据）。通过这种机制，学生模型能学到比单纯依靠实验数据更强大、更鲁棒的结构预测能力。</p> </blockquote>]]></content><author><name></name></author><category term="microwave-engineering"/><category term="transmission-line"/><category term="impedance-matching"/><category term="Kuroda-identity"/><category term="Richard-transformation"/><summary type="html"><![CDATA[Mathematical identities in microwave engineering for impedance and admittance inversion.]]></summary></entry><entry><title type="html">Molecular Docking</title><link href="https://chunde.github.io/blog/2025/Molecular-Docking-Explanation/" rel="alternate" type="text/html" title="Molecular Docking"/><published>2025-10-15T22:59:27+00:00</published><updated>2025-10-15T22:59:27+00:00</updated><id>https://chunde.github.io/blog/2025/Molecular-Docking-Explanation</id><content type="html" xml:base="https://chunde.github.io/blog/2025/Molecular-Docking-Explanation/"><![CDATA[<p>Molecular docking is a computational method used to predict the preferred orientation and binding affinity of one molecule (typically a small molecule ligand) to a second molecule (usually a protein or enzyme) when bound to each other to form a stable complex. This technique plays a crucial role in structural molecular biology and computer-assisted drug design.</p> <h2 id="key-components-of-molecular-docking">Key Components of Molecular Docking</h2> <h3 id="1-molecular-structures">1. <strong>Molecular Structures</strong></h3> <ul> <li><strong>Receptor</strong>: Usually a protein or enzyme with a known 3D structure</li> <li><strong>Ligand</strong>: Small molecule that binds to the receptor’s active site</li> <li><strong>Complex</strong>: The resulting structure after binding</li> </ul> <h3 id="2-docking-process">2. <strong>Docking Process</strong></h3> <p>The docking procedure typically involves:</p> <p><strong>Search Algorithm</strong>: Explores possible orientations and conformations of the ligand within the receptor’s binding site using methods like:</p> <ul> <li>Genetic algorithms</li> <li>Monte Carlo simulations</li> <li>Systematic search methods</li> <li>Fragment-based approaches</li> </ul> <p><strong>Scoring Function</strong>: Evaluates and ranks the predicted binding poses based on:</p> <ul> <li>Molecular mechanics force fields</li> <li>Empirical scoring functions</li> <li>Knowledge-based potentials</li> <li>Machine learning approaches</li> </ul> <h2 id="mathematical-framework">Mathematical Framework</h2> <p>The binding energy between receptor \(R\) and ligand \(L\) can be approximated by:</p> \[\Delta G_{bind} = G_{complex} - (G_{receptor} + G_{ligand})\] <p>Where scoring functions typically estimate this using terms like:</p> \[Score = w_{vdw} \cdot E_{vdw} + w_{elec} \cdot E_{elec} + w_{hbond} \cdot E_{hbond} + w_{desolv} \cdot E_{desolv} + \cdots\] <h2 id="applications">Applications</h2> <h3 id="drug-discovery"><strong>Drug Discovery</strong></h3> <ul> <li>Virtual screening of compound libraries</li> <li>Lead optimization</li> <li>Structure-based drug design</li> <li>Predicting drug-target interactions</li> </ul> <h3 id="biological-research"><strong>Biological Research</strong></h3> <ul> <li>Understanding protein-ligand interactions</li> <li>Studying enzyme mechanisms</li> <li>Investigating molecular recognition</li> <li>Predicting binding affinities</li> </ul> <h2 id="types-of-docking-approaches">Types of Docking Approaches</h2> <h3 id="rigid-docking"><strong>Rigid Docking</strong></h3> <ul> <li>Treats both receptor and ligand as rigid bodies</li> <li>Faster but less accurate</li> <li>Suitable for preliminary screening</li> </ul> <h3 id="flexible-docking"><strong>Flexible Docking</strong></h3> <ul> <li>Allows conformational changes in ligand</li> <li>More computationally intensive</li> <li>Better accuracy for complex systems</li> </ul> <h3 id="induced-fit-docking"><strong>Induced Fit Docking</strong></h3> <ul> <li>Considers receptor flexibility</li> <li>Most realistic but computationally expensive</li> <li>Accounts for conformational changes upon binding</li> </ul> <h2 id="software-tools">Software Tools</h2> <p>Popular molecular docking software includes:</p> <ul> <li><strong>AutoDock</strong> and <strong>AutoDock Vina</strong></li> <li><strong>GOLD</strong> (Genetic Optimization for Ligand Docking)</li> <li><strong>Glide</strong> (Grid-based Ligand Docking with Energetics)</li> <li><strong>DOCK</strong></li> <li><strong>SwissDock</strong></li> </ul> <h2 id="challenges-and-limitations">Challenges and Limitations</h2> <ul> <li><strong>Scoring Function Accuracy</strong>: Current scoring functions have limitations in accurately predicting binding affinities</li> <li><strong>Solvent Effects</strong>: Proper treatment of water molecules and solvent effects</li> <li><strong>Conformational Flexibility</strong>: Handling receptor flexibility remains challenging</li> <li><strong>Computational Cost</strong>: High computational requirements for large-scale screening</li> </ul> <p>Molecular docking continues to evolve with advances in computational power, machine learning, and improved physical models, making it an indispensable tool in modern drug discovery and molecular biology research.</p>]]></content><author><name></name></author><category term="molecular-docking"/><category term="drug-discovery"/><category term="protein-ligand-interaction"/><category term="computational-biology"/><category term="bioinformatics"/><summary type="html"><![CDATA[Computational method for predicting how molecules interact and bind to target proteins.]]></summary></entry><entry><title type="html">Ubiquitin-Proteasome System</title><link href="https://chunde.github.io/blog/2025/Ubiquitin-Proteasome-System-Overview/" rel="alternate" type="text/html" title="Ubiquitin-Proteasome System"/><published>2025-10-11T14:08:35+00:00</published><updated>2025-10-11T14:08:35+00:00</updated><id>https://chunde.github.io/blog/2025/Ubiquitin-Proteasome-System-Overview</id><content type="html" xml:base="https://chunde.github.io/blog/2025/Ubiquitin-Proteasome-System-Overview/"><![CDATA[<p>The <strong>Ubiquitin-Proteasome System (UPS)</strong> is a highly regulated and selective intracellular pathway responsible for the degradation of damaged, misfolded, or short-lived proteins. It plays a crucial role in maintaining cellular homeostasis, regulating various cellular processes such as cell cycle progression, signal transduction, and immune responses.</p> <h3 id="key-components">Key Components</h3> <ol> <li> <p><strong>Ubiquitin</strong>: A small, 76-amino acid protein that acts as a molecular tag. It is covalently attached to target proteins through a process called ubiquitination.</p> </li> <li><strong>Ubiquitination Enzymes</strong>: <ul> <li><strong>E1 (Ubiquitin-Activating Enzyme)</strong>: Activates ubiquitin in an ATP-dependent manner.</li> <li><strong>E2 (Ubiquitin-Conjugating Enzyme)</strong>: Transfers the activated ubiquitin from E1 to the target protein.</li> <li><strong>E3 (Ubiquitin Ligase)</strong>: Recognizes specific target proteins and facilitates the transfer of ubiquitin from E2 to the target, forming an isopeptide bond between ubiquitin and a lysine residue on the target protein.</li> </ul> </li> <li><strong>Proteasome</strong>: A large, multi-subunit protease complex that degrades ubiquitinated proteins. It consists of: <ul> <li><strong>20S Core Particle</strong>: Contains proteolytic sites that break down proteins into small peptides.</li> <li><strong>19S Regulatory Particle</strong>: Recognizes ubiquitinated proteins, removes ubiquitin chains, and unfolds the target protein for degradation.</li> </ul> </li> </ol> <h3 id="mechanism-of-the-ups">Mechanism of the UPS</h3> <ol> <li><strong>Ubiquitination</strong>: <ul> <li>A polyubiquitin chain (typically linked through lysine 48 of ubiquitin) is attached to the target protein, marking it for degradation.</li> <li>The specificity of this process is determined by E3 ligases, which recognize degradation signals (e.g., specific amino acid sequences or post-translational modifications) on target proteins.</li> </ul> </li> <li><strong>Recognition and Degradation</strong>: <ul> <li>The polyubiquitinated protein is recognized by the 19S regulatory particle of the proteasome.</li> <li>The ubiquitin chain is removed and recycled, while the target protein is unfolded and translocated into the 20S core particle.</li> <li>Proteolytic enzymes within the 20S core degrade the protein into short peptides (typically 3–25 amino acids long).</li> </ul> </li> <li><strong>Peptide Recycling</strong>: <ul> <li>The resulting peptides are further broken down by cellular peptidases into amino acids, which can be reused for protein synthesis or other metabolic processes.</li> </ul> </li> </ol> <h3 id="biological-significance">Biological Significance</h3> <ul> <li><strong>Protein Quality Control</strong>: The UPS eliminates misfolded or damaged proteins, preventing their accumulation and potential toxicity.</li> <li><strong>Regulation of Key Processes</strong>: <ul> <li>Cell cycle control (e.g., degradation of cyclins).</li> <li>Apoptosis (programmed cell death).</li> <li>DNA repair and transcription regulation.</li> <li>Immune response (e.g., antigen presentation via MHC class I molecules).</li> </ul> </li> <li><strong>Disease Implications</strong>: <ul> <li>Dysregulation of the UPS is linked to neurodegenerative diseases (e.g., Alzheimer’s, Parkinson’s), cancer, and autoimmune disorders.</li> <li>Therapeutic targets, such as proteasome inhibitors (e.g., bortezomib), are used in cancer treatment.</li> </ul> </li> </ul> <h3 id="mathematical-representation">Mathematical Representation</h3> <p>The efficiency of protein degradation via the UPS can be modeled using Michaelis-Menten kinetics, where the degradation rate \(v\) is given by:</p> \[v = \frac{V_{\text{max}} \cdot [S]}{K_m + [S]}\] <p>Here, \([S]\) represents the concentration of ubiquitinated substrate, \(V_{\text{max}}\) is the maximum degradation rate, and \(K_m\) is the Michaelis constant, reflecting the affinity of the proteasome for the substrate.</p> <h3 id="summary">Summary</h3> <p>The Ubiquitin-Proteasome System is a sophisticated and essential cellular machinery for controlled protein turnover. Its precision in targeting specific proteins for degradation ensures proper cellular function and adaptation to environmental changes, making it a focal point in both basic research and therapeutic development.</p>]]></content><author><name></name></author><category term="Cell"/><category term="Biology"/><category term="Protein"/><category term="Degradation"/><category term="Ubiquitin"/><category term="Proteasome"/><category term="Regulation"/><summary type="html"><![CDATA[A cellular mechanism for targeted protein degradation and regulation.]]></summary></entry><entry><title type="html">Ubiquitin Overview</title><link href="https://chunde.github.io/blog/2025/Ubiquitin-Detailed-Explanation/" rel="alternate" type="text/html" title="Ubiquitin Overview"/><published>2025-10-11T10:18:45+00:00</published><updated>2025-10-11T10:18:45+00:00</updated><id>https://chunde.github.io/blog/2025/Ubiquitin-Detailed-Explanation</id><content type="html" xml:base="https://chunde.github.io/blog/2025/Ubiquitin-Detailed-Explanation/"><![CDATA[<h2 id="introduction-to-ubiquitin">Introduction to Ubiquitin</h2> <p>Ubiquitin is a small regulatory protein found in eukaryotic cells that plays a fundamental role in protein degradation and numerous cellular processes. Discovered in 1975 by Gideon Goldstein and colleagues, ubiquitin has since been recognized as one of the most important regulatory molecules in cell biology.</p> <h2 id="structural-characteristics">Structural Characteristics</h2> <p>Ubiquitin is a highly conserved protein consisting of 76 amino acids with a molecular weight of approximately 8.5 kDa. Its structure features:</p> <ul> <li><strong>Compact globular fold</strong> with a mixed \(\beta\)-sheet and \(\alpha\)-helix arrangement</li> <li><strong>Seven \(\beta\)-strands</strong> and <strong>three \(\alpha\)-helices</strong> forming a stable tertiary structure</li> <li><strong>C-terminal glycine residue</strong> (Gly76) that serves as the attachment point for target proteins</li> <li><strong>Hydrophobic patch</strong> on the surface that facilitates interactions with ubiquitin-binding domains</li> </ul> <h2 id="the-ubiquitination-process">The Ubiquitination Process</h2> <p>Ubiquitination involves a three-step enzymatic cascade:</p> <h3 id="1-activation-e1-enzyme">1. Activation (E1 Enzyme)</h3> <p>Ubiquitin-activating enzyme (E1) activates ubiquitin in an ATP-dependent reaction: \(\text{Ubiquitin} + \text{ATP} \rightarrow \text{Ubiquitin-AMP} + \text{PP}_i\) The activated ubiquitin is then transferred to the E1 enzyme’s active site cysteine residue.</p> <h3 id="2-conjugation-e2-enzyme">2. Conjugation (E2 Enzyme)</h3> <p>Ubiquitin-conjugating enzymes (E2) receive the activated ubiquitin from E1 through a trans-thioesterification reaction.</p> <h3 id="3-ligation-e3-enzyme">3. Ligation (E3 Enzyme)</h3> <p>Ubiquitin ligases (E3) facilitate the transfer of ubiquitin from E2 to specific target proteins, forming an isopeptide bond between ubiquitin’s C-terminal glycine and the \(\epsilon\)-amino group of a lysine residue on the target protein.</p> <h2 id="types-of-ubiquitin-chains">Types of Ubiquitin Chains</h2> <p>Ubiquitin can form different chain types through its seven internal lysine residues (K6, K11, K27, K29, K33, K48, K63):</p> <ul> <li><strong>K48-linked chains</strong>: Primarily target proteins for proteasomal degradation</li> <li><strong>K63-linked chains</strong>: Involved in DNA repair, endocytosis, and signal transduction</li> <li><strong>Linear chains</strong>: Formed through N-terminal methionine (M1)</li> <li><strong>Mixed/branched chains</strong>: Combinations of different linkages create complex signaling patterns</li> </ul> <h2 id="cellular-functions">Cellular Functions</h2> <h3 id="protein-degradation">Protein Degradation</h3> <p>The most well-known function involves targeting proteins for degradation by the 26S proteasome through K48-linked polyubiquitin chains.</p> <h3 id="signal-transduction">Signal Transduction</h3> <ul> <li>Regulation of NF-\(\kappa\)B pathway</li> <li>Control of cell cycle progression</li> <li>DNA damage response and repair</li> </ul> <h3 id="membrane-trafficking">Membrane Trafficking</h3> <ul> <li>Endocytosis and lysosomal degradation</li> <li>Regulation of receptor internalization</li> </ul> <h3 id="dna-repair">DNA Repair</h3> <ul> <li>Coordination of DNA damage response pathways</li> <li>Regulation of repair protein activity</li> </ul> <h2 id="ubiquitin-system-components">Ubiquitin System Components</h2> <h3 id="enzymatic-machinery">Enzymatic Machinery</h3> <ul> <li><strong>E1 enzymes</strong>: 2 types in humans (UBA1, UBA6)</li> <li><strong>E2 enzymes</strong>: Approximately 40 different types</li> <li><strong>E3 ligases</strong>: Over 600 types, providing substrate specificity</li> </ul> <h3 id="deubiquitinating-enzymes-dubs">Deubiquitinating Enzymes (DUBs)</h3> <p>Approximately 100 DUBs reverse ubiquitination by cleaving ubiquitin from substrates or processing ubiquitin precursors.</p> <h2 id="clinical-significance">Clinical Significance</h2> <h3 id="therapeutic-targets">Therapeutic Targets</h3> <p>Ubiquitin pathway components are targets for:</p> <ul> <li>Cancer therapies (proteasome inhibitors like bortezomib)</li> <li>Neurodegenerative disease treatments</li> <li>Anti-inflammatory drugs</li> </ul> <h3 id="disease-associations">Disease Associations</h3> <ul> <li>Mutations in ubiquitin system components linked to various cancers</li> <li>Parkinson’s disease (involves parkin, an E3 ubiquitin ligase)</li> <li>Angelman syndrome (UBE3A mutation)</li> </ul> <h2 id="research-techniques">Research Techniques</h2> <p>Common methods for studying ubiquitin include:</p> <ul> <li>Immunoprecipitation with ubiquitin-specific antibodies</li> <li>Tandem ubiquitin-binding entities (TUBEs) for purification</li> <li>Mass spectrometry for ubiquitin chain topology analysis</li> <li>Fluorescent ubiquitin-based cell cycle indicators (FUCCI)</li> </ul> <p>The ubiquitin system represents a sophisticated post-translational modification network that regulates virtually all aspects of cellular function, making it a critical area of study in molecular biology and therapeutic development.</p>]]></content><author><name></name></author><category term="biochemistry"/><category term="cell-biology"/><category term="protein-degradation"/><category term="post-translational-modification"/><category term="proteasome"/><summary type="html"><![CDATA[Comprehensive analysis of ubiquitin's structure, function, and cellular roles]]></summary></entry><entry><title type="html">PBS Solution in Biology</title><link href="https://chunde.github.io/blog/2025/PBS-Solution-Guide/" rel="alternate" type="text/html" title="PBS Solution in Biology"/><published>2025-10-08T20:11:12+00:00</published><updated>2025-10-08T20:11:12+00:00</updated><id>https://chunde.github.io/blog/2025/PBS-Solution-Guide</id><content type="html" xml:base="https://chunde.github.io/blog/2025/PBS-Solution-Guide/"><![CDATA[<h1 id="comprehensive-guide-to-pbs-solution-in-biology">Comprehensive Guide to PBS Solution in Biology</h1> <h2 id="introduction-to-pbs">Introduction to PBS</h2> <p>Phosphate Buffered Saline (PBS) is one of the most commonly used buffer solutions in biological and biochemical laboratories. It’s an isotonic, non-toxic solution that closely mimics the salt composition and pH of physiological fluids, making it ideal for various biological applications.</p> <h2 id="chemical-composition">Chemical Composition</h2> <p>Standard PBS typically contains:</p> <ul> <li><strong>Sodium chloride (NaCl)</strong>: 137 mM</li> <li><strong>Potassium chloride (KCl)</strong>: 2.7 mM</li> <li><strong>Disodium hydrogen phosphate (Na₂HPO₄)</strong>: 10 mM</li> <li><strong>Potassium dihydrogen phosphate (KH₂PO₄)</strong>: 1.8 mM</li> </ul> <p>The final solution has:</p> <ul> <li><strong>pH</strong>: 7.4 (physiological pH)</li> <li><strong>Osmolarity</strong>: Approximately 290 mOsm/L</li> <li><strong>Ionic strength</strong>: Balanced for biological systems</li> </ul> <h2 id="core-principles-of-pbs-usage">Core Principles of PBS Usage</h2> <h3 id="1-physiological-compatibility-principle">1. <strong>Physiological Compatibility Principle</strong></h3> <p>PBS is designed to maintain cellular integrity by matching the osmotic pressure and ionic composition of mammalian cells. The balanced salt concentration prevents osmotic shock, making it safe for cell washing and suspension.</p> <h3 id="2-ph-buffering-principle">2. <strong>pH Buffering Principle</strong></h3> <p>The phosphate buffer system (H₂PO₄⁻/HPO₄²⁻) maintains a stable pH around 7.4, which is crucial because:</p> <ul> <li>Most enzymes function optimally at physiological pH</li> <li>Protein structure and function are pH-dependent</li> <li>Cell membrane stability requires specific pH conditions</li> </ul> <h3 id="3-ionic-strength-principle">3. <strong>Ionic Strength Principle</strong></h3> <p>The specific ion concentrations in PBS serve multiple purposes:</p> <ul> <li><strong>Na⁺ and K⁺</strong>: Maintain membrane potential and cellular functions</li> <li><strong>Cl⁻</strong>: Most abundant anion in extracellular fluid</li> <li><strong>Phosphate ions</strong>: Provide buffering capacity and participate in cellular processes</li> </ul> <h3 id="4-isotonicity-principle">4. <strong>Isotonicity Principle</strong></h3> <p>PBS is isotonic with mammalian cells, meaning it has the same osmotic pressure as cellular fluids. This prevents:</p> <ul> <li>Cell swelling and bursting in hypotonic solutions</li> <li>Cell shrinkage in hypertonic solutions</li> </ul> <h2 id="key-applications-and-their-principles">Key Applications and Their Principles</h2> <h3 id="cell-culture-and-maintenance"><strong>Cell Culture and Maintenance</strong></h3> <ul> <li><strong>Principle</strong>: Provides a physiological environment for cells outside the body</li> <li><strong>Applications</strong>: Cell washing, media preparation, cell suspension</li> </ul> <h3 id="immunological-techniques"><strong>Immunological Techniques</strong></h3> <ul> <li><strong>Principle</strong>: Maintains antibody and antigen stability</li> <li><strong>Applications</strong>: ELISA, Western blotting, immunohistochemistry, flow cytometry</li> </ul> <h3 id="protein-studies"><strong>Protein Studies</strong></h3> <ul> <li><strong>Principle</strong>: Preserves protein structure and function</li> <li><strong>Applications</strong>: Protein extraction, purification, storage</li> </ul> <h3 id="molecular-biology"><strong>Molecular Biology</strong></h3> <ul> <li><strong>Principle</strong>: Provides stable ionic environment for enzymatic reactions</li> <li><strong>Applications</strong>: DNA/RNA extraction, PCR, restriction digestion</li> </ul> <h3 id="tissue-processing"><strong>Tissue Processing</strong></h3> <ul> <li><strong>Principle</strong>: Maintains tissue integrity during processing</li> <li><strong>Applications</strong>: Fixation, washing, storage</li> </ul> <h2 id="preparation-guidelines">Preparation Guidelines</h2> <h3 id="standard-pbs-recipe"><strong>Standard PBS Recipe</strong></h3> <div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>For 1 liter of 1× PBS:
- NaCl: 8.0 g
- KCl: 0.2 g
- Na₂HPO₄: 1.44 g
- KH₂PO₄: 0.24 g
- Adjust pH to 7.4 with HCl/NaOH
- Bring to final volume with distilled water
</code></pre></div></div> <h3 id="critical-considerations"><strong>Critical Considerations</strong></h3> <ul> <li><strong>Sterilization</strong>: Autoclave or filter sterilize for cell culture use</li> <li><strong>Storage</strong>: Store at room temperature or refrigerated</li> <li><strong>Stability</strong>: Stable for months when properly stored</li> </ul> <h2 id="variations-and-modifications">Variations and Modifications</h2> <h3 id="calcium-and-magnesium-free-pbs"><strong>Calcium and Magnesium-Free PBS</strong></h3> <ul> <li>Used when divalent cations might interfere with experiments</li> <li>Applications: Cell dissociation, enzyme assays</li> </ul> <h3 id="pbs-with-additives"><strong>PBS with Additives</strong></h3> <ul> <li><strong>EDTA</strong>: For cell dissociation (chelates calcium)</li> <li><strong>BSA</strong>: For blocking in immunoassays</li> <li><strong>Sucrose</strong>: For cryoprotection</li> </ul> <h2 id="troubleshooting-common-issues">Troubleshooting Common Issues</h2> <h3 id="ph-drift"><strong>pH Drift</strong></h3> <ul> <li><strong>Cause</strong>: Contamination or improper storage</li> <li><strong>Solution</strong>: Prepare fresh solution and check storage conditions</li> </ul> <h3 id="precipitation"><strong>Precipitation</strong></h3> <ul> <li><strong>Cause</strong>: Incorrect preparation or contamination</li> <li><strong>Solution</strong>: Filter through 0.22 μm membrane</li> </ul> <h3 id="cellular-toxicity"><strong>Cellular Toxicity</strong></h3> <ul> <li><strong>Cause</strong>: Incorrect osmolarity or contamination</li> <li><strong>Solution</strong>: Verify preparation and use sterile techniques</li> </ul> <h2 id="safety-and-best-practices">Safety and Best Practices</h2> <ul> <li>Always use analytical grade reagents</li> <li>Verify pH and osmolarity before critical experiments</li> <li>Use aseptic techniques for cell culture applications</li> <li>Label solutions clearly with preparation date and concentration</li> </ul> <h2 id="conclusion">Conclusion</h2> <p>PBS remains fundamental in biological research due to its physiological relevance and versatility. Understanding the principles behind its composition and usage ensures optimal experimental outcomes and maintains biological sample integrity across diverse applications.</p>]]></content><author><name></name></author><category term="biology"/><category term="cell-culture"/><category term="buffer"/><category term="solution"/><category term="laboratory-techniques"/><category term="biochemistry"/><summary type="html"><![CDATA[A comprehensive guide to Phosphate Buffered Saline (PBS) focusing on its principles and applications in biological research.]]></summary></entry><entry><title type="html">DNA Histone Modifications</title><link href="https://chunde.github.io/blog/2025/DNA-Histone-Modifications/" rel="alternate" type="text/html" title="DNA Histone Modifications"/><published>2025-09-27T22:24:37+00:00</published><updated>2025-09-27T22:24:37+00:00</updated><id>https://chunde.github.io/blog/2025/DNA-Histone-Modifications</id><content type="html" xml:base="https://chunde.github.io/blog/2025/DNA-Histone-Modifications/"><![CDATA[<p>DNA histone modifications are chemical changes made to histone proteins, around which DNA is wound to form chromatin. These modifications do not alter the underlying DNA sequence itself, making them a key mechanism of <strong>epigenetic regulation</strong>. They act as a dynamic control system for gene expression, determining whether a gene is accessible for transcription (turned on) or silenced (turned off).</p> <p>Histones are proteins that package DNA into structural units called nucleosomes. The N-terminal tails of these histone proteins are subject to various chemical modifications, including:</p> <ul> <li><strong>Methylation (\(-CH_3\)):</strong> The addition of methyl groups. This can be associated with both gene activation or repression, depending on which histone and which specific amino acid (e.g., lysine, arginine) is methylated. For example, methylation of histone H3 at lysine 4 (\(H3K4me3\)) is a hallmark of active gene promoters.</li> <li><strong>Acetylation (\(-COCH_3\)):</strong> The addition of acetyl groups. This typically neutralizes the positive charge on histones, loosening their grip on the negatively charged DNA. This open chromatin state (euchromatin) makes genes accessible for transcription.</li> <li><strong>Phosphorylation (\(-PO_4\)):</strong> The addition of phosphate groups. This is often involved in chromosome condensation during cell division and DNA damage response.</li> <li><strong>Ubiquitination:</strong> The attachment of ubiquitin proteins. This is a larger modification that can influence chromatin structure and other histone modifications.</li> </ul> <p>These modifications are written by <strong>“writer”</strong> enzymes (e.g., histone acetyltransferases, HATs) and erased by <strong>“eraser”</strong> enzymes (e.g., histone deacetylases, HDACs). Their presence is interpreted by <strong>“reader”</strong> proteins, which recruit other complexes to either activate or repress transcription.</p> <p>In essence, DNA histone modifications create a “histone code” that expands the information potential of the genome, allowing cells with identical DNA sequences to develop into different types (e.g., muscle cells, neurons) and respond appropriately to environmental cues.</p>]]></content><author><name></name></author><category term="epigenetics"/><category term="chromatin"/><category term="gene-regulation"/><category term="histone-DNA-interaction"/><summary type="html"><![CDATA[Chemical alterations to histone proteins that regulate gene expression without changing DNA sequence.]]></summary></entry><entry><title type="html">Rejection Sampling</title><link href="https://chunde.github.io/blog/2025/Rejection-Sampling-in-LLM-Training/" rel="alternate" type="text/html" title="Rejection Sampling"/><published>2025-09-23T23:37:40+00:00</published><updated>2025-09-23T23:37:40+00:00</updated><id>https://chunde.github.io/blog/2025/Rejection-Sampling-in-LLM-Training</id><content type="html" xml:base="https://chunde.github.io/blog/2025/Rejection-Sampling-in-LLM-Training/"><![CDATA[<p>Rejection sampling is a technique used in large language model (LLM) training, particularly in reinforcement learning from human feedback (RLHF) and related alignment methods. It serves as a simple but effective approach to generate high-quality training data by selectively accepting or rejecting samples based on predefined criteria.</p> <h2 id="core-concept">Core Concept</h2> <p>The process involves:</p> <ol> <li><strong>Generating multiple candidate responses</strong> from the LLM for a given prompt</li> <li><strong>Evaluating each candidate</strong> against a quality metric (typically using a reward model or human raters)</li> <li><strong>Rejecting low-quality samples</strong> and <strong>accepting only the best candidates</strong> for training</li> </ol> <p>Mathematically, if we have a reward function \(R(x)\) that scores sample quality, rejection sampling selects samples where \(R(x) &gt; \tau\) for some threshold \(\tau\).</p> <h2 id="applications-in-llm-training">Applications in LLM Training</h2> <h3 id="reward-modeling">Reward Modeling</h3> <p>In RLHF Phase 2, rejection sampling helps create high-quality demonstration data by filtering out poor responses, ensuring the policy model learns from only the best examples.</p> <h3 id="data-augmentation">Data Augmentation</h3> <p>By generating diverse candidates and selecting the highest-quality ones, rejection sampling creates additional training data that aligns better with human preferences.</p> <h3 id="quality-control">Quality Control</h3> <p>During inference or data generation, rejection sampling acts as a filter to ensure only acceptable outputs are used or presented.</p> <h2 id="advantages-and-limitations">Advantages and Limitations</h2> <p><strong>Advantages:</strong></p> <ul> <li>Simple to implement and understand</li> <li>Requires no gradient computations</li> <li>Can be highly effective when the reward model is accurate</li> </ul> <p><strong>Limitations:</strong></p> <ul> <li>Computationally expensive (requires generating multiple candidates)</li> <li>Sample inefficient (many generations are discarded)</li> <li>Performance depends heavily on the quality of the reward function</li> </ul> <p>Rejection sampling represents a fundamental building block in modern LLM alignment pipelines, often serving as a baseline for more sophisticated sampling techniques like proximal policy optimization (PPO) or best-of-n sampling.</p>]]></content><author><name></name></author><category term="LLM"/><category term="training"/><category term="sampling"/><category term="reinforcement-learning"/><category term="alignment"/><summary type="html"><![CDATA[A method for generating high-quality samples by filtering model outputs against quality criteria]]></summary></entry><entry><title type="html">FastAPI</title><link href="https://chunde.github.io/blog/2025/FastAPI-Overview/" rel="alternate" type="text/html" title="FastAPI"/><published>2025-09-20T23:30:07+00:00</published><updated>2025-09-20T23:30:07+00:00</updated><id>https://chunde.github.io/blog/2025/FastAPI-Overview</id><content type="html" xml:base="https://chunde.github.io/blog/2025/FastAPI-Overview/"><![CDATA[<p>FastAPI is a modern, high-performance web framework for building APIs with Python 3.7+ that is based on standard Python type hints. It is designed to be fast to code, with high performance, and to provide automatic interactive API documentation.</p> <p>Key features of FastAPI include:</p> <ul> <li><strong>High performance</strong>: Built on Starlette for the web parts and Pydantic for the data parts, it is one of the fastest Python frameworks available, comparable to Node.js and Go.</li> <li><strong>Automatic documentation</strong>: Generates interactive API documentation (using Swagger UI and ReDoc) automatically from your code, making it easy to test and understand endpoints.</li> <li><strong>Type safety</strong>: Leverages Python type hints to validate data, serialize and deserialize inputs and outputs, reducing bugs and improving developer productivity.</li> <li><strong>Asynchronous support</strong>: Native support for async and await, making it efficient for handling high levels of concurrency with ASGI.</li> <li><strong>Dependency injection</strong>: Built-in support for dependency injection, simplifying code organization and reuse.</li> </ul> <p>Example of a simple FastAPI endpoint:</p> <div class="language-python highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="kn">from</span> <span class="n">fastapi</span> <span class="kn">import</span> <span class="n">FastAPI</span>

<span class="n">app</span> <span class="o">=</span> <span class="nc">FastAPI</span><span class="p">()</span>

<span class="nd">@app.get</span><span class="p">(</span><span class="sh">"</span><span class="s">/</span><span class="sh">"</span><span class="p">)</span>
<span class="k">def</span> <span class="nf">read_root</span><span class="p">():</span>
    <span class="k">return</span> <span class="p">{</span><span class="sh">"</span><span class="s">Hello</span><span class="sh">"</span><span class="p">:</span> <span class="sh">"</span><span class="s">World</span><span class="sh">"</span><span class="p">}</span>
</code></pre></div></div> <p>This code creates a basic API that returns a JSON response. FastAPI automatically generates docs at <code class="language-plaintext highlighter-rouge">/docs</code> and <code class="language-plaintext highlighter-rouge">/redoc</code>.</p> <p>Overall, FastAPI is ideal for developing RESTful APIs, microservices, and other web services where speed, ease of use, and robustness are priorities.</p>]]></content><author><name></name></author><category term="Python"/><category term="Web-Development"/><category term="API"/><category term="FastAPI"/><summary type="html"><![CDATA[A modern, high-performance web framework for building APIs with Python.]]></summary></entry><entry><title type="html">Scaffolding in Protein Design</title><link href="https://chunde.github.io/blog/2025/Scaffolding-in-Protein-Design/" rel="alternate" type="text/html" title="Scaffolding in Protein Design"/><published>2025-09-20T11:33:59+00:00</published><updated>2025-09-20T11:33:59+00:00</updated><id>https://chunde.github.io/blog/2025/Scaffolding-in-Protein-Design</id><content type="html" xml:base="https://chunde.github.io/blog/2025/Scaffolding-in-Protein-Design/"><![CDATA[<p>In protein design, scaffolding refers to the use of a stable, well-characterized protein framework to support and stabilize functional motifs or domains that may be unstable or non-functional on their own. This approach is essential for creating chimeric or engineered proteins with desired activities, such as enzymes with novel catalytic functions, binding proteins, or biosensors.</p> <h3 id="how-scaffolding-works">How Scaffolding Works:</h3> <ol> <li><strong>Selection of Scaffold</strong>: A robust, structurally stable protein (e.g., a small globular protein like GFP, fibronectin domains, or helical bundles) is chosen as the scaffold. It should have minimal functional interference and high expression compatibility.</li> <li><strong>Integration of Functional Motifs</strong>: Functional elements (e.g., active sites, binding loops, or epitopes) are grafted onto the scaffold, often by replacing or inserting sequences into flexible regions (like loops) or surface-exposed areas. This is done while preserving the scaffold’s overall fold and stability.</li> <li><strong>Stabilization and Optimization</strong>: The scaffold provides a rigid backbone that reduces conformational entropy, enhancing the stability and proper folding of the functional motif. Computational tools (e.g., Rosetta) or directed evolution may be used to optimize interactions and minimize destabilization.</li> <li><strong>Validation</strong>: The designed protein is tested experimentally for stability, function, and structure (e.g., via X-ray crystallography or NMR) to ensure the scaffold supports the intended activity.</li> </ol> <h3 id="key-principles">Key Principles:</h3> <ul> <li><strong>Spatial Positioning</strong>: The scaffold orients functional groups in specific geometries, crucial for activities like catalysis or molecular recognition.</li> <li><strong>Minimized Interference</strong>: Scaffolds are often “defunctionalized” to avoid unwanted interactions.</li> <li><strong>Modularity</strong>: Scaffolds allow modular design, where multiple motifs can be incorporated into one framework.</li> </ul> <h3 id="applications">Applications:</h3> <p>Scaffolding is used in designing enzymes, antibodies, synthetic receptors, and protein-based materials. For example, in enzyme engineering, a scaffold can stabilize a nascent active site, while in biosensors, it can position binding domains for optimal signal transduction.</p> <h3 id="mathematical-representation">Mathematical Representation:</h3> <p>The stability of a scaffolded protein can be modeled using energy functions, where the total free energy \(\Delta G_{\text{total}}\) is a sum of contributions from the scaffold (\(\Delta G_{\text{scaffold}}\)) and the grafted motif (\(\Delta G_{\text{motif}}\)), adjusted for interactions (\(\Delta G_{\text{interaction}}\)):<br/> \(\Delta G_{\text{total}} = \Delta G_{\text{scaffold}} + \Delta G_{\text{motif}} + \Delta G_{\text{interaction}}\)<br/> A negative \(\Delta G_{\text{total}}\) indicates a stable design.</p> <p>By leveraging scaffolding, protein designers can create functional proteins that might otherwise be inaccessible due to instability or misfolding.</p>]]></content><author><name></name></author><category term="protein-design"/><category term="scaffolding"/><category term="structural-biology"/><category term="bioengineering"/><summary type="html"><![CDATA[Scaffolding provides structural support to stabilize functional protein motifs in engineered proteins.]]></summary></entry><entry><title type="html">Cavitron Ultrasonic Surgical Aspirator</title><link href="https://chunde.github.io/blog/2025/Cavitron-Ultrasonic-Surgical-Aspirator/" rel="alternate" type="text/html" title="Cavitron Ultrasonic Surgical Aspirator"/><published>2025-09-05T22:23:46+00:00</published><updated>2025-09-05T22:23:46+00:00</updated><id>https://chunde.github.io/blog/2025/Cavitron-Ultrasonic-Surgical-Aspirator</id><content type="html" xml:base="https://chunde.github.io/blog/2025/Cavitron-Ultrasonic-Surgical-Aspirator/"><![CDATA[<p>The Cavitron Ultrasonic Surgical Aspirator (CUSA) is a sophisticated medical device used in surgery to selectively fragment and remove tissues with high precision, while minimizing damage to surrounding structures. It employs ultrasonic vibrations—typically at frequencies around \(23 \text{kHz}\) to \(36 \text{kHz}\)—generated by a piezoelectric or magnetostrictive transducer. These vibrations are transmitted to a titanium tip, which oscillates longitudinally at amplitudes on the order of micrometers.</p> <p>When the tip contacts tissue, the ultrasonic energy causes cavitation and mechanical disruption, effectively emulsifying or fragmenting the target material (e.g., soft tumors, necrotic tissue, or parenchyma). Simultaneously, an integrated irrigation and aspiration system flushes the area with saline and suctions away the debris, keeping the surgical field clear. The device allows surgeons to control parameters such as amplitude (power), aspiration strength, and irrigation rate, tailoring the effect to tissue consistency—softer tissues are more readily fragmented than firmer, collagen-rich ones.</p> <p>CUSA is particularly valued in neurosurgery for resecting brain tumors (e.g., gliomas or meningiomas) where precision is critical to spare healthy neural tissue, and in hepatic, pancreatic, or other oncologic surgeries for debulking masses. Advantages include reduced bleeding, enhanced visualization, and selective tissue removal, though it requires skill to avoid inadvertent injury. It represents a key tool in minimally invasive and delicate surgical procedures.</p>]]></content><author><name></name></author><category term="surgery"/><category term="ultrasonic"/><category term="CUSA"/><category term="medical-device"/><category term="neurosurgery"/><category term="oncology"/><summary type="html"><![CDATA[A surgical device using ultrasonic vibrations to fragment and aspirate tissue with precision.]]></summary></entry></feed>