Archive
2025
- 04 Oct论文阅读:Unlearning-Based Neural Interpretations
- 02 OctHugging Face Datasets 库核心用法
- 01 OctPeft 库:LoRA 对网络模型结构的修改
- 01 Oct论文阅读:PINNs for PTV
- 01 Octlm_eval 库简介
- 01 Oct代码阅读:KL 散度正则项计算
- 29 SepLearning Dynamics of LLM Fine-Tuning
- 28 Sep神经正切核 Neural Tangent Kernel, NTK
- 24 Sep线性可控性 Linearized Controllability Property LPC
- 24 Sep论文阅读:Control Theoretic Approach to Fine-Tuning and Transfer Learning
- 22 Septorchinfo 库可视化网络架构
- 22 SepPyTorch 网络结构基础
- 22 SepPeft 获取原模型 get_base_model 的 Bug
- 19 Sep约束问题的 Primal-Dual 算法
- 18 SepApproximate KL Divergence using Fisher Information Matrix
- 17 Sep线性时不变系统的能控性 Controllable
- 15 SepPytorch-Lightning + Peft 实现 LoRA 微调示例
- 15 Sep论文阅读:Parameter-Efficient Fine-Tuning with Controls
- 13 SepPytorch 中的 Dataset、DataLoader
- 13 Sep论文阅读-Benchmarking Optimizers for Large Language Model Pretraining
- 11 SepPytest 中的 Fixture
- 11 SepPytest + VSCode 快速上手
- 10 Sep论文阅读:RL’s Razor Why Online Reinforcement Learning Forgets Less
- 07 SepPytorch Tensor 形状操作总结
- 07 SepPytorch 求导操作总结
- 06 Sep论文阅读:The Super Weight in Large Language Models
- 05 SepPytorch Tensor 数学操作总结
- 04 Sep拉格朗日乘子法
- 01 SepFisher 矩阵与弹性权重巩固 Elastic Weight Consolidation, EWC
- 01 Sep先验、后验概率、贝叶斯公式
- 30 AugMethod of Successive Approximations 通用格式
- 28 Aug论文阅读-Recall and Learn Fine Tuning with Less Forgetting
- 27 AugAdamW: Adam with Decoupled Weight Decay
- 25 AugAdam: Adaptive Moment Estimation
- 22 Aug论文阅读:Method of Successive Approximations
- 21 AugKL 散度、交叉熵与对数似然
- 19 AugMutagen:好用的端口转发与远程文件同步工具
- 19 AugTransformers 库 CLM 预训练与微调代码阅读
- 18 AugLoRA (Low-Rank Adaptation)
- 18 AugLaSalle’s Invariance Principle
- 16 AugHugging Face trl 微调库 SFT 代码阅读
- 16 AugHeavy-ball 与 Momentum 算法等价性
- 16 Aug动力系统稳定性概览
- 15 Aug最优控制 Optimal Control 概览
- 15 Aug论文阅读:连续学习的最优控制方式
- 13 AugTransformers 库 Trainer 使用与自定义优化器
- 12 AugTransformers 库图像分类微调代码阅读
- 09 AugAnaconda 全家桶介绍
- 08 AugPytorch Parameters 结构与 Muon 的调用
- 29 JulMAE 测试环境搭建