Posts
All the articles I've posted.
Evolution Strategies as a Scalable Alternative to Reinforcement Learning
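Published: at 16:40
I've been working on SNN training these past couple of days and need to verify the accuracy of the surrogate gradient I'm using. My advisor suggested reading this paper and using Evolution Strategies to check how accurate the current gradient estimate is.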
SparTA: Deep-Learning Model Sparsity via Tensor-with-Sparsity-Attribute
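Published: at 11:06
SparTA is a DNN compiler with sparsity optimizations: it treats tensor sparsity as an important attribute throughout compilation in order to generate efficient code.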
Scalable Diffusion Models with Transformers
Published: at 16:29
Diffusion Transformer.
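A First Look at AI Infra
Updated: at 18:30, Published: at 16:04
While looking for an internship recently, I took the chance to study and summarize the scattered knowledge I had picked up about model inference/training acceleration, along with some material on CUDA programming and its architecture.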
Conv2Former: A Simple Transformer-Style ConvNet for Visual Recognition
Updated: at 14:57, Published: at 14:39
Replaces self-attention with large-kernel DS convolution. Work from ByteDance Singapore.
SpikeCV: Open a Continuous Computer Vision Era
Updated: at 14:57, Published: at 15:33
An open-source framework for event cameras.
Neuromorphic computing at scale
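Updated: at 14:57, Published: at 22:11
A review published in Nature that discusses the problems and challenges the SNN/neuromorphic computing community currently faces, along with some possible directions for development.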
Titans: Learning to Memorize at Test Time
Updated: at 14:57, Published: at 18:36
A new architecture derived from TTT that tries to improve the model's memory through a TTT-style approach.
Segment Anything
Updated: at 14:57, Published: at 13:48
Meta's SAM.
SDiT: Spiking Diffusion Model with Transformer
Updated: at 14:57, Published: at 14:10
A spiking Diffusion Transformer whose Transformer structure is based on RWKV.