Tag: 视觉
All the articles with the tag "视觉".
Scalable Diffusion Models with Transformers
Published: at 16:29Diffusion Transformer.
Conv2Former: A Simple Transformer-Style ConvNet for Visual Recognition
Updated: at 14:57Published: at 14:39使用大kernel DS卷积替代self-attention。字节新加坡的工作。
Segment Anything
Updated: at 14:57Published: at 13:48Meta的SAM。
SDiT: Spiking Diffusion Model with Transformer
Updated: at 14:57Published: at 14:10脉冲Diffusion Transformer,里面的Transformer的结构是RWKV的。
ConvUNeXt:An efficient convolution neural network for medical image segmentation
Updated: at 14:57Published: at 15:59ConvNext + UNet,发在一个C刊上,借鉴学习一下,想想我的模块怎么设计。
ConvNext V2: Co-designing and Scaling ConvNets with Masked Autoencoders
Updated: at 14:57Published: at 06:05ConvNext续作,引入了MAE。
A ConvNet for the 2020s
Updated: at 14:57Published: at 15:22CVPR2022。Meta的工作,在ViT相关工作占视觉大头的情况下重构纯卷积的网络,并且取得了很好的效果。
VPRTempo: A Fast Temporally Encoded Spiking Neural Network for Visual Place Recognition
Updated: at 15:06Published: at 15:34ICRA2024的论文,用Temporal Encoding的STDP Direct Training的SNN做场景识别的任务。太简单了
Integer-Valued Training and Spike-Driven Inference Spiking Neural Network for High-performance and Energy-efficient Object Detection
Updated: at 15:06Published: at 12:46SpikeYOLO,中科院自动化所的工作,ECCV2024 Oral