Vision Transformers & Attention

Attention mechanisms, vision transformers, and the architectures replacing convolutions across vision tasks. We unpack how attention is used, misused, and reinvented in current research, from efficient attention branches to Mamba-style state space models.

Weak-Mamba-UNet: How CNN, ViT, and Visual Mamba Collaborate to Segment Medical Images from Scribbles

Weak-Mamba-UNet: How CNN, ViT, and Visual Mamba Collaborate to Segment Medical Images from Scribbles

Weak-Mamba-UNet: How CNN, ViT, and Visual Mamba Collaborate to Segment Medical Images from Scribbles | AI Trend Blend AITrendBlend Machine Learning Computer Vision About Medical AI & Weakly-Supervised Learning · arXiv:2402.10887 · University of Oxford / Mianyang Visual Engineering Center · 25 min read Teaching Three Different Brains to Agree — How Weak-Mamba-UNet Segments Hearts […]

Weak-Mamba-UNet: How CNN, ViT, and Visual Mamba Collaborate to Segment Medical Images from Scribbles Read More »

Mamba-3: Three Simple Ideas That Finally Fix What Transformers Get Wrong at Inference.

Mamba-3: Three Simple Ideas That Finally Fix What Transformers Get Wrong at Inference

Mamba-3: Three Simple Ideas That Finally Fix What Transformers Get Wrong at Inference | AI Trend Blend AITrendBlend Machine Learning NLP & LLMs About Efficient AI · arXiv:2603.15569 · CMU & Princeton · March 2026 · 22 min read Mamba-3: Three Simple Ideas That Finally Fix What Transformers Get Wrong at Inference Time Researchers at

Mamba-3: Three Simple Ideas That Finally Fix What Transformers Get Wrong at Inference Read More »

GateMamba: Feature Gated Mixer in State Space Model for Point Cloud 3D Object Detection.

GateMamba: Feature Gated Mixer in State Space Model for Point Cloud 3D Object Detection

GateMamba: Feature Gated Mixer in State Space Model for Point Cloud 3D Object Detection | AI Trend Blend AITrendBlend Machine Learning Computer Vision About Autonomous Driving AI · ISPRS Journal of Photogrammetry and Remote Sensing 236 (2026) 640–653 · 22 min read GateMamba: How Three Gated Mixers Taught a Mamba Network to Stop Ignoring Cyclists

GateMamba: Feature Gated Mixer in State Space Model for Point Cloud 3D Object Detection Read More »

The Moon's Many Faces: A Single Unified Transformer for Multimodal Lunar Reconstruction

The Moon’s Many Faces: A Single Unified Transformer for Multimodal Lunar Reconstruction

The Moon’s Many Faces: A Single Unified Transformer for Multimodal Lunar Reconstruction | AI Trend Blend AITrendBlend Machine Learning Computer Vision About Planetary AI & 3D Reconstruction · ISPRS J. Photogramm. Remote Sens. 236 (2026) 363–379 · TU Dortmund University · 26 min read The Moon’s Many Faces: How One Transformer Learned to Speak All

The Moon’s Many Faces: A Single Unified Transformer for Multimodal Lunar Reconstruction Read More »

Fusion-Mamba: Hidden State Space Fusion for Cross-Modality Object Detection

Fusion-Mamba: Hidden State Space Fusion for Cross-Modality Object Detection

Fusion-Mamba: Hidden State Space Fusion for Cross-Modality Object Detection | AI Trend Blend AITrendBlend Machine Learning Computer Vision About Computer Vision · arXiv:2404.09146 · Beihang University · 21 min read Mamba Goes Multimodal: How Fusion-Mamba Built a Hidden State Space to End Modality Disparity Researchers at Beihang University asked what happens when you stop treating

Fusion-Mamba: Hidden State Space Fusion for Cross-Modality Object Detection Read More »

BGPANet: How Bi-Granular Progressive Attention Cracked the Skin Cancer Diagnosis Problem

BGPANet: How Bi-Granular Progressive Attention Cracked the Skin Cancer Diagnosis Problem

BGPANet: How Bi-Granular Progressive Attention Cracked the Skin Cancer Diagnosis Problem | AI Medical Research AIMedical Research Machine Learning Medical AI About Medical Image AI · Expert Systems With Applications 321 (2026) 132169 · 16 min read BGPANet: The Bi-Granular Attention Breakthrough That Finally Taught AI to Diagnose Skin Cancer Like a Dermatologist How a

BGPANet: How Bi-Granular Progressive Attention Cracked the Skin Cancer Diagnosis Problem Read More »

CFFormer: Cross CNN-Transformer Attention Model

CFFormer: How Cross CNN-Transformer Attention Finally Solves the Blurry Ultrasound Problem

CFFormer: How Cross CNN-Transformer Attention Finally Solves the Blurry Ultrasound Problem | AI Trend Blend AITrendBlend Machine Learning Computer Vision Medical AI About Medical Image Segmentation · Expert Systems with Applications · 2025 · 24 min read CFFormer: How Cross CNN-Transformer Attention Finally Solves the Blurry Ultrasound Problem Researchers at University of Nottingham Ningbo built

CFFormer: How Cross CNN-Transformer Attention Finally Solves the Blurry Ultrasound Problem Read More »

PraNet-V2: Dual-Supervised Reverse Attention for Medical Image Segmentation.

PraNet-V2: Dual-Supervised Reverse Attention for Medical Image Segmentation

PraNet-V2: Dual-Supervised Reverse Attention for Medical Image Segmentation | AI Trend Blend AITrendBlend Machine Learning Computer Vision About Medical Computer Vision · Computational Visual Media (2026) · 18 min read PraNet-V2: How Dual-Supervised Reverse Attention Finally Fixes Background Blindness in Medical Segmentation Researchers at Nankai University tore apart the reverse attention mechanism they invented five

PraNet-V2: Dual-Supervised Reverse Attention for Medical Image Segmentation Read More »