Skip to content

zhtjtcz/Mine-Arxiv

Repository files navigation

GitHub forks Gitea Stars

Updated on 2024.04.04

Table of Contents
  1. diffusion
  2. sketch
  3. 3D reconstruction
  4. generate
  5. generation

diffusion

Publish Date Title Authors PDF Code
2024-04-03 Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction Keyu Tian et.al. 2404.02905v1 link
2024-04-03 LidarDM: Generative LiDAR Simulation in a Generated World Vlas Zyrianov et.al. 2404.02903v1 null
2024-04-03 MatAtlas: Text-driven Consistent Geometry Texturing and Material Assignment Duygu Ceylan et.al. 2404.02899v1 null
2024-04-03 On the Scalability of Diffusion-based Text-to-Image Generation Hao Li et.al. 2404.02883v1 null
2024-04-03 Fast Diffusion Model For Seismic Data Noise Attenuation Junheng Peng et.al. 2404.02767v1 null
2024-04-03 Cross-Attention Makes Inference Cumbersome in Text-to-Image Diffusion Models Wentian Zhang et.al. 2404.02747v1 link
2024-04-03 InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation Haofan Wang et.al. 2404.02733v1 link
2024-04-03 Harnessing the Power of Large Vision Language Models for Synthetic Image Detection Mamadou Keita et.al. 2404.02726v1 null
2024-04-02 Diffusion $^2$ : Dynamic 3D Content Generation via Score Composition of Orthogonal Diffusion Models Zeyu Yang et.al. 2404.02148v1 link
2024-04-02 WcDT: World-centric Diffusion Transformer for Traffic Scene Generation Chen Yang et.al. 2404.02082v1 link
2024-04-02 AUTODIFF: Autoregressive Diffusion Modeling for Structure-based Drug Design Xinze Li et.al. 2404.02003v1 null
2024-04-02 Bi-LORA: A Vision-Language Approach for Synthetic Image Detection Mamadou Keita et.al. 2404.01959v1 null
2024-03-29 Relation Rectification in Diffusion Model Yinwei Wu et.al. 2403.20249v1 null
2024-03-29 Graph Neural Aggregation-diffusion with Metastability Kaiyuan Cui et.al. 2403.20221v1 null
2024-03-29 Motion Inversion for Video Customization Luozhou Wang et.al. 2403.20193v1 null
2024-03-29 FreeSeg-Diff: Training-Free Open-Vocabulary Segmentation with Diffusion Models Barbara Toniella Corradini et.al. 2403.20105v1 null
2024-03-29 SGD: Street View Synthesis with Gaussian Splatting and Diffusion Prior Zhongrui Yu et.al. 2403.20079v1 null
2024-03-29 Optimal s-boxes against alternative operations Marco Calderini et.al. 2403.20059v1 null
2024-03-28 GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative Modeling Bowen Zhang et.al. 2403.19655v1 null
2024-03-28 Detecting Image Attribution for Text-to-Image Diffusion Models in RGB and Beyond Katherine Xu et.al. 2403.19653v1 link
2024-03-28 InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction Sirui Xu et.al. 2403.19652v1 null
2024-03-28 GANTASTIC: GAN-based Transfer of Interpretable Directions for Disentangled Image Editing in Text-to-Image Diffusion Models Yusuf Dalva et.al. 2403.19645v1 null
2024-03-28 Generalisation of the Spectral Difference scheme for the diffused-interface five equation model Niccolò Tonicello et.al. 2403.19623v1 null
2024-03-28 Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model Zhicai Wang et.al. 2403.19600v1 link
2024-03-28 Frame by Familiar Frame: Understanding Replication in Video Diffusion Models Aimon Rahman et.al. 2403.19593v1 null
2024-03-28 Keypoint Action Tokens Enable In-Context Imitation Learning in Robotics Norman Di Palo et.al. 2403.19578v1 null
2024-03-27 ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion Daniel Winter et.al. 2403.18818v1 null
2024-03-27 Garment3DGen: 3D Garment Stylization and Texture Generation Nikolaos Sarafianos et.al. 2403.18816v1 null
2024-03-28 ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation Suraj Patni et.al. 2403.18807v2 link
2024-03-27 Object Pose Estimation via the Aggregation of Diffusion Features Tianfu Wang et.al. 2403.18791v1 link
2024-03-27 ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic Object Chenshuang Zhang et.al. 2403.18775v1 link
2024-03-28 FlexEdit: Flexible and Controllable Diffusion-based Object-centric Image Editing Trong-Tung Nguyen et.al. 2403.18605v2 null
2024-03-27 HandBooster: Boosting 3D Hand-Mesh Reconstruction by Conditional Synthesis and Sampling of Hand-Object Interactions Hao Xu et.al. 2403.18575v1 link
2024-03-26 ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture Synthesis Muhammad Hamza Mughal et.al. 2403.17936v1 null
2024-03-26 SLEDGE: Synthesizing Simulation Environments for Driving Agents with Generative Models Kashyap Chitta et.al. 2403.17933v1 null
2024-03-26 AID: Attention Interpolation of Text-to-Image Diffusion Qiyuan He et.al. 2403.17924v1 link
2024-03-26 Boosting Diffusion Models with Moving Average Sampling in Frequency Domain Yurui Qian et.al. 2403.17870v1 null
2024-03-26 The memory of Rayleigh-Taylor turbulence S. Thévenin et.al. 2403.17832v1 null
2024-03-26 DiffH2O: Diffusion-Based Synthesis of Hand-Object Interactions from Textual Descriptions Sammy Christen et.al. 2403.17827v1 null
2024-03-25 Exploiting Priors from 3D Diffusion Models for RGB-Based One-Shot View Planning Sicong Pan et.al. 2403.16803v1 null
2024-03-25 Iso-Diffusion: Improving Diffusion Probabilistic Models Using the Isotropy of the Additive Gaussian Noise Dilum Fernando et.al. 2403.16790v1 null
2024-03-25 Multilevel Modeling as a Methodology for the Simulation of Human Mobility Luca Serena et.al. 2403.16745v1 null
2024-03-25 A Robotic Skill Learning System Built Upon Diffusion Policies and Foundation Models Nils Ingelhag et.al. 2403.16730v1 null
2024-03-25 Improving Diffusion Models's Data-Corruption Resistance using Scheduled Pseudo-Huber Loss Artem Khrapov et.al. 2403.16728v1 link
2024-03-25 The effect of inter-track coupling on H $_2$O$_2$ productions Ramin Abolfath et.al. 2403.16722v1 null
2024-03-25 The Directionality of Gravitational and Thermal Diffusive Transport in Geologic Fluid Storage Anna Herring et.al. 2403.16659v1 null
2024-03-25 SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions Yuda Song et.al. 2403.16627v1 link
2024-03-25 SatSynth: Augmenting Image-Mask Pairs through Diffusion Models for Aerial Semantic Segmentation Aysim Toker et.al. 2403.16605v1 null
2024-03-22 DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data Hanrong Ye et.al. 2403.15389v1 null
2024-03-22 LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis Kevin Xie et.al. 2403.15385v1 null
2024-03-22 Controlled Training Data Generation with Diffusion Models Teresa Yeo et.al. 2403.15309v1 null
2024-03-22 Parametric PDE Control with Deep Reinforcement Learning and Differentiable L0-Sparse Polynomial Policies Nicolò Botteghi et.al. 2403.15267v1 null
2024-03-22 Spectral Motion Alignment for Video Motion Transfer using Diffusion Models Geon Yeong Park et.al. 2403.15249v1 null
2024-03-22 Shadow Generation for Composite Image Using Diffusion model Qingyang Liu et.al. 2403.15234v1 link
2024-03-22 Broad Instantaneous Bandwidth Microwave Spectrum Analyzer with a Microfabricated Atomic Vapor Cell Yongqi Shi et.al. 2403.15155v1 null
2024-03-22 Oxygenation of CO and NO on Amorphous Solid Water Meenu Upadhyay et.al. 2403.15141v1 null
2024-03-21 Simplified Diffusion Schrödinger Bridge Zhicong Tang et.al. 2403.14623v1 link
2024-03-21 GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation Yinghao Xu et.al. 2403.14621v1 link
2024-03-21 Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion Xiang Fan et.al. 2403.14617v1 null
2024-03-21 DreamReward: Text-to-3D Generation with Human Preference Junliang Ye et.al. 2403.14613v1 null
2024-03-21 ReNoise: Real Image Inversion Through Iterative Noising Daniel Garibi et.al. 2403.14602v1 null
2024-03-21 Click to Grasp: Zero-Shot Precise Manipulation via Visual Diffusion Descriptors Nikolaos Tsagkas et.al. 2403.14526v1 null
2024-03-21 Style-Extracting Diffusion Models for Semi-Supervised Histopathology Segmentation Mathias Öttl et.al. 2403.14429v1 null
2024-03-20 On Pretraining Data Diversity for Self-Supervised Learning Hasan Abed Al Kader Hammoud et.al. 2403.13808v1 link
2024-03-20 Editing Massive Concepts in Text-to-Image Diffusion Models Tianwei Xiong et.al. 2403.13807v1 link
2024-03-20 ZigMa: Zigzag Mamba Diffusion Model Vincent Tao Hu et.al. 2403.13802v1 link
2024-03-20 TimeRewind: Rewinding Time with Image-and-Events Video Diffusion Jingxi Chen et.al. 2403.13800v1 null
2024-03-20 DepthFM: Fast Monocular Depth Estimation with Flow Matching Ming Gui et.al. 2403.13788v1 null
2024-03-20 Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation Fu-Yun Wang et.al. 2403.13745v1 link
2024-03-20 Probabilistic Forecasting with Stochastic Interpolants and Föllmer Processes Yifan Chen et.al. 2403.13724v1 null
2024-03-19 FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis Linjiang Huang et.al. 2403.12963v1 link
2024-03-19 FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation Shuai Yang et.al. 2403.12962v1 link
2024-03-19 TexTile: A Differentiable Metric for Texture Tileability Carlos Rodriguez-Pardo et.al. 2403.12961v1 null
2024-03-19 GVGEN: Text-to-3D Generation with Volumetric Representation Xianglong He et.al. 2403.12957v1 null
2024-03-19 Zero-Reference Low-Light Enhancement via Physical Quadruple Priors Wenjing Wang et.al. 2403.12933v1 null
2024-03-19 You Only Sample Once: Taming One-Step Text-To-Image Synthesis by Self-Cooperative Diffusion GANs Yihong Luo et.al. 2403.12931v1 link
2024-03-19 Ultra-High-Resolution Image Synthesis with Pyramid Diffusion Model Jiajie Yang et.al. 2403.12915v1 link
2024-03-19 D-Cubed: Latent Diffusion Trajectory Optimisation for Dexterous Deformable Manipulation Jun Yamada et.al. 2403.12861v1 null
2024-03-18 Generalized Multi-Source Inference for Text Conditioned Music Diffusion Models Emilian Postolache et.al. 2403.11706v1 link
2024-03-19 Urban Scene Diffusion through Semantic Occupancy Map Junge Zhang et.al. 2403.11697v2 null
2024-03-18 Binary Noise for Binary Tasks: Masked Bernoulli Diffusion for Unsupervised Anomaly Detection Julia Wolleb et.al. 2403.11667v1 null
2024-03-18 Diffusion-Based Environment-Aware Trajectory Prediction Theodor Westny et.al. 2403.11643v1 null
2024-03-18 Arc2Face: A Foundation Model of Human Faces Foivos Paraperas Papantoniou et.al. 2403.11641v1 link
2024-03-18 LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models Yang Yang et.al. 2403.11627v1 link
2024-03-18 CRS-Diff: Controllable Generative Remote Sensing Foundation Model Datao Tang et.al. 2403.11614v1 link
2024-03-15 Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives Ronghui Li et.al. 2403.10518v1 link
2024-03-15 MusicHiFi: Fast High-Fidelity Stereo Vocoding Ge Zhu et.al. 2403.10493v1 null
2024-03-15 SculptDiff: Learning Robotic Clay Sculpting from Humans with Goal Conditioned Diffusion Policy Alison Bartsch et.al. 2403.10401v1 null
2024-03-15 Isotropic3D: Image-to-3D Generation Based on a Single CLIP Embedding Pengkun Liu et.al. 2403.10395v1 link
2024-03-15 Denoising Task Difficulty-based Curriculum for Training Diffusion Models Jin-Young Kim et.al. 2403.10348v1 null
2024-03-15 Towards Generalizable Deepfake Video Detection with Thumbnail Layout and Graph Reasoning Yuting Xu et.al. 2403.10261v1 link
2024-03-14 SCP-Diff: Photo-Realistic Semantic Image Synthesis with Spatial-Categorical Joint Prior Huan-ang Gao et.al. 2403.09638v1 null
2024-03-14 3D-VLA: A 3D Vision-Language-Action Generative World Model Haoyu Zhen et.al. 2403.09631v1 null
2024-03-14 Generalized Predictive Model for Autonomous Driving Jiazhi Yang et.al. 2403.09630v1 link
2024-03-14 Make-Your-3D: Fast and Consistent Subject-Driven 3D Content Generation Fangfu Liu et.al. 2403.09625v1 null
2024-03-14 Score-Guided Diffusion for 3D Human Recovery Anastasis Stathopoulos et.al. 2403.09623v1 link
2024-03-14 Explore In-Context Segmentation via Latent Diffusion Models Chaoyang Wang et.al. 2403.09616v1 null
2024-03-14 The effect of spatially-varying collision frequency on the development of the Rayleigh-Taylor instability John Rodman et.al. 2403.09591v1 null
2024-03-14 MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models Zunnan Xu et.al. 2403.09471v1 null
2024-03-14 Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing Wonjun Kang et.al. 2403.09468v1 link
2024-03-13 VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis Enric Corona et.al. 2403.08764v1 null
2024-03-14 GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing Jing Wu et.al. 2403.08733v2 null
2024-03-13 Ambient Diffusion Posterior Sampling: Solving Inverse Problems with Diffusion Models trained on Corrupted Data Asad Aali et.al. 2403.08728v1 link
2024-03-13 Historical Astronomical Diagrams Decomposition in Geometric Primitives Syrine Kalleli et.al. 2403.08721v1 null
2024-03-12 Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation Shihao Zhao et.al. 2403.07860v1 link
2024-03-12 Quantifying and Mitigating Privacy Risks for Tabular Generative Models Chaoyi Zhu et.al. 2403.07842v1 null
2024-03-12 MPCPA: Multi-Center Privacy Computing with Predictions Aggregation based on Denoising Diffusion Probabilistic Model Guibo Luo et.al. 2403.07838v1 null
2024-03-13 SemCity: Semantic Scene Generation with Triplane Diffusion Jumin Lee et.al. 2403.07773v2 link
2024-03-12 Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model Yuxuan Zhang et.al. 2403.07764v1 null
2024-03-13 Visual Decoding and Reconstruction via EEG Embeddings with Guided Diffusion Dongyang Li et.al. 2403.07721v2 link
2024-03-12 SSM Meets Video Diffusion Models: Efficient Video Generation with Structured State Spaces Yuta Oshima et.al. 2403.07711v1 link
2024-03-12 Genuine Knowledge from Practice: Diffusion Test-Time Adaptation for Video Adverse Weather Removal Yijun Yang et.al. 2403.07684v1 null
2024-03-11 BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion Xuan Ju et.al. 2403.06976v1 link
2024-03-11 Bayesian Diffusion Models for 3D Shape Reconstruction Haiyang Xu et.al. 2403.06973v1 null
2024-03-11 SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data Jialu Li et.al. 2403.06952v1 null
2024-03-12 DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations Tianhao Qi et.al. 2403.06951v2 link
2024-03-08 VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models Yabo Zhang et.al. 2403.05438v1 link
2024-03-08 DiffSF: Diffusion Models for Scene Flow Estimation Yushan Zhang et.al. 2403.05327v1 link
2024-03-07 ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes Hashmat Shadab Malik et.al. 2403.04701v1 link
2024-03-07 Delving into the Trajectory Long-tail Distribution for Muti-object Tracking Sijia Chen et.al. 2403.04700v1 link
2024-03-07 PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation Junsong Chen et.al. 2403.04692v1 null
2024-03-07 Pix2Gif: Motion-Guided Diffusion for GIF Generation Hitesh Kandala et.al. 2403.04634v1 null
2024-03-06 3D Diffusion Policy Yanjie Ze et.al. 2403.03954v1 link
2024-03-06 GUIDE: Guidance-based Incremental Learning with Diffusion Models Bartosz Cywiński et.al. 2403.03938v1 link
2024-03-06 Hierarchical Diffusion Policy for Kinematics-Aware Multi-Task Robotic Manipulation Xiao Ma et.al. 2403.03890v1 null
2024-03-06 Latent Dataset Distillation with Diffusion Models Brian B. Moser et.al. 2403.03881v1 null
2024-03-06 Accelerating Convergence of Score-Based Diffusion Models, Provably Gen Li et.al. 2403.03852v1 null
2024-03-06 Diffusion on language model embeddings for protein sequence generation Viacheslav Meshchaninov et.al. 2403.03726v1 null
2024-03-05 Scaling Rectified Flow Transformers for High-Resolution Image Synthesis Patrick Esser et.al. 2403.03206v1 null
2024-03-05 MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets Hossein Aboutalebi et.al. 2403.03194v1 null
2024-03-05 Behavior Generation with Latent Actions Seungjae Lee et.al. 2403.03181v1 link
2024-03-05 Enhanced beam-beam modeling to include longitudinal variation during weak-strong simulation Derong Xu et.al. 2403.03137v1 null
2024-03-02 Bespoke Non-Stationary Solvers for Fast Sampling of Diffusion and Flow Models Neta Shaul et.al. 2403.01329v1 null
2024-03-02 Anomalous mass dependency in Hydra endoderm cell cluster diffusion Aline Lütz et.al. 2403.01294v1 null
2024-03-02 DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction Junwen Xiong et.al. 2403.01226v1 null
2024-03-02 TCIG: Two-Stage Controlled Image Generation with Quality Enhancement through Diffusion Salaheldin Mohamed et.al. 2403.01212v1 null
2024-02-29 DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models Muyang Li et.al. 2402.19481v1 link
2024-02-29 Structure Preserving Diffusion Models Haoye Lu et.al. 2402.19369v1 null
2024-02-29 A Novel Approach to Industrial Defect Generation through Blended Latent Diffusion Model with Online Adaptation Hanxi Li et.al. 2402.19330v1 link
2024-02-29 DiffAssemble: A Unified Graph-Diffusion Model for 2D and 3D Reassembly Gianluca Scarpellini et.al. 2402.19302v1 link
2024-02-29 Generative models struggle with kirigami metamaterials Gerrit Felsch et.al. 2402.19196v1 null
2024-02-28 Diffusion Language Models Are Versatile Protein Learners Xinyou Wang et.al. 2402.18567v1 null
2024-02-28 Photon statistics of resonantly driven spectrally diffusive quantum emitters Aymeric Delteil et.al. 2402.18542v1 null
2024-02-28 Dynamical Regimes of Diffusion Models Giulio Biroli et.al. 2402.18491v1 null
2024-02-28 Objective and Interpretable Breast Cosmesis Evaluation with Attention Guided Denoising Diffusion Anomaly Detection Model Sangjoon Park et.al. 2402.18362v1 null
2024-02-27 Diffusion Meets DAgger: Supercharging Eye-in-hand Imitation Learning Xiaoyu Zhang et.al. 2402.17768v1 null
2024-02-27 Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners Yazhou Xing et.al. 2402.17723v1 null
2024-02-27 Structure-Guided Adversarial Training of Diffusion Models Ling Yang et.al. 2402.17563v1 null
2024-02-27 Scribble Hides Class: Promoting Scribble-Based Weakly-Supervised Semantic Segmentation with Its Class Label Xinliang Zhang et.al. 2402.17555v1 link
2024-02-27 Diffusion Model-Based Image Editing: A Survey Yi Huang et.al. 2402.17525v1 link
2024-02-27 Label-Noise Robust Diffusion Models Byeonghu Na et.al. 2402.17517v1 link
2024-02-27 The Unwanted Dissemination of Science: The Usage of Academic Articles as Ammunition in Contested Discursive Arenas on Twitter Richard Zhang et.al. 2402.17495v1 null
2024-02-27 EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions Linrui Tian et.al. 2402.17485v1 null
2024-02-26 Stochastic Conditional Diffusion Models for Semantic Image Synthesis Juyeon Ko et.al. 2402.16506v1 null
2024-02-26 Outline-Guided Object Inpainting with Diffusion Models Markus Pobitzer et.al. 2402.16421v1 null
2024-02-26 Placing Objects in Context via Inpainting for Out-of-distribution Segmentation Pau de Jorge et.al. 2402.16392v1 link
2024-02-26 Generative AI in Vision: A Survey on Models, Metrics and Applications Gaurav Raut et.al. 2402.16369v1 null
2024-02-26 Feedback Efficient Online Fine-Tuning of Diffusion Models Masatoshi Uehara et.al. 2402.16359v1 null
2024-02-26 Referee Can Play: An Alternative Approach to Conditional Generation via Model Inversion Xuantong Liu et.al. 2402.16305v1 null
2024-02-26 Graph Diffusion Policy Optimization Yijing Liu et.al. 2402.16302v1 link
2024-02-23 Seamless Human Motion Composition with Blended Positional Encodings German Barquero et.al. 2402.15509v1 link
2024-02-23 Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition Chun-Hsiao Yeh et.al. 2402.15504v1 link
2024-02-23 Solute transport due to periodic loading in a soft porous material Matilde Fiori et.al. 2402.15451v1 null
2024-02-23 ProTIP: Probabilistic Robustness Verification on Text-to-Image Diffusion Models against Stochastic Perturbation Yi Zhang et.al. 2402.15429v1 link
2024-02-23 Understanding Oversmoothing in Diffusion-Based GNNs From the Perspective of Operator Semigroup Theory Weichen Zhao et.al. 2402.15326v1 null
2024-02-23 Let's Rectify Step by Step: Improving Aspect-based Sentiment Analysis with Diffusion Models Shunyu Liu et.al. 2402.15289v1 link
2024-02-22 Cameras as Rays: Pose Estimation via Ray Diffusion Jason Y. Zhang et.al. 2402.14817v1 null
2024-02-22 GeneOH Diffusion: Towards Generalizable Hand-Object Interaction Denoising via Denoising Diffusion Xueyi Liu et.al. 2402.14810v1 link
2024-02-22 Consolidating Attention Features for Multi-view Image Editing Or Patashnik et.al. 2402.14792v1 null
2024-02-22 Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models Yixuan Ren et.al. 2402.14780v1 null
2024-02-22 Two-stage Cytopathological Image Synthesis for Augmenting Cervical Abnormality Screening Zhenrong Shen et.al. 2402.14707v1 null
2024-02-22 Debiasing Text-to-Image Diffusion Models Ruifei He et.al. 2402.14577v1 null
2024-02-22 DynGMA: a robust approach for learning stochastic differential equations from data Aiqing Zhu et.al. 2402.14475v1 link
2024-02-21 D-Flow: Differentiating through Flows for Controlled Generation Heli Ben-Hamu et.al. 2402.14017v1 null
2024-02-21 SDXL-Lightning: Progressive Adversarial Diffusion Distillation Shanchuan Lin et.al. 2402.13929v1 null
2024-02-21 Non-asymptotic Convergence of Discrete-time Diffusion Models: New Approach and Improved Rate Yuchen Liang et.al. 2402.13901v1 null
2024-02-21 NeuralDiffuser: Controllable fMRI Reconstruction with Primary Visual Feature Guided Diffusion Haoyu Li et.al. 2402.13809v1 null
2024-02-21 The Geography of Information Diffusion in Online Discourse on Europe and Migration Elisa Leonardelli et.al. 2402.13800v1 null
2024-02-21 Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions Jiayu Chen et.al. 2402.13777v1 link
2024-02-21 Music Style Transfer with Time-Varying Inversion of Diffusion Models Sifei Li et.al. 2402.13763v1 null
2024-02-20 Neural Network Diffusion Kai Wang et.al. 2402.13144v1 link
2024-02-20 Excited state-specific CASSCF theory for the torsion of ethylene Sandra Saade et.al. 2402.13046v1 null
2024-02-20 Text-Guided Molecule Generation with Diffusion Language Model Haisong Gong et.al. 2402.13040v1 link
2024-02-20 Visual Style Prompting with Swapping Self-Attention Jaeseok Jeong et.al. 2402.12974v1 link
2024-02-20 CLIPping the Deception: Adapting Vision-Language Models for Universal Deepfake Detection Sohail Ahmed Khan et.al. 2402.12927v1 null
2024-02-20 RealCompo: Dynamic Equilibrium between Realism and Compositionality Improves Text-to-Image Diffusion Models Xinchen Zhang et.al. 2402.12908v1 link
2024-02-19 FiT: Flexible Vision Transformer for Diffusion Model Zeyu Lu et.al. 2402.12376v1 link
2024-02-19 Analysis of Persian News Agencies on Instagram, A Words Co-occurrence Graph-based Approach Mohammad Heydari et.al. 2402.12272v1 null
2024-02-19 Synthetic location trajectory generation using categorical diffusion models Simon Dirmeier et.al. 2402.12242v1 link
2024-02-19 Diffusion Tempering Improves Parameter Estimation with Probabilistic Integrators for Ordinary Differential Equations Jonas Beck et.al. 2402.12231v1 link
2024-02-19 Adversarial Feature Alignment: Balancing Robustness and Accuracy in Deep Learning via Adversarial Training Leo Hyun Park et.al. 2402.12187v1 null
2024-02-19 Human Video Translation via Query Warping Haiming Zhu et.al. 2402.12099v1 null
2024-02-16 Fusion of Diffusion Weighted MRI and Clinical Data for Predicting Functional Outcome after Acute Ischemic Stroke with Deep Contrastive Learning Chia-Ling Tsai et.al. 2402.10894v1 null
2024-02-16 3D Diffuser Actor: Policy Diffusion with 3D Scene Representations Tsung-Wei Ke et.al. 2402.10885v1 null
2024-02-16 Control Color: Multimodal Diffusion-based Interactive Image Colorization Zhexin Liang et.al. 2402.10855v1 null
2024-02-16 Training Class-Imbalanced Diffusion Model Via Overlap Optimization Divin Yan et.al. 2402.10821v1 link
2024-02-16 VATr++: Choose Your Words Wisely for Handwritten Text Generation Bram Vanherle et.al. 2402.10798v1 null
2024-02-16 Rethinking Human-like Translation Strategy: Integrating Drift-Diffusion Model with Large Language Models for Machine Translation Hongbin Na et.al. 2402.10699v1 null
2024-02-16 Decomposition for Enhancing Attention: Improving LLM-based Text-to-SQL through Workflow Paradigm Yuanzhen Xie et.al. 2402.10671v1 link
2024-02-15 Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation Huizhuo Yuan et.al. 2402.10210v1 null
2024-02-15 Recovering the Pre-Fine-Tuning Weights of Generative Models Eliahu Horwitz et.al. 2402.10208v1 link
2024-02-15 Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment Rui Yang et.al. 2402.10207v1 link
2024-02-15 Energy Flux Decomposition in Magnetohydrodynamic Turbulence D. Capocci et.al. 2402.10125v1 null
2024-02-15 Collision efficiency of droplets across diffusive, electrostatic and inertial regimes Florian Poydenot et.al. 2402.10117v1 null
2024-02-15 Quantized Embedding Vectors for Controllable Diffusion Language Models Cheng Kang et.al. 2402.10107v1 null
2024-02-15 Classification Diffusion Models Shahar Yadin et.al. 2402.10095v1 null
2024-02-14 Magic-Me: Identity-Specific Video Customized Diffusion Ze Ma et.al. 2402.09368v1 link
2024-02-14 Leveraging Pre-Trained Autoencoders for Interpretable Prototype Learning of Music Audio Pablo Alonso-Jiménez et.al. 2402.09318v1 null
2024-02-14 Synthesizing Knowledge-enhanced Features for Real-world Zero-shot Food Detection Pengfei Zhou et.al. 2402.09242v1 link
2024-02-13 IM-3D: Iterative Multiview Diffusion and Reconstruction for High-Quality 3D Generation Luke Melas-Kyriazi et.al. 2402.08682v1 null
2024-02-13 Target Score Matching Valentin De Bortoli et.al. 2402.08667v1 null
2024-02-13 Learning Continuous 3D Words for Text-to-Image Generation Ta-Ying Cheng et.al. 2402.08654v1 null
2024-02-13 Latent Inversion with Timestep-aware Sampling for Training-free Non-rigid Editing Yunji Jung et.al. 2402.08601v1 null
2024-02-13 Denoising Diffusion Restoration Tackles Forward and Inverse Problems for the Laplace Operator Amartya Mukherjee et.al. 2402.08563v1 null
2024-02-13 Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases Ziyi Zhang et.al. 2402.08552v1 null
2024-02-13 Hyperballistic transport in dense ionized matter under external AC electric fields Daniele Gamba et.al. 2402.08519v1 null
2024-02-12 Label-Efficient Model Selection for Text Generation Shir Ashury-Tahan et.al. 2402.07891v1 null
2024-02-12 High-order harmonic generation in 2D Transition Metal Disulphides Jose Manuel Iglesias et.al. 2402.07850v1 null
2024-02-12 Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models Jiacheng Ye et.al. 2402.07754v1 link
2024-02-12 Topological Edge States in Reconfigurable Multi-stable Mechanical Metamaterials Zhen Wang et.al. 2402.07707v1 null
2024-02-12 Higher-order Connection Laplacians for Directed Simplicial Complexes Xue Gong et.al. 2402.07631v1 null
2024-02-09 Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous Driving and Zero-Shot Instruction Following Brian Yang et.al. 2402.06559v1 null
2024-02-09 Sequential Flow Matching for Generative Modeling Jongmin Yoon et.al. 2402.06461v1 null
2024-02-09 ControlUDA: Controllable Diffusion-assisted Unsupervised Domain Adaptation for Cross-Weather Semantic Segmentation Fengyi Shen et.al. 2402.06446v1 null
2024-02-09 Improving 2D-3D Dense Correspondences with Diffusion Models for 6D Object Pose Estimation Peter Hönig et.al. 2402.06436v1 null
2024-02-09 Enhanced bubble growth near an advancing solidification front Jochem G. Meijer et.al. 2402.06409v1 null
2024-02-08 InstaGen: Enhancing Object Detection by Training on Synthetic Dataset Chengjian Feng et.al. 2402.05937v1 null
2024-02-08 Time Series Diffusion in the Frequency Domain Jonathan Crabbé et.al. 2402.05933v1 link
2024-02-08 AvatarMMC: 3D Head Avatar Generation and Editing with Multi-Modal Conditioning Wamiq Reyaz Para et.al. 2402.05803v1 null
2024-02-08 Determining the significance and relative importance of parameters of a simulated quenching algorithm using statistical tools Pedro A. Castillo et.al. 2402.05791v1 null
2024-02-08 DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer Zhiyuan Ma et.al. 2402.05712v1 link
2024-02-08 Scalable Diffusion Models with State Space Backbone Zhengcong Fei et.al. 2402.05608v1 link
2024-02-07 On diffusion models for amortized inference: Benchmarking and improving stochastic control and sampling Marcin Sendera et.al. 2402.05098v1 link
2024-02-07 NITO: Neural Implicit Fields for Resolution-free Topology Optimization Amin Heyrani Nobari et.al. 2402.05073v1 null
2024-02-07 LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation Jiaxiang Tang et.al. 2402.05054v1 null
2024-02-06 SHIELD : An Evaluation Benchmark for Face Spoofing and Forgery Detection with Multimodal Large Language Models Yichen Shi et.al. 2402.04178v1 link
2024-02-06 Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning Ruoqi Zhang et.al. 2402.04080v1 link
2024-02-06 Generative Modeling of Graphs via Joint Diffusion of Node and Edge Attributes Nimrod Berman et.al. 2402.04046v1 null
2024-02-06 Polyp-DDPM: Diffusion-Based Semantic Polyp Synthesis for Enhanced Segmentation Zolnamar Dorjsembe et.al. 2402.04031v1 link
2024-02-06 Space Group Constrained Crystal Generation Rui Jiao et.al. 2402.03992v1 null
2024-02-06 Controllable Diverse Sampling for Diffusion Based Motion Behavior Forecasting Yiming Xu et.al. 2402.03981v1 null
2024-02-06 Weibel- and non-resonant Whistler wave growth in an expanding plasma in a 1D simulation geometry M E Dieckmann et.al. 2402.03925v1 null
2024-02-05 Do Diffusion Models Learn Semantically Meaningful and Efficient Representations? Qiyao Liang et.al. 2402.03305v1 null
2024-02-05 Zero-shot Object-Level OOD Detection with Context-Aware Inpainting Quang-Huy Nguyen et.al. 2402.03292v1 null
2024-02-05 InstanceDiffusion: Instance-level Control for Image Generation Xudong Wang et.al. 2402.03290v1 link
2024-02-05 Organic or Diffused: Can We Distinguish Human Art from AI-generated Images? Anna Yoo Jeong Ha et.al. 2402.03214v1 null
2024-02-05 Light and Optimal Schrödinger Bridge Matching Nikita Gushchin et.al. 2402.03207v1 link
2024-02-05 Guidance with Spherical Gaussian Constraint for Conditional Diffusion Lingxiao Yang et.al. 2402.03201v1 null
2024-02-05 Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion Shiyuan Yang et.al. 2402.03162v1 null
2024-02-05 DARTS: Diffusion Approximated Residual Time Sampling for Low Variance Time-of-flight Rendering in Homogeneous Scattering Medium Qianyue He et.al. 2402.03106v1 null
2024-02-02 NeuroCine: Decoding Vivid Video Sequences from Human Brain Activties Jingyuan Sun et.al. 2402.01590v1 null
2024-02-02 Boximator: Generating Rich and Controllable Motions for Video Synthesis Jiawei Wang et.al. 2402.01566v1 null
2024-02-02 Low-Resource Cross-Domain Singing Voice Synthesis via Reduced Self-Supervised Speech Representations Panos Kakoulidis et.al. 2402.01520v1 null
2024-02-02 Cross-view Masked Diffusion Transformers for Person Image Synthesis Trung X. Pham et.al. 2402.01516v1 null
2024-02-01 AToM: Amortized Text-to-Mesh using 2D Diffusion Guocheng Qian et.al. 2402.00867v1 null
2024-02-01 ViCA-NeRF: View-Consistency-Aware 3D Editing of Neural Radiance Fields Jiahua Dong et.al. 2402.00864v1 link
2024-02-01 Distilling Conditional Diffusion Models for Offline Reinforcement Learning through Trajectory Stitching Shangzhe Li et.al. 2402.00807v1 null
2024-02-01 AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning Fu-Yun Wang et.al. 2402.00769v1 link
2024-02-01 CapHuman: Capture Your Moments in Parallel Universes Chao Liang et.al. 2402.00627v1 link
2024-02-01 Diffusion-based Light Field Synthesis Ruisheng Gao et.al. 2402.00575v1 null
2024-01-31 Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators Daniel Geng et.al. 2401.18085v1 null
2024-01-31 An electrodynamic wave model for the action potential Vitaly L. Galinsky et.al. 2401.18051v1 null
2024-01-31 Investigation of Microstructure and Corrosion Resistance of Ti-Al-V Titanium Alloys Obtained by Spark Plasma Sintering Aleksey Nokhrin et.al. 2401.17941v1 null
2024-01-31 AEROBLADE: Training-Free Detection of Latent Diffusion Images Using Autoencoder Reconstruction Error Jonas Ricker et.al. 2401.17879v1 link
2024-01-30 You Only Need One Step: Fast Super-Resolution with Stable Diffusion via Scale Distillation Mehdi Noroozi et.al. 2401.17258v1 null
2024-01-30 ContactGen: Contact-Guided Interactive 3D Human Generation for Partners Dongjun Gu et.al. 2401.17212v1 null
2024-01-30 Transfer Learning for Text Diffusion Models Kehang Han et.al. 2401.17181v1 null
2024-01-29 Diffutoon: High-Resolution Editable Toon Shading via Diffusion Models Zhongjie Duan et.al. 2401.16224v1 null
2024-01-29 Rapidly rotating radiatively driven convection: experimental and numerical validation of the `geostrophic turbulence' scaling predictions Gabriel Hadjerci et.al. 2401.16200v1 null
2024-01-29 Spatial-Aware Latent Initialization for Controllable Image Generation Wenqiang Sun et.al. 2401.16157v1 null
2024-01-29 Acoustic Screens based on Sonic Crystals with high Diffusion properties M. P. Peiró-Torres et.al. 2401.16074v1 null
2024-01-26 Annotated Hands for Generative Models Yue Yang et.al. 2401.15075v1 link
2024-01-26 Emulating Complex Synapses Using Interlinked Proton Conductors Lifu Zhang et.al. 2401.15045v1 null
2024-01-26 DAM: Diffusion Activation Maximization for 3D Global Explanations Hanxiao Tan et.al. 2401.14938v1 link
2024-01-26 Social norms and cooperation in higher-order networks Yin-Jie Ma et.al. 2401.14905v1 null
2024-01-25 Deconstructing Denoising Diffusion Models for Self-Supervised Learning Xinlei Chen et.al. 2401.14404v1 null
2024-01-25 pix2gestalt: Amodal Segmentation by Synthesizing Wholes Ege Ozguroglu et.al. 2401.14398v1 link
2024-01-25 Manifold GCN: Diffusion-based Convolutional Neural Network for Manifold-valued Graphs Martin Hanik et.al. 2401.14381v1 null
2024-01-25 UrbanGenAI: Reconstructing Urban Landscapes using Panoptic Segmentation and Diffusion Models Timo Kapsalis et.al. 2401.14379v1 null
2024-01-25 Modeling Global Surface Dust Deposition Using Physics-Informed Neural Networks Constanza A. Molina Catricheo et.al. 2401.14372v1 link
2024-01-25 Sketch2NeRF: Multi-view Sketch-guided Text-to-3D Generation Minglin Chen et.al. 2401.14257v1 null
2024-01-24 Bi-Hamiltonian in Semiflexible Polymer as Strongly Coupled System Heeyuen Koh et.al. 2401.13655v1 null
2024-01-24 On the self-similarity of unbounded viscous Marangoni flows Fernando Temprano-Coleto et.al. 2401.13647v1 null
2024-01-24 Winding Clearness for Differentiable Point Cloud Optimization Dong Xiao et.al. 2401.13639v1 null
2024-01-24 Guided Diffusion for Fast Inverse Design of Density-based Mechanical Metamaterials Yanyan Yang et.al. 2401.13570v1 null
2024-01-24 Expressive Acoustic Guitar Sound Synthesis with an Instrument-Specific Input Representation and Diffusion Outpainting Hounsu Kim et.al. 2401.13498v1 null
2024-01-23 GALA: Generating Animatable Layered Assets from a Single Scan Taeksoo Kim et.al. 2401.12979v1 null
2024-01-23 Zero-Shot Learning for the Primitives of 3D Affordance in General Objects Hyeonwoo Kim et.al. 2401.12978v1 null
2024-01-23 Lumiere: A Space-Time Diffusion Model for Video Generation Omer Bar-Tal et.al. 2401.12945v1 null
2024-01-23 Long-range three-dimensional tracking of nanoparticles using interferometric scattering (iSCAT) microscopy Kiarash Kasaian et.al. 2401.12939v1 null
2024-01-22 DITTO: Diffusion Inference-Time T-Optimization for Music Generation Zachary Novack et.al. 2401.12179v1 null
2024-01-22 Single-View 3D Human Digitalization with Large Reconstruction Models Zhenzhen Weng et.al. 2401.12175v1 null
2024-01-22 Improved accuracy of continuum surface flux models for metal additive manufacturing melt pool simulations Nils Much et.al. 2401.12114v1 null
2024-01-22 Experimental investigation and scale analysis on melting of salty ice in a 3D-printed cavity filled with porous media Xiaotian Liand Yuming Wang et.al. 2401.12009v1 null
2024-01-22 Claim Detection for Automated Fact-checking: A Survey on Monolingual, Multilingual and Cross-Lingual Research Rrubaa Panchendrarajan et.al. 2401.11969v1 null
2024-01-22 Feature Denoising Diffusion Model for Blind Image Quality Assessment Xudong Li et.al. 2401.11949v1 null
2024-01-19 Synthesizing Moving People with 3D Control Boyi Li et.al. 2401.10889v1 null
2024-01-19 ActAnywhere: Subject-Aware Video Background Generation Boxiao Pan et.al. 2401.10822v1 null
2024-01-19 Sat2Scene: 3D Urban Scene Generation from Satellite Images with Diffusion Zuoyue Li et.al. 2401.10786v1 null
2024-01-19 Signatures of s-wave scattering in bound electronic states Robin E. Moorby et.al. 2401.10714v1 null
2024-01-19 Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model Yinan Zheng et.al. 2401.10700v1 link
2024-01-19 Refractive index measurement of pharmaceutical powders in the short-wave infrared range using index matching assisted with phase imaging Cory Juntunen et.al. 2401.10667v1 null
2024-01-19 Analysis of the Patent of a Protective Cover for Vertical-Axis Wind Turbines (VAWTs): Simulations of Wind Flow JA Moleón Baca et.al. 2401.10656v1 null
2024-01-18 A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting Wouter Van Gansbeke et.al. 2401.10227v1 link
2024-01-18 Towards Language-Driven Video Inpainting via Multimodal Large Language Models Jianzong Wu et.al. 2401.10226v1 null
2024-01-18 Motion-Zero: Zero-Shot Moving Object Control Framework for Diffusion-Based Video Generation Changgu Chen et.al. 2401.10150v1 null
2024-01-18 DiffusionGPT: LLM-Driven Text-to-Image Generation System Jie Qin et.al. 2401.10061v1 null
2024-01-18 CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects Zhao Wang et.al. 2401.09962v1 null
2024-01-17 TextureDreamer: Image-guided Texture Synthesis through Geometry-aware Diffusion Yu-Ying Yeh et.al. 2401.09416v1 null
2024-01-17 Vlogger: Make Your Dream A Vlog Shaobin Zhuang et.al. 2401.09414v1 link
2024-01-17 Siamese Meets Diffusion Network: SMDNet for Enhanced Change Detection in High-Resolution RS Imagery Jia Jia et.al. 2401.09325v1 null
2024-01-17 Tailoring chaotic motion of microcavity photons in ray and wave dynamics by tuning the curvature of space Wei Lin et.al. 2401.09303v1 null
2024-01-17 T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis Yoonjin Chung et.al. 2401.09294v1 null
2024-01-16 Robotic Imitation of Human Actions Josua Spisak et.al. 2401.08381v1 null
2024-01-16 Optimization of the plasmonic properties of titanium nitride films sputtered at room temperature through microstructure and thickness control Mateusz Nieborek et.al. 2401.08353v1 null
2024-01-16 Modeling Spoof Noise by De-spoofing Diffusion and its Application in Face Anti-spoofing Bin Zhang et.al. 2401.08275v1 null
2024-01-16 Multi-scale 2D Temporal Map Diffusion Models for Natural Language Video Localization Chongzhi Zhang et.al. 2401.08232v1 null
2024-01-12 Decoupling Pixel Flipping and Occlusion Strategy for Consistent XAI Benchmarks Stefan Blücher et.al. 2401.06654v1 link
2024-01-12 Adversarial Examples are Misaligned in Diffusion Model Manifolds Peter Lorenz et.al. 2401.06637v1 null
2024-01-12 Motion2VecSets: 4D Latent Vector Set Diffusion for Non-rigid Shape Reconstruction and Tracking Wei Cao et.al. 2401.06614v1 null
2024-01-12 360DVD: Controllable Panorama Video Generation with 360-Degree Video Diffusion Model Qian Wang et.al. 2401.06578v1 null
2024-01-11 E $^{2}$ GAN: Efficient Training of Efficient GANs for Image-to-Image Translation Yifan Gong et.al. 2401.06127v1 null
2024-01-11 Numerical thermalization in 2D PIC simulations: Practical estimates for low temperature plasma simulations Sierra Jubin et.al. 2401.06057v1 null
2024-01-11 DiffDA: a diffusion model for weather-scale data assimilation Langwen Huang et.al. 2401.05932v1 null
2024-01-11 Efficient Image Deblurring Networks based on Diffusion Models Kang Chen et.al. 2401.05907v1 link
2024-01-10 InseRF: Text-Driven Generative Object Insertion in Neural 3D Scenes Mohamad Shahbazi et.al. 2401.05335v1 null
2024-01-10 Score Distillation Sampling with Learned Manifold Corrective Thiemo Alldieck et.al. 2401.05293v1 null
2024-01-10 PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models Junsong Chen et.al. 2401.05252v1 link
2024-01-10 Derm-T2IM: Harnessing Synthetic Skin Lesion Data via Stable Diffusion Models for Enhanced Skin Disease Classification using ViT and CNN Muhammad Ali Farooq et.al. 2401.05159v1 null
2024-01-10 CrossDiff: Exploring Self-Supervised Representation of Pansharpening via Cross-Predictive Diffusion Model Yinghui Xing et.al. 2401.05153v1 null
2024-01-09 Morphable Diffusion: 3D-Consistent Diffusion for Single-image Avatar Creation Xiyi Chen et.al. 2401.04728v1 null
2024-01-09 EmoGen: Emotional Image Content Generation with Text-to-Image Diffusion Models Jingyuan Yang et.al. 2401.04608v1 null
2024-01-09 Enhanced Distribution Alignment for Post-Training Quantization of Diffusion Models Xuewen Liu et.al. 2401.04585v1 link
2024-01-09 MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation Weimin Wang et.al. 2401.04468v1 null
2024-01-09 D3AD: Dynamic Denoising Diffusion Probabilistic Model for Anomaly Detection Justin Tebbe et.al. 2401.04463v1 link
2024-01-08 D3PRefiner: A Diffusion-based Denoise Method for 3D Human Pose Refinement Danqi Yan et.al. 2401.03914v1 null
2024-01-05 Uncovering the human motion pattern: Pattern Memory-based Diffusion Model for Trajectory Prediction Yuxin Yang et.al. 2401.02916v1 null
2024-01-05 Plug-in Diffusion Model for Sequential Recommendation Haokai Ma et.al. 2401.02913v1 link
2024-01-05 Generating Non-Stationary Textures using Self-Rectification Yang Zhou et.al. 2401.02847v1 link
2024-01-05 Diffbody: Diffusion-based Pose and Shape Editing of Human Images Yuta Okuyama et.al. 2401.02804v1 link
2024-01-05 Diffusion Variational Inference: Diffusion Models as Expressive Variational Posteriors Top Piriyakulkij et.al. 2401.02739v1 null
2024-01-05 Geometric-Facilitated Denoising Diffusion Model for 3D Molecule Generation Can Xu et.al. 2401.02683v1 link
2024-01-04 Bring Metric Functions into Diffusion Models Jie An et.al. 2401.02414v1 null
2024-01-04 Image denoising and model-independent parameterization for improving IVIM MRI Caleb Sample et.al. 2401.02394v1 null
2024-01-04 Integration of physics-informed operator learning and finite element method for parametric learning of partial differential equations Shahed Rezaei et.al. 2401.02363v1 null
2024-01-04 Robust Physics Informed Neural Networks Marcin Łoś et.al. 2401.02300v1 null
2024-01-03 From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations Evonne Ng et.al. 2401.01885v1 link
2024-01-03 DGDNN: Decoupled Graph Diffusion Neural Network for Stock Movement Prediction Zinuo You et.al. 2401.01846v1 link
2024-01-03 Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions David Junhao Zhang et.al. 2401.01827v1 link
2024-01-03 aMUSEd: An Open MUSE Reproduction Suraj Patil et.al. 2401.01808v1 link
2024-01-03 Short-time expansion of one-dimensional Fokker-Planck equations with heterogeneous diffusion Tom Dupont et.al. 2401.01765v1 null
2024-01-02 Influence of scanning plane on Human Spinal Cord functional Magnetic Resonance echo planar imaging Marta Moraschi et.al. 2401.01281v1 null
2024-01-02 Fairness Certification for Natural Language Processing and Large Language Models Vincent Freiberger et.al. 2401.01262v1 null
2024-01-02 VideoDrafter: Content-Consistent Multi-Scene Video Generation with LLM Fuchen Long et.al. 2401.01256v1 null
2024-01-02 Towards a Simultaneous and Granular Identity-Expression Control in Personalized Face Generation Renshuai Liu et.al. 2401.01207v1 null
2024-01-02 Learning Surface Scattering Parameters From SAR Images Using Differentiable Ray Tracing Jiangtao Wei et.al. 2401.01175v1 null
2024-01-02 Joint Generative Modeling of Scene Graphs and Images via Diffusion Models Bicheng Xu et.al. 2401.01130v1 null
2023-12-29 FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis Feng Liang et.al. 2312.17681v1 null
2023-12-29 Data Augmentation for Supervised Graph Outlier Detection with Latent Diffusion Models Kay Liu et.al. 2312.17679v1 link
2023-12-29 Leveraging Open-Vocabulary Diffusion to Camouflaged Instance Segmentation Tuan-Anh Vu et.al. 2312.17505v1 null
2023-12-28 iFusion: Inverting Diffusion for Pose-Free Reconstruction from Sparse Views Chin-Hsuan Wu et.al. 2312.17250v1 link
2023-12-28 Amodal Ground Truth and Completion in the Wild Guanqi Zhan et.al. 2312.17247v1 link
2023-12-28 Personalized Restoration via Dual-Pivot Tuning Pradyumna Chari et.al. 2312.17234v1 null
2023-12-28 4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency Yuyang Yin et.al. 2312.17225v1 null
2023-12-28 EFHQ: Multi-purpose ExtremePose-Face-HQ dataset Trung Tuan Dao et.al. 2312.17205v1 null
2023-12-28 Restoration by Generation with Constrained Priors Zheng Ding et.al. 2312.17161v1 null
2023-12-28 InsActor: Instruction-driven Physics-based Characters Jiawei Ren et.al. 2312.17135v1 null
2023-12-28 100-fold improvement in relaxed eddy accumulation flux estimates through error diffusion Anas Emad et.al. 2312.17027v1 link
2023-12-26 One-dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications Mengyao Lyu et.al. 2312.16145v1 null
2023-12-26 HarmonyView: Harmonizing Consistency and Diversity in One-Image-to-3D Sangmin Woo et.al. 2312.15980v1 link
2023-12-26 Semantic Guidance Tuning for Text-To-Image Diffusion Models Hyun Kang et.al. 2312.15964v1 null
2023-12-26 EnchantDance: Unveiling the Potential of Music-Driven Dance Movement Bo Han et.al. 2312.15946v1 link
2023-12-22 MACS: Mass Conditioned 3D Hand and Object Motion Synthesis Soshi Shimada et.al. 2312.14929v1 null
2023-12-22 BrainVis: Exploring the Bridge between Brain and Visual Signals via Image Reconstruction Honghao Fu et.al. 2312.14871v1 null
2023-12-22 Dreaming of Electrical Waves: Generative Modeling of Cardiac Excitation Waves using Diffusion Models Tanish Baranwal et.al. 2312.14830v1 null
2023-12-22 Neural network models for preferential concentration of particles in two-dimensional turbulence Thibault Maurel-Oujia et.al. 2312.14829v1 null
2023-12-22 Plan, Posture and Go: Towards Open-World Text-to-Motion Generation Jinpeng Liu et.al. 2312.14828v1 null
2023-12-22 Disorder-induced non-linear growth of viscously-unstable immiscible two-phase flow fingers in porous media Santanu Sinha et.al. 2312.14799v1 null
2023-12-22 Diffusion Maps for Signal Filtering in Graph Learning Todd Hildebrant et.al. 2312.14758v1 null
2023-12-21 Diffusion Reward: Learning Rewards via Conditional Video Diffusion Tao Huang et.al. 2312.14134v1 null
2023-12-21 Neural Point Cloud Diffusion for Disentangled 3D Shape and Appearance Generation Philipp Schröppel et.al. 2312.14124v1 link
2023-12-21 HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models Hayk Manukyan et.al. 2312.14091v1 link
2023-12-21 Designing Artificial Intelligence Equipped Social Decentralized Autonomous Organizations for Tackling Sextortion Cases Version 0.7 Norta Alex et.al. 2312.14090v1 null
2023-12-21 The influence of controlled vibration effects on fluid flow Alexey Fedyushkin et.al. 2312.14079v1 null
2023-12-21 Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning Desai Xie et.al. 2312.13980v1 null
2023-12-21 Controllable 3D Face Generation with Conditional Style Code Diffusion Xiaolong Shen et.al. 2312.13941v1 link
2023-12-20 Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting Junwu Zhang et.al. 2312.13271v1 link
2023-12-20 Conditional Image Generation with Pretrained Generative Model Rajesh Shrestha et.al. 2312.13253v1 null
2023-12-20 Zero-Shot Metric Depth with a Field-of-View Conditioned Diffusion Model Saurabh Saxena et.al. 2312.13252v1 null
2023-12-20 Diffusion Models With Learned Adaptive Noise Subham Sekhar Sahoo et.al. 2312.13236v1 link
2023-12-20 MoSAR: Monocular Semi-Supervised Model for Avatar Reconstruction using Differentiable Shading Abdallah Dib et.al. 2312.13091v1 null
2023-12-20 DiffPortrait3D: Controllable Diffusion for Zero-Shot Portrait View Synthesis Yuming Gu et.al. 2312.13016v1 link
2023-12-20 A comparative study of analytical models of diffuse reflectance in homogeneous biological tissues: Gelatin based phantoms and Monte Carlo experiments Anisha Bahl et.al. 2312.12935v1 null
2023-12-19 On Inference Stability for Diffusion Models Viet Nguyen et.al. 2312.12431v1 link
2023-12-19 SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process Mengyu Wang et.al. 2312.12425v1 link
2023-12-19 Scene-Conditional 3D Object Stylization and Composition Jinghao Zhou et.al. 2312.12419v1 null
2023-12-19 LASA: Instance Reconstruction from Real Scans using A Large-scale Aligned Shape Annotation Dataset Haolin Liu et.al. 2312.12418v1 null
2023-12-19 Prompting Hard or Hardly Prompting: Prompt Inversion for Text-to-Image Diffusion Models Shweta Mahajan et.al. 2312.12416v1 null
2023-12-19 Intrinsic Image Diffusion for Single-view Material Estimation Peter Kocsis et.al. 2312.12274v1 link
2023-12-19 Brush Your Text: Synthesize Any Scene Text on Images via Diffusion Model Lingjun Zhang et.al. 2312.12232v1 link
2023-12-18 A novel diffusion recommendation algorithm based on multi-scale cnn and residual lstm Yong Niu et.al. 2312.10885v1 null
2023-12-17 Your Student is Better Than Expected: Adaptive Teacher-Student Collaboration for Text-Conditional Diffusion Models Nikita Starodubcev et.al. 2312.10835v1 link
2023-12-17 From mixing to displacement of miscible phases in porous media: The role of heterogeneity and inlet pressure Yahel Eliyahu-Yakir et.al. 2312.10722v1 null
2023-12-17 CogCartoon: Towards Practical Story Visualization Zhongyang Zhu et.al. 2312.10718v1 null
2023-12-17 A Framework of Full-Process Generation Design for Park Green Spaces Based on Remote Sensing Segmentation-GAN-Diffusion Ran Chen et.al. 2312.10674v1 null
2023-12-15 Movement Primitive Diffusion: Learning Gentle Robotic Manipulation of Deformable Objects Paul Maria Scheikl et.al. 2312.10008v1 null
2023-12-15 Contributions to the geomagnetic secular variation from a reanalysis of core surface dynamics Olivier Barrois et.al. 2312.09942v1 null
2023-12-15 Assimilation of ground and satellite magnetic measurements: inference of core surface magnetic and velocity field changes Olivier Barrois et.al. 2312.09878v1 null
2023-12-15 Integrating New Technologies into Science: The case of AI Stefano Bianchini et.al. 2312.09843v1 null
2023-12-15 Socio-Economic Deprivation Analysis: Diffusion Maps June Moh Goo et.al. 2312.09830v1 null
2023-12-15 Comparison of Quasi-Geostrophic, Hybrid and 3D models of planetary core convection Olivier Barrois et.al. 2312.09826v1 null
2023-12-15 Neural networks for turbulent transport prediction in a simplified model of tokamak plasmas L. M. Pomârjanschi et.al. 2312.09807v1 null
2023-12-14 LIME: Localized Image Editing via Attention Regularization in Diffusion Models Enis Simsar et.al. 2312.09256v1 null
2023-12-14 FineControlNet: Fine-level Text Control for Image Generation with Spatially Aligned Text Control Injection Hongsuk Choi et.al. 2312.09252v1 null
2023-12-14 Single Mesh Diffusion Models with Field Latents for Texture Generation Thomas W. Mitchel et.al. 2312.09250v1 null
2023-12-14 Text2Immersion: Generative Immersive Scene with 3D Gaussians Hao Ouyang et.al. 2312.09242v1 null
2023-12-14 A framework for conditional diffusion modelling with applications in motif scaffolding for protein design Kieran Didi et.al. 2312.09236v1 null
2023-12-14 Reliability in Semantic Segmentation: Can We Use Synthetic Data? Thibaut Loiseau et.al. 2312.09231v1 null
2023-12-14 Mosaic-SDF for 3D Generative Models Lior Yariv et.al. 2312.09222v1 null
2023-12-14 Measurement in the Age of LLMs: An Application to Ideological Scaling Sean O'Hagan et.al. 2312.09203v1 null
2023-12-14 Fast Sampling via De-randomization for Discrete Diffusion Models Zixiang Chen et.al. 2312.09193v1 null
2023-12-13 PnPNet: Pull-and-Push Networks for Volumetric Segmentation with Boundary Confusion Xin You et.al. 2312.08323v1 link
2023-12-13 Black-box Membership Inference Attacks against Fine-tuned Diffusion Models Yan Pang et.al. 2312.08207v1 link
2023-12-13 SPD-DDPM: Denoising Diffusion Probabilistic Models in the Symmetric Positive Definite Space Yunchen Li et.al. 2312.08200v1 link
2023-12-13 Concept-centric Personalization with Large-scale Diffusion Priors Pu Cao et.al. 2312.08195v1 link
2023-12-13 $ρ$ -Diffusion: A diffusion-based density estimation framework for computational physics Maxwell X. Cai et.al. 2312.08153v1 link
2023-12-13 Clockwork Diffusion: Efficient Generation With Model-Step Distillation Amirhossein Habibian et.al. 2312.08128v1 link
2023-12-12 FreeInit: Bridging Initialization Gap in Video Diffusion Models Tianxing Wu et.al. 2312.07537v1 link
2023-12-12 FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition Sicheng Mo et.al. 2312.07536v1 null
2023-12-12 PEEKABOO: Interactive Video Generation via Masked-Diffusion Yash Jain et.al. 2312.07509v1 null
2023-12-12 MinD-3D: Reconstruct High-quality 3D objects in Human Brain Jianxiong Gao et.al. 2312.07485v1 null
2023-12-12 DiffMorpher: Unleashing the Capability of Diffusion Models for Image Morphing Kaiwen Zhang et.al. 2312.07409v1 null
2023-12-12 Boosting Latent Diffusion with Flow Matching Johannes S. Fischer et.al. 2312.07360v1 link
2023-12-12 Momentum Particle Maximum Likelihood Jen Ning Lim et.al. 2312.07335v1 null
2023-12-11 CAD: Photorealistic 3D Generation via Adversarial Distillation Ziyu Wan et.al. 2312.06663v1 null
2023-12-11 Photorealistic Video Generation with Diffusion Models Agrim Gupta et.al. 2312.06662v1 null
2023-12-11 UpFusion: Novel View Diffusion from Unposed Sparse View Observations Bharath Raj Nagoor Kani et.al. 2312.06661v1 null
2023-12-11 Sherpa3D: Boosting High-Fidelity Text-to-3D Generation via Coarse 3D Prior Fangfu Liu et.al. 2312.06655v1 link
2023-12-11 Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution Shangchen Zhou et.al. 2312.06640v1 null
2023-12-11 DiAD: A Diffusion-based Framework for Multi-class Anomaly Detection Haoyang He et.al. 2312.06607v1 link
2023-12-11 ControlNet-XS: Designing an Efficient and Effective Architecture for Controlling Text-to-Image Diffusion Models Denis Zavadski et.al. 2312.06573v1 link
2023-12-11 HOI-Diff: Text-Driven Synthesis of 3D Human-Object Interactions using Diffusion Models Xiaogang Peng et.al. 2312.06553v1 null
2023-12-11 In-situ Synchrotron X-Ray Photoelectron Spectroscopy Study of Medium-Temperature Baking of Niobium for SRF Application Alena Prudnikava et.al. 2312.06529v1 null
2023-12-08 KBFormer: A Diffusion Model for Structured Entity Completion Ouail Kitouni et.al. 2312.05253v1 null
2023-12-08 SwiftBrush: One-Step Text-to-Image Diffusion Model with Variational Score Distillation Thuan Hoang Nguyen et.al. 2312.05239v1 null
2023-12-08 Stoichiometry preservation and generalization of Bilger mixture fraction for non-premixed combustion with differential molecular diffusion Haifeng Wang et.al. 2312.05204v1 null
2023-12-08 Membership Inference Attacks on Diffusion Models via Quantile Regression Shuai Tang et.al. 2312.05140v1 null
2023-12-08 DreaMoving: A Human Dance Video Generation Framework based on Diffusion Models Mengyang Feng et.al. 2312.05107v1 null
2023-12-08 Application of deep learning to the estimation of normalization coefficients in diffusion-based covariance models Folke K Skrunes et.al. 2312.05068v1 link
2023-12-08 SmartMask: Context Aware High-Fidelity Mask Generation for Fine-grained Object Insertion and Layout Control Jaskirat Singh et.al. 2312.05039v1 null
2023-12-08 Numerical determination of iron dust laminar flame speeds with the counterflow twin-flame technique C. E. A. G. van Gool et.al. 2312.04994v1 null
2023-12-07 Gen2Det: Generate to Detect Saksham Suri et.al. 2312.04566v1 null
2023-12-07 NeRFiller: Completing Scenes via Generative 3D Inpainting Ethan Weber et.al. 2312.04560v1 null
2023-12-07 PrimDiffusion: Volumetric Primitives Diffusion for 3D Human Generation Zhaoxi Chen et.al. 2312.04559v1 link
2023-12-07 GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation Shoufa Chen et.al. 2312.04557v1 null
2023-12-07 SPIDeRS: Structured Polarization for Invisible Depth and Reflectance Sensing Tomoki Ichikawa et.al. 2312.04553v1 null
2023-12-07 Generating Illustrated Instructions Sachit Menon et.al. 2312.04552v1 null
2023-12-07 PlayFusion: Skill Acquisition via Diffusion from Language-Annotated Play Lili Chen et.al. 2312.04549v1 null
2023-12-07 HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image Tong Wu et.al. 2312.04543v1 null
2023-12-07 Diffusion Reflectance Map: Single-Image Stochastic Inverse Rendering of Illumination and Reflectance Yuto Enyo et.al. 2312.04529v1 null
2023-12-07 RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models Ozgur Kara et.al. 2312.04524v1 link
2023-12-06 Relightable Gaussian Codec Avatars Shunsuke Saito et.al. 2312.03704v1 null
2023-12-06 Self-conditioned Image Generation via Generating Representations Tianhong Li et.al. 2312.03701v1 link
2023-12-06 Memory Triggers: Unveiling Memorization in Text-To-Image Generative Models through Word-Level Duplication Ali Naseh et.al. 2312.03692v1 null
2023-12-06 WarpDiffusion: Efficient Diffusion Model for High-Fidelity Virtual Try-on xujie zhang et.al. 2312.03667v1 null
2023-12-06 TokenCompose: Grounding Diffusion with Token-level Supervision Zirui Wang et.al. 2312.03626v1 link
2023-12-06 DreamComposer: Controllable 3D Object Generation via Multi-View Conditions Yunhan Yang et.al. 2312.03611v1 null
2023-12-06 DiffusionSat: A Generative Foundation Model for Satellite Imagery Samar Khanna et.al. 2312.03606v1 null
2023-12-06 MMM: Generative Masked Motion Model Ekkasit Pinyoanuntapong et.al. 2312.03596v1 link
2023-12-05 ReconFusion: 3D Reconstruction with Diffusion Priors Rundi Wu et.al. 2312.02981v1 null
2023-12-05 Alchemist: Parametric Control of Material Properties with Diffusion Models Prafull Sharma et.al. 2312.02970v1 null
2023-12-05 AmbiGen: Generating Ambigrams from Pre-trained Diffusion Model Boheng Zhao et.al. 2312.02967v1 null
2023-12-05 Diffusion-SS3D: Diffusion Model for Semi-supervised 3D Object Detection Cheng-Ju Ho et.al. 2312.02966v1 link
2023-12-05 Drag-A-Video: Non-rigid Video Editing with Point-based Interaction Yao Teng et.al. 2312.02936v1 null
2023-12-05 WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation Jiachen Lu et.al. 2312.02934v1 link
2023-12-05 LivePhoto: Real Image Animation with Text-guided Motion Control Xi Chen et.al. 2312.02928v1 null
2023-12-05 Multimodal Prompt Perceiver: Empower Adaptiveness, Generalizability and Fidelity for All-in-One Image Restoration Yuang Ai et.al. 2312.02918v1 null
2023-12-04 Latent Feature-Guided Diffusion Models for Shadow Removal Kangfu Mei et.al. 2312.02156v1 null
2023-12-04 Readout Guidance: Learning Control from Diffusion Features Grace Luo et.al. 2312.02150v1 null
2023-12-04 Generative Powers of Ten Xiaojuan Wang et.al. 2312.02149v1 null
2023-12-04 Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation Bingxin Ke et.al. 2312.02145v1 link
2023-12-04 DiffiT: Diffusion Vision Transformers for Image Generation Ali Hatamizadeh et.al. 2312.02139v1 link
2023-12-04 Style Aligned Image Generation via Shared Attention Amir Hertz et.al. 2312.02133v1 link
2023-12-04 VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence Yuchao Gu et.al. 2312.02087v1 null
2023-12-04 Computational Investigation on Collective Dynamical Behaviors of Flickering Laminar Buoyant Diffusion Flames in Circular Arrays Tao Yang et.al. 2312.02018v1 null
2023-12-01 MorpheuS: Neural Dynamic 360° Surface Reconstruction from Monocular RGB-D Video Hengyi Wang et.al. 2312.00778v1 null
2023-12-01 VideoBooth: Diffusion-based Video Generation with Image Prompts Yuming Jiang et.al. 2312.00777v1 null
2023-12-01 CompuCell3D Model of Cell Migration Reproduces Chemotaxis Pedro C. Dal-Castel et.al. 2312.00776v1 link
2023-12-01 Effects of three-dimensional slit geometry on flashback of premixed hydrogen flames in perforated burners Filippo Fruzza et.al. 2312.00744v1 null
2023-12-01 Resource-constrained knowledge diffusion processes inspired by human peer learning Ehsan Beikihassan et.al. 2312.00660v1 null
2023-12-01 TrackDiffusion: Multi-object Tracking Data Generation via Diffusion Models Pengxiang Li et.al. 2312.00651v1 null
2023-12-01 How the zebra got its stripes: Curvature-dependent diffusion orients Turing patterns on 3D surfaces Michael F. Staddon et.al. 2312.00637v1 null
2023-11-30 VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models Zhen Xing et.al. 2311.18837v1 null
2023-11-30 ART $\boldsymbol{\cdot}$ V: Auto-Regressive Text-to-Video Generation with Diffusion Models Wenming Weng et.al. 2311.18834v1 null
2023-11-30 Exploiting Diffusion Prior for Generalizable Pixel-Level Semantic Prediction Hsin-Ying Lee et.al. 2311.18832v1 link
2023-11-30 MotionEditor: Editing Video Motion via Content-Aware Diffusion Shuyuan Tu et.al. 2311.18830v1 link
2023-11-30 MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation Yanhui Wang et.al. 2311.18829v1 null
2023-11-30 One-step Diffusion with Distribution Matching Distillation Tianwei Yin et.al. 2311.18828v1 null
2023-11-30 ElasticDiffusion: Training-free Arbitrary Size Image Generation Moayed Haji-Ali et.al. 2311.18822v1 link
2023-11-30 Continual Diffusion with STAMINA: STack-And-Mask INcremental Adapters James Seale Smith et.al. 2311.18763v1 null
2023-11-29 Do text-free diffusion models learn discriminative visual representations? Soumik Mukhopadhyay et.al. 2311.17921v1 link
2023-11-29 Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models Daniel Geng et.al. 2311.17919v1 null
2023-11-29 AvatarStudio: High-fidelity and Animatable 3D Avatar Creation from Text Jianfeng Zhang et.al. 2311.17917v1 null
2023-11-29 CG3D: Compositional Generation for Text-to-3D via Gaussian Splatting Alexander Vilesov et.al. 2311.17907v1 null
2023-11-29 SODA: Bottleneck Diffusion Models for Representation Learning Drew A. Hudson et.al. 2311.17901v1 null
2023-11-29 Leveraging Graph Diffusion Models for Network Refinement Tasks Puja Trivedi et.al. 2311.17856v1 null
2023-11-29 SPiC-E : Structural Priors in 3D Diffusion Models using Cross Entity Attention Etai Sella et.al. 2311.17834v1 null
2023-11-29 Analyzing and Explaining Image Classifiers via Diffusion Guidance Maximilian Augustin et.al. 2311.17833v1 null
2023-11-28 Material Palette: Extraction of Materials from a Single Image Ivan Lopes et.al. 2311.17060v1 null
2023-11-28 ReMoS: Reactive 3D Motion Synthesis for Two-Person Interactions Anindita Ghosh et.al. 2311.17057v1 null
2023-11-28 DiffuseBot: Breeding Soft Robots With Physics-Augmented Generative Diffusion Models Tsun-Hsuan Wang et.al. 2311.17053v1 null
2023-11-28 Surf-D: High-Quality Surface Generation for Arbitrary Topologies using Diffusion Models Zhengming Yu et.al. 2311.17050v1 null
2023-11-28 Adversarial Diffusion Distillation Axel Sauer et.al. 2311.17042v1 link
2023-11-28 Rumors with Changing Credibility Charlotte Out et.al. 2311.17040v1 null
2023-11-28 Diffusion 3D Features (Diff3F): Decorating Untextured Shapes with Distilled Semantic Features Niladri Shekhar Dutt et.al. 2311.17024v1 link
2023-11-28 Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer Danah Yatim et.al. 2311.17009v1 null
2023-11-28 Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following Yutong Feng et.al. 2311.17002v1 null
2023-11-27 Test-time Adaptation of Discriminative Models via Diffusion Generative Feedback Mihir Prabhudesai et.al. 2311.16102v1 null
2023-11-27 CG-HOI: Contact-Guided 3D Human-Object Interaction Generation Christian Diller et.al. 2311.16097v1 null
2023-11-27 Street TryOn: Learning In-the-Wild Virtual Try-On from Unpaired Person Images Aiyu Cui et.al. 2311.16094v1 null
2023-11-27 Self-correcting LLM-controlled Diffusion Models Tsung-Han Wu et.al. 2311.16090v1 null
2023-11-27 DiffSLVA: Harnessing Diffusion Models for Sign Language Video Anonymization Zhaoyang Xia et.al. 2311.16060v1 link
2023-11-27 Exploring Attribute Variations in Style-based GANs using Diffusion Models Rishubh Parihar et.al. 2311.16052v1 null
2023-11-27 GaussianEditor: Editing 3D Gaussians Delicately with Text Instructions Jiemin Fang et.al. 2311.16037v1 null
2023-11-27 Closing the ODE-SDE gap in score-based diffusion models through the Fokker-Planck equation Teo Deveney et.al. 2311.15996v1 null
2023-11-27 DiffAnt: Diffusion Models for Action Anticipation Zeyun Zhong et.al. 2311.15991v1 null
2023-11-24 CatVersion: Concatenating Embeddings for Diffusion-Based Text-to-Image Personalization Ruoyu Zhao et.al. 2311.14631v1 null
2023-11-24 Received Signal and Channel Parameter Estimation in Molecular Communications O. Tansel Baydas et.al. 2311.14621v1 null
2023-11-24 Animate124: Animating One Image to 4D Dynamic Scene Yuyang Zhao et.al. 2311.14603v1 null
2023-11-24 On the thermodynamic invariance of fine-grain and coarse-grain fluid models Thomas Dubos et.al. 2311.14564v1 null
2023-11-24 ToddlerDiffusion: Flash Interpretable Controllable Diffusion Model Eslam Mohamed Bakr et.al. 2311.14542v1 null
2023-11-24 GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting Yiwen Chen et.al. 2311.14521v1 link
2023-11-24 MVControl: Adding Conditional Control to Multi-view Diffusion for Controllable Text-to-3D Generation Zhiqi Li et.al. 2311.14494v1 link
2023-11-22 On diffusion-based generative models and their error bounds: The log-concave case with full convergence estimates Stefano Bruno et.al. 2311.13584v1 null
2023-11-22 WildFusion: Learning 3D-Aware Latent Diffusion Models in View Space Katja Schwarz et.al. 2311.13570v1 null
2023-11-22 ADriver-I: A General World Model for Autonomous Driving Fan Jia et.al. 2311.13549v1 null
2023-11-22 DiffusionMat: Alpha Matting as Sequential Refinement Learning Yangyang Xu et.al. 2311.13535v1 null
2023-11-22 Guided Flows for Generative Modeling and Decision Making Qinqing Zheng et.al. 2311.13443v1 null
2023-11-22 LucidDreamer: Domain-free Generation of 3D Gaussian Splatting Scenes Jaeyoung Chung et.al. 2311.13384v1 null
2023-11-21 Bubble departure and sliding in high-pressure flow boiling of water Artyom Kossolapov et.al. 2311.12749v1 null
2023-11-21 GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning Jiaxi Lv et.al. 2311.12631v1 null
2023-11-21 HierSpeech++: Bridging the Gap between Semantic and Acoustic Representation of Speech by Hierarchical Variational Inference for Zero-shot Speech Synthesis Sang-Hoon Lee et.al. 2311.12454v1 link
2023-11-21 Stable Diffusion For Aerial Object Detection Yanan Jian et.al. 2311.12345v1 null
2023-11-21 LoCo: Locally Constrained Training-Free Layout-to-Image Synthesis Peiang Zhao et.al. 2311.12342v1 null
2023-11-21 Overcoming Pathology Image Data Deficiency: Generating Images from Pathological Transformation Process Zeyu Liu et.al. 2311.12316v1 link
2023-11-20 Macroscopic description of a heavy particle immersed within a flow of light particles Radek Erban et.al. 2311.12021v1 null
2023-11-20 An Image is Worth Multiple Words: Multi-attribute Inversion for Constrained Text-to-Image Synthesis Aishwarya Agarwal et.al. 2311.11919v1 null
2023-11-20 Evolution of internal gravity waves in meso-scale eddies Pablo Sebastia Saez et.al. 2311.11916v1 null
2023-11-20 Log-periodic oscillations as real-time signatures of hierarchical dynamics in proteins Emanuel Dorbath et.al. 2311.11839v1 null
2023-11-20 Holistic Inverse Rendering of Complex Facade via Aerial 3D Scanning Zixuan Xie et.al. 2311.11825v1 null
2023-11-17 Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning Rohit Girdhar et.al. 2311.10709v1 null
2023-11-17 SelfEval: Leveraging the discriminative nature of generative models for evaluation Sai Saketh Rambhatla et.al. 2311.10708v1 null
2023-11-17 Enhancing Object Coherence in Layout-to-Image Synthesis Yibin Wang et.al. 2311.10522v1 link
2023-11-16 The Chosen One: Consistent Characters in Text-to-Image Diffusion Models Omri Avrahami et.al. 2311.10093v1 null
2023-11-16 Spontaneous Opinion Swings in the Voter Model with Latency Giovanni Palermo et.al. 2311.10045v1 null
2023-11-16 TransFusion -- A Transparency-Based Diffusion Model for Anomaly Detection Matic Fučka et.al. 2311.09999v1 null
2023-11-16 The divergence-free velocity formulation of the consistent Navier-Stokes Cahn-Hilliard model with non-matching densities, divergence-conforming discretization, and benchmarks M. ten Eikelder et.al. 2311.09966v1 null
2023-11-16 DSR-Diff: Depth Map Super-Resolution with Diffusion Model Yuan Shi et.al. 2311.09919v1 null
2023-11-15 Single-Image 3D Human Digitization with Shape-Guided Diffusion Badour AlBahar et.al. 2311.09221v1 null
2023-11-15 DMV3D: Denoising Multi-View Diffusion using 3D Large Reconstruction Model Yinghao Xu et.al. 2311.09217v1 null
2023-11-15 Finding polarised communities and tracking information diffusion on Twitter: The Irish Abortion Referendum Caroline Pena et.al. 2311.09196v1 null
2023-11-15 Fast Detection of Phase Transitions with Multi-Task Learning-by-Confusion Julian Arnold et.al. 2311.09128v1 null
2023-11-15 Contrastive Transformer Learning with Proximity Data Generation for Text-Based Person Search Hefeng Wu et.al. 2311.09084v1 link
2023-11-15 A Spectral Diffusion Prior for Hyperspectral Image Super-Resolution Jianjun Liu et.al. 2311.08955v1 null
2023-11-13 Fast and Space-Efficient Parallel Algorithms for Influence Maximization Letong Wang et.al. 2311.07554v1 link
2023-11-13 Harnessing elastic instabilities for enhanced mixing and reaction kinetics in porous media Christopher A. Browne et.al. 2311.07431v1 link
2023-11-13 Robust semi-supervised segmentation with timestep ensembling diffusion models Margherita Rosnati et.al. 2311.07421v1 null
2023-11-10 Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization Weiyang Liu et.al. 2311.06243v1 null
2023-11-10 Diffusion Models for Earth Observation Use-cases: from cloud removal to urban change detection Fulvio Sanguigni et.al. 2311.06222v1 null
2023-11-10 Instant3D: Fast Text-to-3D with Sparse-View Generation and Large Reconstruction Model Jiahao Li et.al. 2311.06214v1 null
2023-11-10 Turbulence Scaling from Deep Learning Diffusion Generative Models Tim Whittaker et.al. 2311.06112v1 null
2023-11-09 Diffusion-Generative Multi-Fidelity Learning for Physical Simulation Zheng Wang et.al. 2311.05606v1 null
2023-11-09 Bayesian Methods for Media Mix Modelling with shape and funnel effects Javier Marin et.al. 2311.05587v1 null
2023-11-09 LCM-LoRA: A Universal Stable-Diffusion Acceleration Module Simian Luo et.al. 2311.05556v1 link
2023-11-09 From Stability to Change: The Potential Application of Bifurcation Theory to Opinion Dynamics Considerations Yasuko Kawahata et.al. 2311.05488v1 null
2023-11-09 Lithium-ion battery performance model including solvent segregation effects Ruihe Li et.al. 2311.05467v1 null
2023-11-09 3DStyle-Diffusion: Pursuing Fine-grained Text-driven 3D Stylization with 2D Diffusion Models Haibo Yang et.al. 2311.05464v1 link
2023-11-09 ControlStyle: Text-Driven Stylized Image Generation Using Diffusion Priors Jingwen Chen et.al. 2311.05463v1 null
2023-11-08 Transferability of atomic energies from alchemical decomposition Michael J. Sahre et.al. 2311.04784v1 link
2023-11-08 Weakly-supervised deepfake localization in diffusion-generated images Dragos Tantaru et.al. 2311.04584v1 link
2023-11-07 I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models Shiwei Zhang et.al. 2311.04145v1 link
2023-11-07 Simple Bundles of Complex Networks Alexandre Benatti et.al. 2311.04133v1 null
2023-11-07 Generative Structural Design Integrating BIM and Diffusion Model Zhili He et.al. 2311.04052v1 link
2023-11-07 A Method to Improve the Performance of Reinforcement Learning Based on the Y Operator for a Class of Stochastic Differential Equation-Based Child-Mother Systems Cheng Yin et.al. 2311.04014v1 null
2023-11-06 TS-Diffusion: Generating Highly Complex Time Series with Diffusion Models Yangming Li et.al. 2311.03303v1 null
2023-11-06 LDM3D-VR: Latent Diffusion Model for 3D VR Gabriela Ben Melech Stan et.al. 2311.03226v1 null
2023-11-06 Persistent homology for high-dimensional data based on spectral methods Sebastian Damrich et.al. 2311.03087v1 link
2023-11-06 AnyText: Multilingual Visual Text Generation And Editing Yuxiang Tuo et.al. 2311.03054v1 link
2023-11-03 Latent Diffusion Model for Conditional Reservoir Facies Generation Daesoo Lee et.al. 2311.01968v1 null
2023-11-03 DiffDub: Person-generic Visual Dubbing Using Inpainting Renderer with Diffusion Auto-encoder Tao Liu et.al. 2311.01811v1 null
2023-11-03 On the Generalization Properties of Diffusion Models Puheng Li et.al. 2311.01797v1 link
2023-11-03 PDF: Point Diffusion Implicit Function for Large-scale Scene Neural Representation Yuhan Ding et.al. 2311.01773v1 null
2023-11-03 CDGraph: Dual Conditional Social Graph Synthesizing via Diffusion Model Jui-Yi Tsai et.al. 2311.01729v1 null
2023-11-02 Time Series Anomaly Detection using Diffusion-based Models Ioana Pintilie et.al. 2311.01452v1 link
2023-11-02 Constrained-Context Conditional Diffusion Models for Imitation Learning Vaibhav Saxena et.al. 2311.01419v1 null
2023-11-02 The Blessing of Randomness: SDE Beats ODE in General Diffusion-based Image Editing Shen Nie et.al. 2311.01410v1 null
2023-11-02 Sim2Real Bilevel Adaptation for Object Surface Classification using Vision-Based Tactile Sensors Gabriele M. Caddeo et.al. 2311.01380v1 link
2023-11-02 DP-Mix: Mixup-based Data Augmentation for Differentially Private Learning Wenxuan Bao et.al. 2311.01295v1 link
2023-11-02 Unraveling Diffusion in Fusion Plasma: A Case Study of In Situ Processing and Particle Sorting Junmin Gu et.al. 2311.01288v1 null
2023-11-01 De-Diffusion Makes Text a Strong Cross-Modal Interface Chen Wei et.al. 2311.00618v1 null
2023-11-01 Controllable Music Production with Diffusion Models and Guidance Gradients Mark Levy et.al. 2311.00613v1 null
2023-11-01 Intriguing Properties of Data Attribution on Diffusion Models Xiaosen Zheng et.al. 2311.00500v1 link
2023-11-01 Diffusion models for probabilistic programming Simon Dirmeier et.al. 2311.00474v1 link
2023-11-01 Dual Conditioned Diffusion Models for Out-Of-Distribution Detection: Application to Fetal Ultrasound Videos Divyanshu Mishra et.al. 2311.00469v1 null
2023-10-31 SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction Xinyuan Chen et.al. 2310.20700v1 null
2023-10-31 Diffusion Reconstruction of Ultrasound Images with Informative Uncertainty Yuxin Zhang et.al. 2310.20618v1 null
2023-10-29 JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation Yao Yao et.al. 2310.19180v1 null
2023-10-29 Learning to Follow Object-Centric Image Editing Instructions Faithfully Tuhin Chakrabarty et.al. 2310.19145v1 link
2023-10-29 Backward and Forward Inference in Interacting Independent-Cascade Processes: A Scalable and Convergent Message-Passing Approach Nouman Khan et.al. 2310.19138v1 null
2023-10-29 Bespoke Solvers for Generative Flow Models Neta Shaul et.al. 2310.19075v1 null
2023-10-29 Controllable Group Choreography using Contrastive Diffusion Nhat Le et.al. 2310.18986v1 null
2023-10-29 Adversarial Examples Are Not Real Features Ang Li et.al. 2310.18936v1 link
2023-10-27 Gen2Sim: Scaling up Robot Learning in Simulation with Generative Models Pushkal Katara et.al. 2310.18308v1 null
2023-10-27 Unsteady evolution of slip and drag in surfactant-contaminated superhydrophobic channels Samuel D. Tomlinson et.al. 2310.18184v1 null
2023-10-27 Style Description based Text-to-Speech with Conditional Prosodic Layer Normalization based Diffusion GAN Neeraj Kumar et.al. 2310.18169v1 null
2023-10-27 Lost in Translation -- Multilingual Misinformation and its Evolution Dorian Quelle et.al. 2310.18089v1 null
2023-10-27 ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Real Image Kyle Sargent et.al. 2310.17994v1 null
2023-10-26 6-DoF Stability Field via Diffusion Models Takuma Yoneda et.al. 2310.17649v1 null
2023-10-26 Generative Fractional Diffusion Models Gabriel Nobis et.al. 2310.17638v1 null
2023-10-26 Orbital-optimized Density Functional Calculations of Molecular Rydberg Excited States with Real Space Grid Representation and Self-Interaction Correction Alec E. Sigurðarson et.al. 2310.17605v1 null
2023-10-26 Noise-Free Score Distillation Oren Katzir et.al. 2310.17590v1 null
2023-10-27 Global Structure-Aware Diffusion Process for Low-Light Image Enhancement Jinhui Hou et.al. 2310.17577v2 link
2023-10-26 DiffS2UT: A Semantic Preserving Diffusion Model for Textless Direct Speech-to-Speech Translation Yongxin Zhu et.al. 2310.17570v1 null
2023-10-26 SD4Match: Learning to Prompt Stable Diffusion Model for Semantic Matching Xinghui Li et.al. 2310.17569v1 null
2023-10-27 The Expressive Power of Low-Rank Adaptation Yuchen Zeng et.al. 2310.17513v2 link
2023-10-25 PERF: Panoramic Neural Radiance Field from a Single Panorama Guangcong Wang et.al. 2310.16831v1 link
2023-10-25 CommonCanvas: An Open Diffusion Model Trained with Creative-Commons Images Aaron Gokaslan et.al. 2310.16825v1 link
2023-10-26 DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior Jingxiang Sun et.al. 2310.16818v2 link
2023-10-25 Optical Kinetic Theory of Nonlinear Multi-mode Photonic Networks Arkady Kurnosov et.al. 2310.16784v1 null
2023-10-25 Kiki or Bouba? Sound Symbolism in Vision-and-Language Models Morris Alper et.al. 2310.16781v1 null
2023-10-25 Multi-scale Diffusion Denoised Smoothing Jongheon Jeong et.al. 2310.16779v1 link
2023-10-25 Discrete variance decay analysis of spurious mixing Tridib Banerjee et.al. 2310.16768v1 null
2023-10-25 Scalar mass conservation in turbulent mixture fraction based combustion models through consistent local flow parameters Marco Davidovic et.al. 2310.16743v1 null
2023-10-24 From Posterior Sampling to Meaningful Diversity in Image Restoration Noa Cohen et.al. 2310.16047v1 null
2023-10-24 CVPR 2023 Text Guided Video Editing Competition Jay Zhangjie Wu et.al. 2310.16003v1 link
2023-10-24 Classical wave-particle localization in disordered landscapes Abel J. Abraham et.al. 2310.16000v1 null
2023-10-25 Improving Robustness and Reliability in Medical Image Classification with Latent-Guided Diffusion and Nested-Ensembles Xing Shen et.al. 2310.15952v2 null
2023-10-24 Language-driven Scene Synthesis using Multi-conditional Diffusion Model An Vuong et.al. 2310.15948v1 link
2023-10-23 FreeNoise: Tuning-Free Longer Video Diffusion Via Noise Rescheduling Haonan Qiu et.al. 2310.15169v1 link
2023-10-23 Matryoshka Diffusion Models Jiatao Gu et.al. 2310.15111v1 null
2023-10-23 Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model Ruoxi Shi et.al. 2310.15110v1 link
2023-10-24 Wonder3D: Single Image to 3D using Cross-Domain Diffusion Xiaoxiao Long et.al. 2310.15008v2 null
2023-10-23 Orientation-Aware Leg Movement Learning for Action-Driven Human Motion Prediction Chunzhi Gu et.al. 2310.14907v1 null
2023-10-20 Achieving Single-Electron Sensitivity at Enhanced Speed in Fully-Depleted CCDs with Double-Gate MOSFETs Miguel Sofo-Haro et.al. 2310.13644v1 null
2023-10-20 ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection Zhongzhan Huang et.al. 2310.13545v1 link
2023-10-20 A Critical Insight into Pretransitional Behavior and Dielectric Tunability of Relaxor Ceramics Sylwester J. Rzoska et.al. 2310.13326v1 null
2023-10-19 Variational Inference for SDEs Driven by Fractional Noise Rembert Daems et.al. 2310.12975v1 null
2023-10-19 A Markovian dynamics for $C. elegans$ behavior across scales Antonio C. Costa et.al. 2310.12883v1 link
2023-10-19 EMIT-Diff: Enhancing Medical Image Segmentation via Text-Guided Diffusion Model Zheyuan Zhang et.al. 2310.12868v1 null
2023-10-19 An effective theory of collective deep learning Lluís Arola-Fernández et.al. 2310.12802v1 link
2023-10-19 Energy-Based Models For Speech Synthesis Wanli Sun et.al. 2310.12765v1 null
2023-10-18 Object-aware Inversion and Reassembly for Image Editing Zhen Yang et.al. 2310.12149v1 null
2023-10-18 Quality Diversity through Human Feedback Li Ding et.al. 2310.12103v1 link
2023-10-18 Image Super-resolution Via Latent Diffusion: A Sampling-space Mixture Of Experts And Frequency-augmented Decoder Approach Feng Luo et.al. 2310.12004v1 link
2023-10-18 Bayesian Flow Networks in Continual Learning Mateusz Pyla et.al. 2310.12001v1 null
2023-10-18 InfoDiffusion: Information Entropy Aware Diffusion Process for Non-Autoregressive Text Generation Renzhi Wang et.al. 2310.11976v1 link
2023-10-17 Elucidating The Design Space of Classifier-Guided Diffusion Generation Jiajun Ma et.al. 2310.11311v1 link
2023-10-17 Favorable and unfavorable many-body interactions for near-field radiative heat transfer in nanoparticle networks Minggang Luo et.al. 2310.11273v1 null
2023-10-17 A diffusive wetting model for water entry/exit based on the weakly-compressible SPH method Shuoguo Zhang et.al. 2310.11179v1 null
2023-10-17 Leveraging Content-based Features from Multiple Acoustic Models for Singing Voice Conversion Xueyao Zhang et.al. 2310.11160v1 link
2023-10-17 BayesDiff: Estimating Pixel-wise Uncertainty in Diffusion via Bayesian Inference Siqi Kou et.al. 2310.11142v1 link
2023-10-17 3D Structure-guided Network for Tooth Alignment in 2D Photograph Yulong Dou et.al. 2310.11106v1 link
2023-10-16 A Survey on Video Diffusion Models Zhen Xing et.al. 2310.10647v1 link
2023-10-16 TOSS:High-quality Text-guided Novel View Synthesis from a Single Image Yukai Shi et.al. 2310.10644v1 null
2023-10-16 LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts Hanan Gani et.al. 2310.10640v1 link
2023-10-16 Zero-Shot Robotic Manipulation with Pretrained Image-Editing Diffusion Models Kevin Black et.al. 2310.10639v1 link
2023-10-16 DynVideo-E: Harnessing Dynamic NeRF for Large-Scale Motion- and View-Change Human-Centric Video Editing Jia-Wei Liu et.al. 2310.10624v1 null
2023-10-16 ViPE: Visualise Pretty-much Everything Hassan Shahmohammadi et.al. 2310.10543v1 link
2023-10-13 Hypernymy Understanding Evaluation of Text-to-Image Models via WordNet Hierarchy Anton Baryshnikov et.al. 2310.09247v1 link
2023-10-13 Unseen Image Synthesis with Diffusion Models Ye Zhu et.al. 2310.09213v1 null
2023-10-13 The effect of solar wind on the charged particles' diffusion coefficients J. F. Wang et.al. 2310.09211v1 null
2023-10-12 OmniControl: Control Any Joint at Any Time for Human Motion Generation Yiming Xie et.al. 2310.08580v1 link
2023-10-12 HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion Xian Liu et.al. 2310.08579v1 null
2023-10-12 NetDiffusion: Network Data Augmentation Through Protocol-Constrained Traffic Generation Xi Jiang et.al. 2310.08543v1 null
2023-10-12 GaussianDreamer: Fast Generation from Text to 3D Gaussian Splatting with Point Cloud Priors Taoran Yi et.al. 2310.08529v1 link
2023-10-12 MotionDirector: Motion Customization of Text-to-Video Diffusion Models Rui Zhao et.al. 2310.08465v1 link
2023-10-12 Debias the Training of Diffusion Models Hu Yu et.al. 2310.08442v1 null
2023-10-12 Neural Diffusion Models Grigory Bartosh et.al. 2310.08337v1 null
2023-10-11 ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models Yingqing He et.al. 2310.07702v1 link
2023-10-11 ConditionVideo: Training-Free Condition-Guided Text-to-Video Generation Bo Peng et.al. 2310.07697v1 link
2023-10-11 Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models Lai Zeqiang et.al. 2310.07653v1 link
2023-10-11 Flux gradient relations and their dependence on turbulence anisotropy Samuele Mosso et.al. 2310.07503v1 null
2023-10-11 Boosting Black-box Attack to Deep Neural Networks with Conditional Diffusion Models Renyang Liu et.al. 2310.07492v1 null
2023-10-11 Multi-Concept T2I-Zero: Tweaking Only The Text Embeddings and Nothing Else Hazarapet Tunanyan et.al. 2310.07419v1 null
2023-10-10 What Does Stable Diffusion Know about the 3D Scene? Guanqi Zhan et.al. 2310.06836v1 link
2023-10-10 Impact of grain boundary and surface diffusion on predicted fission gas bubble behavior and release in UO $_2$ fuel Md Ali Muntaha et.al. 2310.06795v1 null
2023-10-10 HiFi-123: Towards High-fidelity One Image to 3D Content Generation Wangbo Yu et.al. 2310.06744v1 null
2023-10-10 Latent Diffusion Counterfactual Explanations Karim Farid et.al. 2310.06668v1 null
2023-10-10 Tertiary Lymphoid Structures Generation through Graph-based Diffusion Manuel Madeira et.al. 2310.06661v1 null
2023-10-09 FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing Yuren Cong et.al. 2310.05922v1 null
2023-10-10 Geom-Erasing: Geometry-Driven Removal of Implicit Concept in Diffusion Models Zhili Liu et.al. 2310.05873v2 null
2023-10-09 A Bias-Variance-Covariance Decomposition of Kernel Scores for Generative Models Sebastian G. Gruber et.al. 2310.05833v1 null
2023-10-09 DiffuSeq-v2: Bridging Discrete and Continuous Text Spaces for Accelerated Seq2Seq Diffusion Models Shansan Gong et.al. 2310.05793v1 link
2023-10-09 Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation Lijun Yu et.al. 2310.05737v1 link
2023-10-09 CIFAR-10-Warehouse: Broad and More Realistic Testbeds in Model Generalization Analysis Xiaoxiao Sun et.al. 2310.04414v2 null
2023-10-06 Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference Simian Luo et.al. 2310.04378v1 link
2023-10-05 Aligning Text-to-Image Diffusion Models with Reward Backpropagation Mihir Prabhudesai et.al. 2310.03739v1 link
2023-10-05 Stochastic interpolants with data-dependent couplings Michael S. Albergo et.al. 2310.03725v1 null
2023-10-05 Ctrl-Room: Controllable Text-to-3D Room Meshes Generation with Layout Constraints Chuan Fang et.al. 2310.03602v1 null
2023-10-05 Kandinsky: an Improved Text-to-Image Synthesis with Image Prior and Latent Diffusion Anton Razzhigaev et.al. 2310.03502v1 link
2023-10-05 Deep Generative Models of Music Expectation Ninon Lizé Masclef et.al. 2310.03500v1 null
2023-10-05 An Extended Phase Graph-based framework for DANTE-SPACE simulations including physiological, temporal, and spatial variations Matthijs H. S. de Buck et.al. 2310.03429v1 null
2023-10-04 Consistent-1-to-3: Consistent Image to 3D View Synthesis via Geometry-aware Diffusion Models Jianglong Ye et.al. 2310.03020v1 null
2023-10-04 Efficient-3DiM: Learning a Generalizable Single-image Novel-view Synthesizer in One Day Yifan Jiang et.al. 2310.03015v1 null
2023-10-04 Probing Intersectional Biases in Vision-Language Models with Counterfactual Examples Phillip Howard et.al. 2310.02988v1 null
2023-10-04 T $^3$ Bench: Benchmarking Current Progress in Text-to-3D Generation Yuze He et.al. 2310.02977v1 link
2023-10-04 Fast, Expressive SE $(n)$ Equivariant Networks through Weight-Sharing in Position-Orientation Space Erik J Bekkers et.al. 2310.02970v1 link
2023-10-04 Optimal Transport with Adaptive Regularisation Hugues Van Assel et.al. 2310.02925v1 null
2023-10-04 Boosting Dermatoscopic Lesion Segmentation via Diffusion Models with Visual and Textual Prompts Shiyi Du et.al. 2310.02906v1 null
2023-10-03 Hierarchical Generation of Human-Object Interactions with Diffusion Probabilistic Models Huaijin Pi et.al. 2310.02242v1 null
2023-10-03 Leveraging Diffusion Disentangled Representations to Mitigate Shortcuts in Underspecified Visual Tasks Luca Scimeca et.al. 2310.02230v1 null
2023-10-03 Global Attractor for a Reaction-Diffusion Model Arising in Biological Dynamic in 3D Soil Structure Mohamed Elghandouri et.al. 2310.02060v1 null
2023-10-03 AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model Zibin Dong et.al. 2310.02054v1 null
2023-10-03 Spectral operator learning for parametric PDEs without data reliance Junho Choi et.al. 2310.02013v1 null
2023-10-03 Optimizing microlens arrays for incoherent HiLo microscopy Ziao Jiao et.al. 2310.01939v1 null
2023-10-02 LLM-grounded Video Diffusion Models Long Lian et.al. 2309.17444v2 null
2023-09-29 Directly Fine-Tuning Diffusion Models on Differentiable Rewards Kevin Clark et.al. 2309.17400v1 null
2023-09-29 Physics-Informed Neural Network for the Transient Diffusivity Equation in Reservoir Engineering Daniel Badawi et.al. 2309.17345v1 null
2023-09-28 KV Inversion: KV Embeddings Learning for Text-Conditioned Real Image Action Editing Jiancheng Huang et.al. 2309.16608v1 null
2023-09-28 CCEdit: Creative and Controllable Video Editing via Diffusion Models Ruoyu Feng et.al. 2309.16496v1 null
2023-09-28 Distilling ODE Solvers of Diffusion Models into Smaller Steps Sanghwan Kim et.al. 2309.16421v1 null
2023-09-27 Exploiting the Signal-Leak Bias in Diffusion Models Martin Nicolas Everaert et.al. 2309.15842v1 null
2023-09-27 Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation David Junhao Zhang et.al. 2309.15818v1 link
2023-09-27 Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack Xiaoliang Dai et.al. 2309.15807v1 null
2023-09-27 Factorized Diffusion Architectures for Unsupervised Image Generation and Segmentation Xin Yuan et.al. 2309.15726v1 null
2023-09-27 Dynamic Prompt Learning: Addressing Cross-Attention Leakage for Text-Based Image Editing Kai Wang et.al. 2309.15664v1 link
2023-09-27 Direct Sensing of Remote Nuclei: Expanding the Reach of Cross-Effect Dynamic Nuclear Polarization Amaria Javed et.al. 2309.15653v1 null
2023-09-26 Generating Visual Scenes from Touch Fengyu Yang et.al. 2309.15117v1 null
2023-09-27 LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models Yaohui Wang et.al. 2309.15103v2 link
2023-09-26 FEC: Three Finetuning-free Methods to Enhance Consistency for Real Image Editing Songyan Chen et.al. 2309.14934v1 null
2023-09-27 ITEM3D: Illumination-Aware Directional Texture Editing for 3D Models Shengqi Liu et.al. 2309.14872v2 null
2023-09-26 Navigating Text-To-Image Customization:From LyCORIS Fine-Tuning to Model Evaluation Shin-Ying Yeh et.al. 2309.14859v1 link
2023-09-25 Dataset Diffusion: Diffusion-based Synthetic Dataset Generation for Pixel-Level Semantic Segmentation Quang Nguyen et.al. 2309.14303v1 link
2023-09-25 Soft Mixture Denoising: Beyond the Expressive Bottleneck of Diffusion Models Yangming Li et.al. 2309.14068v1 null
2023-09-25 Mixing as a correlated aggregation process Joris Heyman et.al. 2309.14040v1 link
2023-09-22 MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation Jiahao Xie et.al. 2309.13042v1 link
2023-09-22 Diffusion Augmentation for Sequential Recommendation Qidong Liu et.al. 2309.12858v1 link
2023-09-22 Accuracy and stability analysis of horizontal discretizations used in unstructured grid ocean models Fabricio Rodrigues Lapolli et.al. 2309.12832v1 null
2023-09-22 Synthetic Boost: Leveraging Synthetic Data for Enhanced Vision-Language Segmentation in Echocardiography Rabin Adhikari et.al. 2309.12829v1 link
2023-09-22 Semantic Change Driven Generative Semantic Communication Framework Wanting Yang et.al. 2309.12775v1 link
2023-09-21 A Diffusion-Model of Joint Interactive Navigation Matthew Niedoba et.al. 2309.12508v1 null
2023-09-21 License Plate Super-Resolution Using Diffusion Models Sawsan AlHalawani et.al. 2309.12506v1 null
2023-09-21 Performance Conditioning for Diffusion-Based Multi-Instrument Music Synthesis Ben Maman et.al. 2309.12283v1 null
2023-09-20 FreeU: Free Lunch in Diffusion U-Net Chenyang Si et.al. 2309.11497v1 link
2023-09-20 Generative Agent-Based Modeling: Unveiling Social System Dynamics through Coupling Mechanistic Models with Generative Artificial Intelligence Navid Ghaffarzadegan et.al. 2309.11456v1 null
2023-09-20 Deep Networks as Denoising Algorithms: Sample-Efficient Learning of Diffusion Models in High-Dimensional Graphical Models Song Mei et.al. 2309.11420v1 null
2023-09-20 EDMP: Ensemble-of-costs-guided Diffusion for Motion Planning Kallol Saha et.al. 2309.11414v1 link
2023-09-20 Face Aging via Diffusion-based Editing Xiangyi Chen et.al. 2309.11321v1 link
2023-09-20 FaceDiffuser: Speech-Driven 3D Facial Animation Synthesis Using Diffusion Stefan Stan et.al. 2309.11306v1 link
2023-09-19 PGDiff: Guiding Diffusion Models for Versatile Face Restoration via Partial Guidance Peiqing Yang et.al. 2309.10810v1 link
2023-09-19 Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation Yatong Bai et.al. 2309.10740v1 link
2023-09-19 Reconstruct-and-Generate Diffusion Model for Detail-Preserving Image Denoising Yujin Wang et.al. 2309.10714v1 null
2023-09-18 Generating and Imputing Tabular Data via Diffusion and Flow-based Gradient-Boosted Trees Alexia Jolicoeur-Martineau et.al. 2309.09968v1 link
2023-09-18 What is a Fair Diffusion Model? Designing Generative Text-To-Image Models to Incorporate Various Worldviews Zoe De Simone et.al. 2309.09944v1 link
2023-09-18 DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving Xiaofeng Wang et.al. 2309.09777v1 null
2023-09-18 Application-driven Validation of Posteriors in Inverse Problems Tim J. Adler et.al. 2309.09764v1 null
2023-09-19 Non-Hermitian physics and topological phenomena in convective thermal metamaterials Zhoufei Liu et.al. 2309.09681v2 null
2023-09-18 Anomalous Diffusion of Lithium-Anion Clusters in Ionic Liquids YeongKyu Lee et.al. 2309.09674v1 null
2023-09-15 Compositional Foundation Models for Hierarchical Planning Anurag Ajay et.al. 2309.08587v1 null
2023-09-15 Denoising Diffusion Probabilistic Models for Hardware-Impaired Communications Mehdi Letafati et.al. 2309.08568v1 null
2023-09-15 Breathing New Life into 3D Assets with Generative Repainting Tianfu Wang et.al. 2309.08523v1 link
2023-09-15 Diffuse-illumination holographic optical coherence tomography Léo Puyo et.al. 2309.08486v1 null
2023-09-15 Large-Vocabulary 3D Diffusion Model with Transformer Ziang Cao et.al. 2309.07920v2 null
2023-09-14 Generative Image Dynamics Zhengqi Li et.al. 2309.07906v1 null
2023-09-14 Beta Diffusion Mingyuan Zhou et.al. 2309.07867v1 link
2023-09-14 Study and evaluation of the Ronen Method accuracy at material interfaces Johan Cufe et.al. 2309.07756v1 null
2023-09-14 Dual-angle interferometric scattering microscopy for optical multiparametric particle characterization Erik Olsén et.al. 2309.07572v1 null
2023-09-13 UnifiedGesture: A Unified Gesture Synthesis Model for Multiple Skeletons Sicheng Yang et.al. 2309.07051v1 link
2023-09-13 Experimental Study on the Detection of Frozen Diffused Ammonia Blockage in the Inactive Section of a Variable Conductance Heat Pipe F. K. Miranda et.al. 2309.06936v1 null
2023-09-13 DreamStyler: Paint by Style Inversion with Text-to-Image Diffusion Models Namhyuk Ahn et.al. 2309.06933v1 null
2023-09-13 MagiCapture: High-Resolution Multi-Concept Portrait Customization Junha Hyung et.al. 2309.06895v1 null
2023-09-13 DCTTS: Discrete Diffusion Model with Contrastive Learning for Text-to-speech Generation Zhichao Wu et.al. 2309.06787v1 null
2023-09-13 High throughput sampling of phase space with deep learning potentials: $δ$ -AlOOH at geophysical conditions Chenxing Luo et.al. 2309.06712v1 null
2023-09-13 Generalizable improvement of the Spalart-Allmaras model through assimilation of experimental data Deepinder Jot Singh Aulakh et.al. 2309.06679v1 null
2023-09-12 InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation Xingchao Liu et.al. 2309.06380v1 link
2023-09-12 Dispersion versus diffusion in mixing fronts Gauthier Rousseau et.al. 2309.06347v1 null
2023-09-12 Unraveling biochemical spatial patterns: machine learning approaches to the inverse problem of Turing patterns Antonio Matas-Gil et.al. 2309.06339v1 link
2023-09-12 Fg-T2M: Fine-Grained Text-Driven Human Motion Generation via Diffusion Model Yin Wang et.al. 2309.06284v1 null
2023-09-11 Diffusion-Guided Reconstruction of Everyday Hand-Object Interaction Clips Yufei Ye et.al. 2309.05663v1 null
2023-09-11 PAI-Diffusion: Constructing and Serving a Family of Open Chinese Diffusion Models for Text-to-image Synthesis on the Cloud Chengyu Wang et.al. 2309.05534v1 null
2023-09-11 NExT-GPT: Any-to-Any Multimodal LLM Shengqiong Wu et.al. 2309.05519v1 link
2023-09-08 Variations and Relaxations of Normalizing Flows Keegan Kelly et.al. 2309.04433v1 null
2023-09-08 Create Your World: Lifelong Text-to-Image Diffusion Gan Sun et.al. 2309.04430v1 null
2023-09-08 MaskDiffusion: Boosting Text-to-Image Consistency with Conditional Mask Yupeng Zhou et.al. 2309.04399v1 null
2023-09-08 MoEController: Instruction-based Arbitrary Image Manipulation with Mixture-of-Expert Controllers Sijia Li et.al. 2309.04372v1 null
2023-09-08 The role of tumbling in bacterial scattering at convex obstacles Theresa Jakuszeit et.al. 2309.04326v1 null
2023-09-07 InstructDiffusion: A Generalist Modeling Interface for Vision Tasks Zigang Geng et.al. 2309.03895v1 null
2023-09-07 DiffusionEngine: Diffusion Model is Scalable Data Engine for Object Detection Manlin Zhang et.al. 2309.03893v1 null
2023-09-07 Text-to-feature diffusion for audio-visual few-shot learning Otniel-Bogdan Mercea et.al. 2309.03869v1 link
2023-09-07 Phasic Content Fusing Diffusion Model with Directional Distribution Consistency for Few-Shot Model Adaption Teng Hu et.al. 2309.03729v1 link
2023-09-07 DiffDefense: Defending against Adversarial Attacks via Diffusion Models Hondamunige Prasanna Silva et.al. 2309.03702v1 link
2023-09-07 Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance Fields using Geometry-Guided Text-to-Image Diffusion Model Sungwon Hwang et.al. 2309.03550v1 null
2023-09-06 My Art My Choice: Adversarial Protection Against Unruly AI Anthony Rhodes et.al. 2309.03198v1 null
2023-09-06 SLiMe: Segment Like Me Aliasghar Khani et.al. 2309.03179v1 link
2023-09-06 MCM: Multi-condition Motion Synthesis Framework for Multi-scenario Zeyu Ling et.al. 2309.03031v1 null
2023-09-05 Generating Realistic Images from In-the-wild Sounds Taegyeong Lee et.al. 2309.02405v1 null
2023-09-05 A Diffusion Quantum Monte Carlo Approach to the Polaritonic Ground State Braden M. Weight et.al. 2309.02349v1 link
2023-09-05 Robust frequency-dependent diffusion kurtosis computation using an efficient direction scheme, axisymmetric modelling, and spatial regularization J. Hamilton et.al. 2309.02319v1 null
2023-09-05 Neuromorphic nanocluster networks: Critical role of the substrate in nano-link formation Wenkai Wu et.al. 2309.02299v1 null
2023-09-05 Robustness and Generalizability of Deepfake Detection: A Study with Diffusion Models Haixu Song et.al. 2309.02218v1 link
2023-09-01 Iterative Multi-granular Image Editing using Diffusion Models K J Joseph et.al. 2309.00613v1 null
2023-09-01 VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation Xin Li et.al. 2309.00398v1 null
2023-09-01 Fast Diffusion EM: a diffusion model for blind inverse problems with application to deconvolution Charles Laroche et.al. 2309.00287v1 link
2023-09-01 Data-driven Topology Optimization of Channel Flow Problems Ce Guan et.al. 2309.00278v1 null
2023-08-31 InterDiff: Generating 3D Human-Object Interactions with Physics-Informed Diffusion Sirui Xu et.al. 2308.16905v1 link
2023-09-01 GNFactor: Multi-Task Real Robot Learning with Generalizable Neural Feature Fields Yanjie Ze et.al. 2308.16891v2 link
2023-08-31 Prediction of Diblock Copolymer Morphology via Machine Learning Hyun Park et.al. 2308.16886v1 null
2023-08-31 Diffusion Models for Interferometric Satellite Aperture Radar Alexandre Tuel et.al. 2308.16847v1 link
2023-09-01 Irregular Traffic Time Series Forecasting Based on Asynchronous Spatio-Temporal Graph Convolutional Network Weijia Zhang et.al. 2308.16818v2 null
2023-09-01 Ref-Diff: Zero-shot Referring Image Segmentation with Generative Models Minheng Ni et.al. 2308.16777v2 null
2023-08-31 Terrain Diffusion Network: Climatic-Aware Terrain Generation with Geological Sketch Guidance Zexin Hu et.al. 2308.16725v1 null
2023-08-30 SignDiff: Learning Diffusion Models for American Sign Language Production Sen Fang et.al. 2308.16082v1 null
2023-08-30 Click Metamaterials: Fast Acquisition of Thermal Conductivity and Functionality Diversities Chengmeng Wang et.al. 2308.16057v1 null
2023-08-30 DiffuVolume: Diffusion Model for Volume based Stereo Matching Dian Zheng et.al. 2308.15989v1 null
2023-08-30 Physics-Informed DeepMRI: Bridging the Gap from Heat Diffusion to k-Space Interpolation Zhuo-Xu Cui et.al. 2308.15918v1 null
2023-08-29 ParaGuide: Guided Diffusion Paraphrasers for Plug-and-Play Textual Style Transfer Zachary Horvitz et.al. 2308.15459v1 link
2023-08-29 Vortex core radius in baroclinic turbulence: Implications for scaling predictions Gabriel Hadjerci et.al. 2308.15398v1 null
2023-08-29 Rayleigh-Bénard instability in a horizontal porous layer with anomalous diffusion Antonio Barletta et.al. 2308.15359v1 null
2023-08-30 Elucidating the Exposure Bias in Diffusion Models Mang Ning et.al. 2308.15321v2 link
2023-08-28 Total Selfie: Generating Full-Body Selfies Bowei Chen et.al. 2308.14740v1 null
2023-08-28 Oscillating reaction in porous media under saddle flow Satoshi Izumoto et.al. 2308.14723v1 null
2023-08-28 360-Degree Panorama Generation from Few Unregistered NFoV Images Jionghao Wang et.al. 2308.14686v1 link
2023-08-28 Effect of gas diffusion layer fiber shape on cathode two-phase dynamics in proton exchange membrane fuel cells Danan Yang et.al. 2308.14539v1 null
2023-08-28 Priority-Centric Human Motion Generation in Discrete Latent Space Hanyang Kong et.al. 2308.14480v1 null
2023-08-25 Distribution-Aligned Diffusion for Human Mesh Recovery Lin Geng Foo et.al. 2308.13369v1 null
2023-08-25 Age of Information Diffusion on Social Networks: Optimizing Multi-Stage Seeding Strategies Songhua Li et.al. 2308.13303v1 null
2023-08-25 EfficientDreamer: High-Fidelity and Robust 3D Creation via Orthogonal-view Diffusion Prior Minda Zhao et.al. 2308.13223v1 link
2023-08-25 Diff-Retinex: Rethinking Low-light Image Enhancement with A Generative Diffusion Model Xunpeng Yi et.al. 2308.13164v1 null
2023-08-25 A Survey of Diffusion Based Image Generation Models: Issues and Their Solutions Tianyi Zhang et.al. 2308.13142v1 null
2023-08-24 Dense Text-to-Image Generation with Attention Modulation Yunji Kim et.al. 2308.12964v1 link
2023-08-24 Language as Reality: A Co-Creative Storytelling Game Experience in 1001 Nights using Generative AI Yuqian Sun et.al. 2308.12915v1 null
2023-08-24 Hydrogen jet diffusion modeling by using physics-informed graph neural network and sparsely-distributed sensor data Xinqi Zhang et.al. 2308.12621v1 null
2023-08-24 APLA: Additional Perturbation for Latent Noise with Adversarial Training Enables Consistency Yupu Yao et.al. 2308.12605v1 null
2023-08-23 On-Manifold Projected Gradient Descent Aaron Mahler et.al. 2308.12279v1 null
2023-08-23 Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning Jiasheng Ye et.al. 2308.12219v1 link
2023-08-23 Pulse shape discrimination for the CONUS experiment in the keV and sub-keV regime H. Bonet et.al. 2308.12105v1 null
2023-08-22 Theory of Transverse Mode Instability in Fiber Amplifiers with Multimode Excitations Kabish Wisal et.al. 2308.11599v1 null
2023-08-22 NIPG-DG schemes for transformed master equations modeling open quantum systems Jose A. Morales Escalante et.al. 2308.11580v1 null
2023-08-22 IT3D: Improved Text-to-3D Generation with Explicit View Synthesis Yiwen Chen et.al. 2308.11473v1 link
2023-08-22 SDeMorph: Towards Better Facial De-morphing from Single Morph Nitish Shukla et.al. 2308.11442v1 null
2023-08-21 TADA! Text to Animatable Digital Avatars Tingting Liao et.al. 2308.10899v1 null
2023-08-21 Election Manipulation in Social Networks with Single-Peaked Agents Vincenzo Auletta et.al. 2308.10845v1 null
2023-08-21 Backdooring Textual Inversion for Concept Censorship Yutong wu et.al. 2308.10718v1 null
2023-08-21 EVE: Efficient zero-shot text-based Video Editing with Depth Map Guidance and Temporal Consistency Constraints Yutao Chen et.al. 2308.10648v1 null
2023-08-21 Frequency Compensated Diffusion Model for Real-scene Dehazing Jing Wang et.al. 2308.10510v1 link
2023-08-21 Texture Generation on 3D Meshes with Point-UV Diffusion Xin Yu et.al. 2308.10490v1 null
2023-08-18 Diff2Lip: Audio Conditioned Diffusion Models for Lip-Synchronization Soumik Mukhopadhyay et.al. 2308.09716v1 link
2023-08-18 HumanLiff: Layer-wise 3D Human Generation with Diffusion Model Shoukang Hu et.al. 2308.09712v1 null
2023-08-18 SimDA: Simple Diffusion Adapter for Efficient Video Generation Zhen Xing et.al. 2308.09710v1 null
2023-08-18 Guide3D: Create 3D Avatars from Text and Image Guidance Yukang Cao et.al. 2308.09705v1 null
2023-08-18 PoSynDA: Multi-Hypothesis Pose Synthesis Domain Adaptation for Robust 3D Human Pose Estimation Hanbing Liu et.al. 2308.09678v1 link
2023-08-18 Constrained Bayesian Optimization Using a Lagrange Multiplier Applied to Power Transistor Design Ping-Ju Chuang et.al. 2308.09612v1 null
2023-08-18 Language-Guided Diffusion Model for Visual Grounding Sijia Chen et.al. 2308.09599v1 null
2023-08-18 StableVideo: Text-driven Consistency-aware Diffusion Video Editing Wenhao Chai et.al. 2308.09592v1 link
2023-08-18 O^2-Recon: Completing 3D Reconstruction of Occluded Objects in the Scene with a Pre-trained 2D Diffusion Model Yubin Hu et.al. 2308.09591v1 link
2023-08-16 TeCH: Text-guided Reconstruction of Lifelike Clothed Humans Yangyi Huang et.al. 2308.08545v1 link
2023-08-16 Voxlines: Streamline Transparency through Voxelization and View-Dependent Line Orders Besm Osman et.al. 2308.08436v1 null
2023-08-16 Diff-CAPTCHA: An Image-based CAPTCHA with Security Enhanced by Denoising Diffusion Model Ran Jiang et.al. 2308.08367v1 null
2023-08-18 Dual-Stream Diffusion Net for Text-to-Video Generation Binhui Liu et.al. 2308.08316v2 null
2023-08-16 Electron transfer efficiency in liquid xenon across THGEM holes G. Martínez-Lema et.al. 2308.08314v1 null
2023-08-15 StyleDiffusion: Controllable Disentangled Style Transfer via Diffusion Models Zhizhong Wang et.al. 2308.07863v1 null
2023-08-15 CCD-3DR: Consistent Conditioning in Diffusion for Single-Image 3D Reconstruction Yan Di et.al. 2308.07837v1 null
2023-08-15 DiffV2S: Diffusion-based Video-to-Speech Synthesis with Vision-guided Speaker Embedding Jeongsoo Choi et.al. 2308.07787v1 link
2023-08-15 Dancing Avatar: Pose and Text-Guided Human Motion Videos Synthesis with Image Diffusion Model Bosheng Qin et.al. 2308.07749v1 null
2023-08-14 Jurassic World Remake: Bringing Ancient Fossils Back to Life via Zero-Shot Long Image-to-Image Translation Alexander Martin et.al. 2308.07316v1 link
2023-08-14 DiffSED: Sound Event Detection with Denoising Diffusion Swapnil Bhosale et.al. 2308.07293v1 null
2023-08-14 Diffusion Based Augmentation for Captioning and Retrieval in Cultural Heritage Dario Cioni et.al. 2308.07151v1 link
2023-08-14 Temporal clustering of social interactions trades-off disease spreading and knowledge diffusion Giulia Cencetti et.al. 2308.07058v1 link
2023-08-14 Bayesian Flow Networks Alex Graves et.al. 2308.07037v1 link
2023-08-14 An efficient topology optimization method for steady gas flows in all flow regimes Ruifeng Yuan et.al. 2308.07018v1 null
2023-08-14 Discrete Conditional Diffusion for Reranking in Recommendation Xiao Lin et.al. 2308.06982v1 null
2023-08-11 Acoustofluidic Engineering Functional Vessel-on-a-Chip Yue Wu et.al. 2308.06219v1 null
2023-08-11 DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models Weijia Wu et.al. 2308.06160v1 link
2023-08-11 Taming the Power of Diffusion Models for High-Quality Virtual Try-On with Appearance Flow Junhong Gou et.al. 2308.06101v1 link
2023-08-11 Diffusion-based Visual Counterfactual Explanations -- Towards Systematic Quantitative Evaluation Philipp Vaeth et.al. 2308.06100v1 link
2023-08-11 Head Rotation in Denoising Diffusion Models Andrea Asperti et.al. 2308.06057v1 link
2023-08-11 Diverse Data Augmentation with Diffusions for Effective Test-time Prompt Tuning Chun-Mei Feng et.al. 2308.06038v1 link
2023-08-11 Masked-Attention Diffusion Guidance for Spatially Controlling Text-to-Image Generation Yuki Endo et.al. 2308.06027v1 link
2023-08-14 Audio is all in one: speech-driven gesture synthetics using WavLM pre-trained model Fan Zhang et.al. 2308.05995v2 null
2023-08-10 AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining Haohe Liu et.al. 2308.05734v1 link
2023-08-10 PDE-Refiner: Achieving Accurate Long Rollouts with Neural PDE Solvers Phillip Lippe et.al. 2308.05732v1 null
2023-08-10 Masked Diffusion as Self-supervised Representation Learner Zixuan Pan et.al. 2308.05695v1 null
2023-08-10 Generative Diffusion Models for Radio Wireless Channel Modelling and Sampling Ushnish Sengupta et.al. 2308.05583v1 null
2023-08-10 Fokker-Planck-Poisson kinetics: Multi-phase flow beyond equilibrium Mohsen Sadr et.al. 2308.05580v1 null
2023-08-09 LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation Leigang Qu et.al. 2308.05095v1 null
2023-08-09 Do Diffusion Models Suffer Error Propagation? Theoretical Analysis and Consistency Regularization Yangming Li et.al. 2308.05021v1 null
2023-08-10 IDiff-Face: Synthetic-based Face Recognition through Fizzy Identity-Conditioned Diffusion Models Fadi Boutros et.al. 2308.04995v2 link
2023-08-09 CasCIFF: A Cross-Domain Information Fusion Framework Tailored for Cascade Prediction in Social Networks Hongjun Zhu et.al. 2308.04961v1 link
2023-08-09 Interaction-induced directional transport on periodically driven chains Helena Drüeke et.al. 2308.04845v1 null
2023-08-08 DiffCR: A Fast Conditional Diffusion Framework for Cloud Removal from Optical Satellite Images Xuechao Zou et.al. 2308.04417v1 link
2023-08-08 Cloth2Tex: A Customized Cloth Texture Generation Pipeline for 3D Virtual Try-On Daiheng Gao et.al. 2308.04288v1 null
2023-08-08 MCDAN: a Multi-scale Context-enhanced Dynamic Attention Network for Diffusion Prediction Xiaowen Wang et.al. 2308.04266v1 null
2023-08-08 FLIRT: Feedback Loop In-context Red Teaming Ninareh Mehrabi et.al. 2308.04265v1 null
2023-08-08 MindDiffuser: Controlled Image Reconstruction from Human Brain Activity with Semantic and Structural Diffusion Yizhuo Lu et.al. 2308.04249v1 link
2023-08-08 Synthetic Augmentation with Large-scale Unconditional Pre-training Jiarong Ye et.al. 2308.04020v1 link
2023-08-07 Diffusion Model in Causal Inference with Unmeasured Confounders Tatsuhiro Shimizu et.al. 2308.03669v1 link
2023-08-07 AvatarVerse: High-quality & Stable 3D Avatar Creation from Text and Pose Huichao Zhang et.al. 2308.03610v1 link
2023-08-08 DiffSynth: Latent In-Iteration Deflickering for Realistic Video Synthesis Zhongjie Duan et.al. 2308.03463v2 link
2023-08-04 Quantum Dynamical Approach to Predicting the Optical Pumping Threshold for Lasing in Organic Materials Bin Zhang et.al. 2308.02447v1 null
2023-08-04 Diffusion-Augmented Depth Prediction with Sparse Annotations Jiaqi Li et.al. 2308.02283v1 null
2023-08-04 Painterly Image Harmonization using Diffusion Model Lingxiao Lu et.al. 2308.02228v1 link
2023-08-03 Synthesizing Long-Term Human Motions with Diffusion Models via Coherent Sampling Zhao Yang et.al. 2308.01850v1 link
2023-08-03 DiffColor: Toward High Fidelity Text-Guided Image Colorization with Diffusion Models Jianxin Lin et.al. 2308.01655v1 null
2023-08-03 Reference-Free Isotropic 3D EM Reconstruction using Diffusion Models Kyungryun Lee et.al. 2308.01594v1 null
2023-08-03 Adversarial Training of Denoising Diffusion Model Using Dual Discriminators for High-Fidelity Multi-Speaker TTS Myeongjin Ko et.al. 2308.01573v1 link
2023-08-02 Patched Denoising Diffusion Models For High-Resolution Image Synthesis Zheng Ding et.al. 2308.01316v1 link
2023-08-02 Contrast-augmented Diffusion Model with Fine-grained Sequence Alignment for Markup-to-Image Generation Guojin Zhong et.al. 2308.01147v1 link
2023-08-02 DiffusePast: Diffusion-based Generative Replay for Class Incremental Semantic Segmentation Jingfan Chen et.al. 2308.01127v1 null
2023-08-01 Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models Cheng-Yu Hsieh et.al. 2308.00675v1 null
2023-08-01 Diffusion Model for Camouflaged Object Detection Zhennan Chen et.al. 2308.00303v1 null
2023-07-31 Universal Adversarial Defense in Remote Sensing Based on Pre-trained Denoising Diffusion Models Weikang Yu et.al. 2307.16865v1 null
2023-07-31 From Generation to Suppression: Towards Effective Irregular Glow Removal for Nighttime Visibility Enhancement Wanyu Wu et.al. 2307.16783v1 null
2023-07-31 Understanding Dynamics in Coarse-Grained Models: III. Roles of Rotational Motion and Translation-Rotation Coupling in Coarse-Grained Dynamics Jaehyeok Jin et.al. 2307.16747v1 null
2023-07-31 DiffPose: SpatioTemporal Diffusion Model for Video-Based Human Pose Estimation Runyang Feng et.al. 2307.16687v1 null
2023-07-31 On the Trustworthiness Landscape of State-of-the-art Generative Models: A Comprehensive Survey Mingyuan Fan et.al. 2307.16680v1 null
2023-07-28 Understanding the Anomalous Diffusion of Water in Aqueous Electrolytes Using Machine Learned Potentials Nikhil V. S. Avula et.al. 2307.15576v1 null
2023-07-28 Minimally-Supervised Speech Synthesis with Conditional Diffusion Model and Language Model: A Comparative Study of Semantic Coding Chunyu Qiang et.al. 2307.15484v1 null
2023-07-27 The RoboDepth Challenge: Methods and Advancements Towards Robust Depth Estimation Lingdong Kong et.al. 2307.15061v1 link
2023-07-27 TEDi: Temporally-Entangled Diffusion for Long-Term Motion Synthesis Zihan Zhang et.al. 2307.15042v1 null
2023-07-27 Generative convective parametrization of dry atmospheric boundary layer Florian Heyder et.al. 2307.14857v1 null
2023-07-27 Empirical analysis of congestion spreading in Seoul traffic network Jung-Hoon Jung et.al. 2307.14800v1 null
2023-07-26 Virtual Mirrors: Non-Line-of-Sight Imaging Beyond the Third Bounce Diego Royo et.al. 2307.14341v1 null
2023-07-26 Visual Instruction Inversion: Image Editing via Visual Prompting Thao Nguyen et.al. 2307.14331v1 link
2023-07-26 Founding a mathematical diffusion model in linguistics. The case study of German syntactic features in the North-Eastern Italian dialects I. Lazzizzera et.al. 2307.14291v1 null
2023-07-26 VideoControlNet: A Motion-Guided Video-to-Video Translation Framework by Using Diffusion Model with ControlNet Zhihao Hu et.al. 2307.14073v1 null
2023-07-25 Comparing phase-space and phenomenological modeling approaches for Lagrangian particles settling in a turbulent boundary layer Andrew P. Grace et.al. 2307.13659v1 null
2023-07-25 Fake It Without Making It: Conditioned Face Generation for Accurate 3D Face Shape Estimation Will Rowan et.al. 2307.13639v1 null
2023-07-25 XDLM: Cross-lingual Diffusion Language Model for Machine Translation Linyao Chen et.al. 2307.13560v1 null
2023-07-25 Not with my name! Inferring artists' names of input strings employed by Diffusion Models Roberto Leotta et.al. 2307.13527v1 link
2023-07-24 A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models Jindong Gu et.al. 2307.12980v1 link
2023-07-24 Data-free Black-box Attack based on Diffusion Model Mingwen Shao et.al. 2307.12872v1 null
2023-07-24 Understanding the Latent Space of Diffusion Models through the Lens of Riemannian Geometry Yong-Hyun Park et.al. 2307.12868v1 link
2023-07-24 The ro-vibrational $ν_2$ mode spectrum of methane investigated by ultrabroadband coherent Raman spectroscopy Francesco Mazza et.al. 2307.12740v1 null
2023-07-21 FEDD -- Fair, Efficient, and Diverse Diffusion-based Lesion Segmentation and Malignancy Classification Héctor Carrión et.al. 2307.11654v1 link
2023-07-21 Mixbiotic society measures: Assessment of community well-going as living system Takeshi Kato et.al. 2307.11594v1 null
2023-07-21 Predict, Refine, Synthesize: Self-Guiding Diffusion Models for Probabilistic Time Series Forecasting Marcel Kollovieh et.al. 2307.11494v1 link
2023-07-20 Hypergraph Diffusions and Resolvents for Norm-Based Hypergraph Laplacians Konstantinos Ameranis et.al. 2307.11042v1 null
2023-07-20 Progressive distillation diffusion for raw music generation Svetlana Pavlova et.al. 2307.10994v1 null
2023-07-20 Energy-consistent discretization of viscous dissipation with application to natural convection flow Benjamin Sanderse et.al. 2307.10874v1 null
2023-07-19 FABRIC: Personalizing Diffusion Models with Iterative Feedback Dimitri von Rütte et.al. 2307.10159v1 link
2023-07-19 XSkill: Cross Embodiment Skill Discovery Mengda Xu et.al. 2307.09955v1 link
2023-07-19 Visual Representation for Patterned Proliferation of Social Media Addiction: Quantitative Model and Network Analysis Dibyajyoti Mallick et.al. 2307.09902v1 null
2023-07-19 BSDM: Background Suppression Diffusion Model for Hyperspectral Anomaly Detection Jitao Ma et.al. 2307.09861v1 link
2023-07-19 A Siamese-based Verification System for Open-set Architecture Attribution of Synthetic Images Lydia Abady et.al. 2307.09822v1 link
2023-07-18 AnyDoor: Zero-shot Object-level Image Customization Xi Chen et.al. 2307.09481v1 link
2023-07-18 Augmenting CLIP with Improved Visio-Linguistic Reasoning Samyadeep Basu et.al. 2307.09233v1 null
2023-07-17 Diffusion Models Beat GANs on Image Classification Soumik Mukhopadhyay et.al. 2307.08702v1 null
2023-07-17 Flow Matching in Latent Space Quan Dao et.al. 2307.08698v1 link
2023-07-17 SEMI-DiffusionInst: A Diffusion Model Based Approach for Semiconductor Defect Classification and Segmentation Vic De Ridder et.al. 2307.08693v1 null
2023-07-17 Multimodal Diffusion Segmentation Model for Object Segmentation from Manipulation Instructions Yui Iioka et.al. 2307.08597v1 null
2023-07-17 Identity-Preserving Aging of Face Images via Latent Diffusion Models Sudipta Banerjee et.al. 2307.08585v1 link
2023-07-17 Synthetic Lagrangian Turbulence by Generative Diffusion Models Tianyi Li et.al. 2307.08529v1 link
2023-07-17 How far does turbulence spread? Alexandros Alexakis et.al. 2307.08469v1 null
2023-07-17 Not All Steps are Created Equal: Selective Diffusion Distillation for Image Manipulation Luozhou Wang et.al. 2307.08448v1 link
2023-07-18 Unstoppable Attack: Label-Only Model Inversion via Conditional Diffusion Model Rongke Liu et.al. 2307.08424v2 null
2023-07-14 NIFTY: Neural Object Interaction Fields for Guided Human Motion Synthesis Nilesh Kulkarni et.al. 2307.07511v1 null
2023-07-14 DreamTeacher: Pretraining Image Backbones with Deep Generative Models Daiqing Li et.al. 2307.07487v1 null
2023-07-14 Inverse Evolution Layers: Physics-informed Regularizers for Deep Neural Networks Chaoyu Liu et.al. 2307.07344v1 null
2023-07-14 High-density single-molecule maps reveal transient membrane receptor interactions within a dynamically varying environment Nicolas Mateos et.al. 2307.07334v1 null
2023-07-14 Multimodal Motion Conditioned Diffusion Model for Skeleton-based Video Anomaly Detection Alessandro Flaborea et.al. 2307.07205v1 link
2023-07-14 Federated Learning-Empowered AI-Generated Content in Wireless Networks Xumin Huang et.al. 2307.07146v1 null
2023-07-13 HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models Nataniel Ruiz et.al. 2307.06949v1 null
2023-07-12 Exposing the Fake: Effective Diffusion-Generated Images Detection Ruipeng Ma et.al. 2307.06272v1 null
2023-07-12 Diffusion Based Multi-Agent Adversarial Tracking Sean Ye et.al. 2307.06244v1 null
2023-07-12 Functional light diffusers based on hybrid CsPbBr $_3$/SiO$_2$ aero-framework structures for laser light illumination and conversion Lena M. Saure et.al. 2307.06197v1 null
2023-07-12 Biofilm.jl: a fast solver for one-dimensional biofilm chemistry and ecology Mark Owkes et.al. 2307.06153v1 link
2023-07-11 Metropolis Sampling for Constrained Diffusion Models Nic Fishman et.al. 2307.05439v1 null
2023-07-11 On the Vulnerability of DeepFake Detectors to Attacks Generated by Denoising Diffusion Models Marija Ivanovska et.al. 2307.05397v1 null
2023-07-10 Shelving, Stacking, Hanging: Relational Pose Diffusion for Multi-modal Rearrangement Anthony Simeonov et.al. 2307.04751v1 null
2023-07-10 Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Feedback Jaskirat Singh et.al. 2307.04749v1 null
2023-07-10 Diffusion Policies for Out-of-Distribution Generalization in Offline Reinforcement Learning Suzan Ece Ada et.al. 2307.04726v1 null
2023-07-10 AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning Yuwei Guo et.al. 2307.04725v1 link
2023-07-10 Machine learning potentials with Iterative Boltzmann Inversion: training to experiment Sakib Matin et.al. 2307.04712v1 null
2023-07-10 Encapsulation Structure and Dynamics in Hypergraphs Timothy LaRock et.al. 2307.04613v1 link
2023-07-07 Three-dimensional Vorticity Effects on Extinction Behavior of Laminar Flamelets Wes Hellwig et.al. 2307.03695v1 null
2023-07-07 Simulation-free Schrödinger bridges via score and flow matching Alexander Tong et.al. 2307.03672v1 link
2023-07-07 IPO-LDM: Depth-aided 360-degree Indoor RGB Panorama Outpainting via Latent Diffusion Model Tianhao Wu et.al. 2307.03177v2 null
2023-07-06 How to Detect Unauthorized Data Usages in Text-to-image Diffusion Models Zhenting Wang et.al. 2307.03108v1 link
2023-07-06 Origin-Destination Travel Time Oracle for Map-based Services Yan Lin et.al. 2307.03048v1 null
2023-07-06 Multi-modal multi-class Parkinson disease classification using CNN and decision level fusion Sushanta Kumar Sahu et.al. 2307.02978v1 null
2023-07-06 On the Cultural Gap in Text-to-Image Generation Bingshuai Liu et.al. 2307.02971v1 null
2023-07-05 DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models Chong Mou et.al. 2307.02421v1 link
2023-07-05 RADiff: Controllable Diffusion Models for Radio Astronomical Maps Generation Renato Sortino et.al. 2307.02392v1 null
2023-07-06 Error Approximation and Bias Correction in Dynamic Problems using a Recurrent Neural Network/Finite Element Hybrid Model Moritz von Tresckow et.al. 2307.02349v2 null
2023-07-05 Detecting Images Generated by Deep Diffusion Models using their Local Intrinsic Dimensionality Peter Lorenz et.al. 2307.02347v1 link
2023-07-05 SVDM: Single-View Diffusion Model for Pseudo-Stereo 3D Object Detection Yuguang Shi et.al. 2307.02270v1 null
2023-07-03 Improved sampling via learned diffusions Lorenz Richter et.al. 2307.01198v1 null
2023-07-03 Squeezing Large-Scale Diffusion Models for Mobile Jiwoong Choi et.al. 2307.01193v1 null
2023-07-03 Learning Mixtures of Gaussians Using the DDPM Objective Kulin Shah et.al. 2307.01178v1 null
2023-07-03 Investigating Data Memorization in 3D Latent Diffusion Models for Medical Image Synthesis Salman Ul Hassan Dar et.al. 2307.01148v1 null
2023-07-03 A phase field-based framework for electro-chemo-mechanical fracture: crack-contained electrolytes, chemical reactions and stabilisation T. Hageman et.al. 2307.01105v1 null
2023-07-03 MVDiffusion: Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion Shitao Tang et.al. 2307.01097v1 link
2023-07-03 TomatoDIFF: On-plant Tomato Segmentation with Denoising Diffusion Models Marija Ivanovska et.al. 2307.01064v1 link
2023-06-30 Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors Guocheng Qian et.al. 2306.17843v1 link
2023-06-30 Content-Preserving Diffusion Model for Unsupervised AS-OCT image Despeckling Li Sanqian et.al. 2306.17717v1 null
2023-06-29 Generate Anything Anywhere in Any Scene Yuheng Li et.al. 2306.17154v1 null
2023-06-29 Filtered-Guided Diffusion: Fast Filter Guidance for Black-Box Diffusion Models Zeqi Gu et.al. 2306.17141v1 link
2023-06-29 ID-Pose: Sparse-view Camera Pose Estimation by Inverting Diffusion Models Weihao Cheng et.al. 2306.17140v1 null
2023-06-29 Learning Nuclei Representations with Masked Image Modelling Piotr Wójcik et.al. 2306.17116v1 null
2023-06-29 Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation Zibo Zhao et.al. 2306.17115v1 link
2023-06-29 Towards rapid extracellular vesicles colorimetric detection using optofluidics-enhanced color-changing optical metasurface Chuchuan Hong et.al. 2306.17102v1 null
2023-06-28 DiffComplete: Diffusion-based Generative 3D Shape Completion Ruihang Chu et.al. 2306.16329v1 null
2023-06-28 UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data Heeseung Kim et.al. 2306.16083v1 link
2023-06-28 PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment Jianyuan Wang et.al. 2306.15667v2 null
2023-06-27 Stabilizing ultrathin Silver (Ag) films on different substrates Allamula Ashok et.al. 2306.15575v1 null
2023-06-27 Trajectory Generation, Control, and Safety with Denoising Diffusion Probabilistic Models Nicolò Botteghi et.al. 2306.15512v1 link
2023-06-27 Miniaturized gas-solid fluidized beds Fernando David Cúñez Benalcázar et.al. 2306.15463v1 null
2023-06-27 Adversarial Training for Graph Neural Networks Lukas Gosch et.al. 2306.15427v1 null
2023-06-26 Fuzzy-Conditioned Diffusion and Diffusion Projection Attention Applied to Facial Image Correction Majed El Helou et.al. 2306.14891v1 link
2023-06-26 Restart Sampling for Improving Generative Processes Yilun Xu et.al. 2306.14878v1 link
2023-06-26 ViNT: A Foundation Model for Visual Navigation Dhruv Shah et.al. 2306.14846v1 null
2023-06-26 ProtoDiff: Learning to Learn Prototypical Networks by Task-Guided Diffusion Yingjun Du et.al. 2306.14770v1 link
2023-06-23 Fast Macroscopic Forcing Method Spencer H. Bryngelson et.al. 2306.13625v1 link
2023-06-23 DreamEditor: Text-Driven 3D Scene Editing with Neural Fields Jingyu Zhuang et.al. 2306.13455v1 link
2023-06-22 Continuous Layout Editing of Single Images with Diffusion Models Zhiyuan Zhang et.al. 2306.13078v1 null
2023-06-22 Towards More Realistic Membership Inference Attacks on Large Diffusion Models Jan Dubiński et.al. 2306.12983v1 null
2023-06-22 On the nature of the two-positron bond: Evidence for a novel bond type Mohammad Goli et.al. 2306.12899v1 null
2023-06-22 Stress-induced Artificial neuron spiking in Diffusive memristors Debi Pattnaik et.al. 2306.12853v1 null
2023-06-21 DreamTime: An Improved Optimization Strategy for Text-to-3D Content Creation Yukun Huang et.al. 2306.12422v1 null
2023-06-21 HumanDiffusion: diffusion model using perceptual gradients Yota Ueda et.al. 2306.12169v1 null
2023-06-20 Learning Profitable NFT Image Diffusions via Multiple Visual-Policy Guided Reinforcement Learning Huiguo He et.al. 2306.11731v1 null
2023-06-20 Diffusion with Forward Models: Solving Stochastic Inverse Problems Without Direct Supervision Ayush Tewari et.al. 2306.11719v1 null
2023-06-20 Align, Adapt and Inject: Sound-guided Unified Image Generation Yue Yang et.al. 2306.11504v1 null
2023-06-20 EMoG: Synthesizing Emotive Co-speech 3D Gesture with Diffusion Model Lianying Yin et.al. 2306.11496v1 null
2023-06-16 Group Orthogonalization Regularization For Vision Models Adaptation and Robustness Yoav Kurtz et.al. 2306.10001v1 link
2023-06-16 Towards Better Certified Segmentation via Diffusion Models Othmane Laousy et.al. 2306.09949v1 null
2023-06-16 Unique information from common diffusion MRI models about white-matter differences across the human adult lifespan Rafael Neto Henriques1 et.al. 2306.09942v1 link
2023-06-16 Drag-guided diffusion models for vehicle image generation Nikos Arechiga et.al. 2306.09935v1 null
2023-06-16 Energy-Based Cross Attention for Bayesian Context Update in Text-to-Image Diffusion Models Geon Yeong Park et.al. 2306.09869v1 link
2023-06-16 AvatarBooth: High-Quality and Customizable 3D Human Avatar Generation Yifei Zeng et.al. 2306.09864v1 null
2023-06-15 Generative Proxemics: A Prior for 3D Social Interaction from Images Lea Müller et.al. 2306.09337v1 link
2023-06-15 ArtFusion: Arbitrary Style Transfer using Dual Conditional Latent Diffusion Models Dar-Yen Chen et.al. 2306.09330v1 link
2023-06-15 Diffusion Models for Zero-Shot Open-Vocabulary Segmentation Laurynas Karazija et.al. 2306.09316v1 null
2023-06-15 Fast Training of Diffusion Models with Masked Transformers Hongkai Zheng et.al. 2306.09305v1 link
2023-06-15 Conditional Human Sketch Synthesis with Explicit Abstraction Control Dar-Yen Chen et.al. 2306.09274v1 null
2023-06-15 Training Diffusion Classifiers with Denoising Assistance Chandramouli Sastry et.al. 2306.09192v1 null
2023-06-13 Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation Shuai Yang et.al. 2306.07954v1 null
2023-06-13 Viewset Diffusion: (0-)Image-Conditioned 3D Generative Models from 2D Data Stanislaw Szymanowicz et.al. 2306.07881v1 null
2023-06-13 Diffusive and convective dissolution of carbon dioxide in a vertical cylindrical cell Daniël P. Faasen et.al. 2306.07721v1 null
2023-06-12 Controlling Text-to-Image Diffusion by Orthogonal Finetuning Zeju Qiu et.al. 2306.07280v1 null
2023-06-12 MovieFactory: Automatic Movie Creation from Text using Large Generative Models for Language and Images Junchen Zhu et.al. 2306.07257v1 null
2023-06-12 Diffusion Models for Black-Box Optimization Siddarth Krishnamoorthy et.al. 2306.07180v1 link
2023-06-12 InstructP2P: Learning to Edit 3D Point Clouds with Text Instructions Jiale Xu et.al. 2306.07154v1 null
2023-06-12 Latent Dynamical Implicit Diffusion Processes Mohammad R. Rezaei et.al. 2306.07077v1 null
2023-06-09 Bridging Scales: a Hybrid Model to Simulate Vascular Tumor Growth and Treatment Response Tobias Duswald et.al. 2306.05994v1 link
2023-06-09 DDLP: Unsupervised Object-Centric Video Prediction with Deep Dynamic Latent Particles Tal Daniel et.al. 2306.05957v1 link
2023-06-09 Beyond Surface Statistics: Scene Representations in a Latent Diffusion Model Yida Chen et.al. 2306.05720v1 link
2023-06-12 Boosting Fast and High-Quality Speech Synthesis with Linear Diffusion Haogeng Liu et.al. 2306.05708v2 null
2023-06-09 RePaint-NeRF: NeRF Editting via Semantic Masks and Diffusion Models Xingchen Zhou et.al. 2306.05668v1 null
2023-06-08 Grounded Text-to-Image Synthesis with Attention Refocusing Quynh Phung et.al. 2306.05427v1 null
2023-06-08 ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion Process Changyao Tian et.al. 2306.05423v1 null
2023-06-08 Stochastic Multi-Person 3D Motion Forecasting Sirui Xu et.al. 2306.05421v1 link
2023-06-08 Improving Negative-Prompt Inversion via Proximal Guidance Ligong Han et.al. 2306.05414v1 link
2023-06-08 PriSampler: Mitigating Property Inference of Diffusion Models Hailong Hu et.al. 2306.05208v1 null
2023-06-08 SyncDiffusion: Coherent Montage via Synchronized Joint Diffusions Yuseung Lee et.al. 2306.05178v1 null
2023-06-07 Designing a Better Asymmetric VQGAN for StableDiffusion Zixin Zhu et.al. 2306.04632v1 link
2023-06-07 ARTIC3D: Learning Robust Articulated 3D Shapes from Noisy Web Image Collections Chun-Han Yao et.al. 2306.04619v1 null
2023-06-08 Integrating Geometric Control into Text-to-Image Diffusion Models for High-Quality Detection Data Generation via Text Prompt Kai Chen et.al. 2306.04607v2 null
2023-06-07 On the Design Fundamentals of Diffusion Models: A Survey Ziyi Chang et.al. 2306.04542v1 null
2023-06-07 Multi-modal Latent Diffusion Mustapha Bounoua et.al. 2306.04445v1 null
2023-06-07 Synthesizing realistic sand assemblies with denoising diffusion in latent space Nikolaos N. Vlassis et.al. 2306.04411v1 null
2023-06-07 Improving Diffusion-based Image Translation using Asymmetric Gradient Guidance Gihyun Kwon et.al. 2306.04396v1 link
2023-06-06 Emergent Correspondence from Image Diffusion Luming Tang et.al. 2306.03881v1 link
2023-06-06 Conditional Diffusion Models for Weakly Supervised Medical Image Segmentation Xinrong Hu et.al. 2306.03878v1 link
2023-06-06 Newly Formed Cities: an AI Curation Dario Negueruela del Castillo et.al. 2306.03753v1 null
2023-06-06 Towards Visual Foundational Models of Physical Scenes Chethan Parameshwara et.al. 2306.03727v1 null
2023-06-06 Diffusional exchange versus microscopic kurtosis from CTI: two conflicting interpretations of the same data Arthur Chakwizira et.al. 2306.03661v1 null
2023-06-05 Brain Diffusion for Visual Exploration: Cortical Discovery using Large Scale Generative Models Andrew F. Luo et.al. 2306.03089v1 null
2023-06-05 MotionDiffuser: Controllable Multi-Agent Motion Prediction using Diffusion Chiyu Max Jiang et.al. 2306.03083v1 null
2023-06-05 Influence of the finite transverse size of the accelerating region on the relativistic feedback Alexander Sedelnikov et.al. 2306.03059v1 null
2023-06-05 HeadSculpt: Crafting 3D Head Avatars with Text Xiao Han et.al. 2306.03038v1 null
2023-06-05 Interpretable Alzheimer's Disease Classification Via a Contrastive Diffusion Autoencoder Ayodeji Ijishakin et.al. 2306.03022v1 link
2023-06-05 Complex Preferences for Different Convergent Priors in Discrete Graph Diffusion Alex M. Tseng et.al. 2306.02957v1 null
2023-06-05 INDigo: An INN-Guided Probabilistic Diffusion Algorithm for Inverse Problems Di You et.al. 2306.02949v1 null
2023-06-05 Instruct-Video2Avatar: Video-to-Avatar Generation with Instructions Shaoxu Li et.al. 2306.02903v1 link
2023-06-02 Video Colorization with Pre-trained Text-to-Image Diffusion Models Hanyuan Liu et.al. 2306.01732v1 null
2023-06-02 Denoising Diffusion Semantic Segmentation with Mask Prior Modeling Zeqiang Lai et.al. 2306.01721v1 link
2023-06-02 DiffusEmp: A Diffusion Model-Based Framework with Multi-Grained Control for Empathetic Response Generation Guanqun Bi et.al. 2306.01657v1 null
2023-06-02 Influence Maximization with Fairness at Scale (Extended Version) Yuting Feng et.al. 2306.01587v1 null
2023-06-02 PolyDiffuse: Polygonal Shape Reconstruction via Guided Set Diffusion Models Jiacheng Chen et.al. 2306.01461v1 link
2023-06-02 Diffusion Self-Guidance for Controllable Image Generation Dave Epstein et.al. 2306.00986v2 null
2023-06-01 StableRep: Synthetic Images from Text-to-Image Models Make Strong Visual Representation Learners Yonglong Tian et.al. 2306.00984v1 link
2023-06-01 StyleDrop: Text-to-Image Generation in Any Style Kihyuk Sohn et.al. 2306.00983v1 null
2023-06-01 SnapFusion: Text-to-Image Diffusion Model on Mobile Devices within Two Seconds Yanyu Li et.al. 2306.00980v1 link
2023-06-01 Intriguing Properties of Text-guided Diffusion Models Qihao Liu et.al. 2306.00974v1 link
2023-06-01 Intelligent Grimm -- Open-ended Visual Storytelling via Latent Diffusion Models Chang Liu et.al. 2306.00973v1 link
2023-06-01 ViCo: Detail-Preserving Visual Condition for Personalized Text-to-Image Generation Shaozhe Hao et.al. 2306.00971v1 link
2023-06-01 The Hidden Language of Diffusion Models Hila Chefer et.al. 2306.00966v1 link
2023-06-01 Cocktail: Mixing Multi-Modality Controls for Text-Conditional Image Generation Minghui Hu et.al. 2306.00964v1 null
2023-06-01 Differential Diffusion: Giving Each Pixel Its Strength Eran Levin et.al. 2306.00950v1 link
2023-05-31 Learning Explicit Contact for Implicit Reconstruction of Hand-held Objects from Monocular Images Junxing Hu et.al. 2305.20089v1 null
2023-05-31 Understanding and Mitigating Copying in Diffusion Models Gowthami Somepalli et.al. 2305.20086v1 link
2023-05-31 Control4D: Dynamic Portrait Editing by Learning 4D GAN from 2D Diffusion-based Editor Ruizhi Shao et.al. 2305.20082v1 null
2023-05-31 Efficient Diffusion Policies for Offline Reinforcement Learning Bingyi Kang et.al. 2305.20081v1 link
2023-05-31 A Unified Conditional Framework for Diffusion-based Image Restoration Yi Zhang et.al. 2305.20049v1 link
2023-06-01 Tree-Ring Watermarks: Fingerprints for Diffusion Images that are Invisible and Robust Yuxin Wen et.al. 2305.20030v2 link
2023-05-31 Protein Design with Guided Discrete Diffusion Nate Gruver et.al. 2305.20009v1 link
2023-05-31 GANDiffFace: Controllable Generation of Synthetic Datasets for Face Recognition with Realistic Variations Pietro Melzi et.al. 2305.19962v1 null
2023-05-31 A Geometric Perspective on Diffusion Models Defang Chen et.al. 2305.19947v1 null
2023-05-30 Ambient Diffusion: Learning Clean Distributions from Corrupted Data Giannis Daras et.al. 2305.19256v1 link
2023-05-30 PanoGen: Text-Conditioned Panoramic Environment Generation for Vision-and-Language Navigation Jialu Li et.al. 2305.19195v1 null
2023-05-30 Video ControlNet: Towards Temporally Consistent Synthetic-to-Real Video Translation Using Conditional Image Diffusion Models Ernie Chu et.al. 2305.19193v1 null
2023-05-30 Calliffusion: Chinese Calligraphy Generation and Style Transfer with Diffusion Modeling Qisheng Liao et.al. 2305.19124v1 null
2023-05-30 DiffMatch: Diffusion Model for Dense Matching Jisu Nam et.al. 2305.19094v1 link
2023-05-30 Likelihood-Based Diffusion Language Models Ishaan Gulrajani et.al. 2305.18619v1 link
2023-05-29 RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths Zeyue Xue et.al. 2305.18295v1 null
2023-05-29 Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models Yuchao Gu et.al. 2305.18292v1 link
2023-05-29 Photoswap: Personalized Subject Swapping in Images Jing Gu et.al. 2305.18286v1 null
2023-05-29 Reconstructing the Mind's Eye: fMRI-to-Image with Contrastive Learning and Diffusion Priors Paul S. Scotti et.al. 2305.18274v1 link
2023-05-29 Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising Fu-Yun Wang et.al. 2305.18264v1 link
2023-05-29 GlyphControl: Glyph Conditional Control for Visual Text Generation Yukang Yang et.al. 2305.18259v1 link
2023-05-26 Improving accuracy of GPT-3/4 results on biomedical data using a retrieval-augmented language model David Soong et.al. 2305.17116v1 null
2023-05-26 ControlVideo: Adding Conditional Control for One Shot Text-to-Video Editing Min Zhao et.al. 2305.17098v1 link
2023-05-26 The reaction-diffusion basis of animated patterns in eukaryotic flagella James F. Cass et.al. 2305.17032v1 link
2023-05-26 Accelerating Diffusion Models for Inverse Problems through Shortcut Sampling Gongye Liu et.al. 2305.16965v1 link
2023-05-26 Learning to Imagine: Visually-Augmented Natural Language Generation Tianyi Tang et.al. 2305.16944v1 link
2023-05-26 DiffusionNAG: Task-guided Neural Architecture Generation with Diffusion Models Sohyun An et.al. 2305.16943v1 link
2023-05-26 CRoSS: Diffusion Model Makes Controllable, Robust and Secure Image Steganography Jiwen Yu et.al. 2305.16936v1 link
2023-05-26 Turbulence calculation based on the extended Naiver-Stokes equations Shanwen Tan et.al. 2305.16923v1 null
2023-05-25 Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models Shihao Zhao et.al. 2305.16322v1 link
2023-05-25 Eclipse: Disambiguating Illumination and Materials using Unintended Shadows Dor Verbin et.al. 2305.16321v1 null
2023-05-25 Parallel Sampling of Diffusion Models Andy Shih et.al. 2305.16317v1 link
2023-05-25 NAP: Neural 3D Articulation Prior Jiahui Lei et.al. 2305.16315v1 null
2023-05-25 UMat: Uncertainty-Aware Single Image High Resolution Material Capture Carlos Rodriguez-Pardo et.al. 2305.16312v1 null
2023-05-25 Break-A-Scene: Extracting Multiple Concepts from a Single Image Omri Avrahami et.al. 2305.16311v1 link
2023-05-25 Look Ma, No Hands! Agent-Environment Factorization of Egocentric Videos Matthew Chang et.al. 2305.16301v1 null
2023-05-25 Diversify Your Vision Datasets with Automatic Diffusion-Based Augmentation Lisa Dunlap et.al. 2305.16289v1 link
2023-05-25 CommonScenes: Generating Commonsense 3D Indoor Scenes with Scene Graphs Guangyao Zhai et.al. 2305.16283v1 link
2023-05-25 UDPM: Upsampling Diffusion Probabilistic Models Shady Abu-Hussein et.al. 2305.16269v1 link
2023-05-24 Sin3DM: Learning a Diffusion Model from a Single 3D Textured Shape Rundi Wu et.al. 2305.15399v1 link
2023-05-24 A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence Junyi Zhang et.al. 2305.15347v1 link
2023-05-24 Training on Thin Air: Improve Image Classification with Generated Data Yongchao Zhou et.al. 2305.15316v1 link
2023-05-24 MultiFusion: Fusing Pre-Trained Models for Multi-Lingual, Multi-Modal Image Generation Marco Bellagente et.al. 2305.15296v1 null
2023-05-23 Diffusion Hyperfeatures: Searching Through Time and Space for Semantic Correspondence Grace Luo et.al. 2305.14334v1 null
2023-05-23 SEEDS: Exponential SDE Solvers for Fast High-Quality Sampling from Diffusion Models Martin Gonzalez et.al. 2305.14267v1 link
2023-05-23 Improved Convergence of Score-Based Diffusion Models via Prediction-Correction Francesco Pedrotti et.al. 2305.14164v1 null
2023-05-23 Realistic Noise Synthesis with Diffusion Models Qi Wu et.al. 2305.14022v1 null
2023-05-23 Lightweight Channel Codes for ISI Mitigation in Molecular Communication between Bionanosensors Dongliang Jing et.al. 2305.14001v1 null
2023-05-23 Node-wise Diffusion for Scalable Graph Learning Keke Huang et.al. 2305.14000v1 link
2023-05-22 VDT: An Empirical Study on Video Diffusion with Transformers Haoyu Lu et.al. 2305.13311v1 link
2023-05-22 If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection Shyamgopal Karthik et.al. 2305.13308v1 link
2023-05-23 Training Diffusion Models with Reinforcement Learning Kevin Black et.al. 2305.13301v2 link
2023-05-22 DiffusionNER: Boundary Diffusion for Named Entity Recognition Yongliang Shen et.al. 2305.13298v1 link
2023-05-22 U-DiT TTS: U-Diffusion Vision Transformer for Text-to-Speech Xin Jing et.al. 2305.13195v1 null
2023-05-22 Policy Representation via Diffusion Probability Model for Reinforcement Learning Long Yang et.al. 2305.13122v1 link
2023-05-22 Energy cascade in the Garrett-Munk spectrum of internal gravity waves Yue Wu et.al. 2305.13110v1 null
2023-05-19 Chupa: Carving 3D Clothed Humans from Skinned Shape Priors using 2D Diffusion Probabilistic Models Byungjun Kim et.al. 2305.11870v1 link
2023-05-19 Any-to-Any Generation via Composable Diffusion Zineng Tang et.al. 2305.11846v1 link
2023-05-19 The probability flow ODE is provably fast Sitan Chen et.al. 2305.11798v1 null
2023-05-19 Cinematic Mindscapes: High-quality Video Reconstruction from Brain Activity Zijiao Chen et.al. 2305.11675v1 null
2023-05-19 Few-shot 3D Shape Generation Jingyuan Zhu et.al. 2305.11664v1 null
2023-05-19 Text2NeRF: Text-Driven 3D Scene Generation with Neural Radiance Fields Jingbo Zhang et.al. 2305.11588v1 link
2023-05-19 Brain Captioning: Decoding human brain activity into images and text Matteo Ferrante et.al. 2305.11560v1 null
2023-05-19 Efficient Cross-Lingual Transfer for Chinese Stable Diffusion with Images as Pivots Jinyi Hu et.al. 2305.11540v1 null
2023-05-19 Late-Constraint Diffusion Guidance for Controllable Image Synthesis Chang Liu et.al. 2305.11520v1 link
2023-05-19 DiffuSIA: A Spiral Interaction Architecture for Encoder-Decoder Text Diffusion Chao-Hong Tan et.al. 2305.11517v1 null
2023-05-18 UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild Can Qin et.al. 2305.11147v1 link
2023-05-18 Blackout Diffusion: Generative Diffusion Models in Discrete-State Spaces Javier E Santos et.al. 2305.11089v1 link
2023-05-18 Inspecting the Geographical Representativeness of Images from Text-to-Image Models Abhipsa Basu et.al. 2305.11080v1 null
2023-05-18 Unsupervised Pansharpening via Low-rank Diffusion Model Xiangyu Rui et.al. 2305.10925v1 link
2023-05-18 Structural Pruning for Diffusion Models Gongfan Fang et.al. 2305.10924v1 link
2023-05-18 VideoFactory: Swap Attention in Spatiotemporal Diffusions for Text-to-Video Generation Wenjing Wang et.al. 2305.10874v1 null
2023-05-17 FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention Guangxuan Xiao et.al. 2305.10431v1 link
2023-05-17 Raising the Bar for Certified Adversarial Robustness with Diffusion Models Thomas Altstidl et.al. 2305.10388v1 null
2023-05-17 A phase field model for droplets suspended in viscous liquids under the influence of electric fields Yuzhe Qin et.al. 2305.10296v1 null
2023-05-17 Provably Correct Physics-Informed Neural Networks Francisco Eiras et.al. 2305.10157v1 null
2023-05-18 Controllable Mind Visual Diffusion Model Bohan Zeng et.al. 2305.10135v2 link
2023-05-16 Make-An-Animation: Large-Scale Text-conditional 3D Human Motion Generation Samaneh Azadi et.al. 2305.09662v1 null
2023-05-16 FitMe: Deep Photorealistic 3D Morphable Model Avatars Alexandros Lattas et.al. 2305.09641v1 null
2023-05-16 AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation Tong Wu et.al. 2305.09515v1 link
2023-05-16 Discrete Diffusion Probabilistic Models for Symbolic Music Generation Matthias Plasser et.al. 2305.09489v1 link
2023-05-17 Multi-Level Global Context Cross Consistency Model for Semi-Supervised Ultrasound Image Segmentation with Diffusion Model Fenghe Tang et.al. 2305.09447v2 link
2023-05-16 Diffusion Dataset Generation: Towards Closing the Sim2Real Gap for Pedestrian Detection Andrew Farley et.al. 2305.09401v1 null
2023-05-17 AMD: Autoregressive Motion Diffusion Bo Han et.al. 2305.09381v2 null
2023-05-15 Laughing Matters: Introducing Laughing-Face Generation using Diffusion Models Antoni Bigata Casademunt et.al. 2305.08854v1 link
2023-05-15 Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts Yuyang Zhao et.al. 2305.08850v1 null
2023-05-15 The role of magnetic helicity when it is absent on average Axel Brandenburg et.al. 2305.08769v1 null
2023-05-15 Diffusion-weighted SPECIAL improves the detection of J-coupled metabolites at ultra-high magnetic field Jessie Mosso et.al. 2305.08708v1 null
2023-05-15 A Reproducible Extraction of Training Images from Diffusion Models Ryan Webster et.al. 2305.08694v1 link
2023-05-12 Sound waves, diffusive transport, and wall slip in nanoconfined compressible fluids Hannes Holey et.al. 2305.07501v1 null
2023-05-12 On a Voter Model with Context-Dependent Opinion Adoption Luca Becchetti et.al. 2305.07377v1 null
2023-05-12 Experimental optimization of lensless digital holographic microscopy with rotating diffuser-based coherent noise reduction Piotr Arcab et.al. 2305.07373v1 null
2023-05-12 Penguin huddling: a continuum model Samuel J. Harris et.al. 2305.07324v1 link
2023-05-15 Phosphorus-Controlled Nanoepitaxy in the Asymmetric Growth of GaAs-InP Core-Shell Bent Nanowires Spencer McDermott et.al. 2305.07252v2 null
2023-05-12 Optimal calibration of optical tweezers with arbitrary integration time and sampling frequencies -- A general framework Laura Pérez-Garcéa et.al. 2305.07245v1 null
2023-05-15 Fully quantum algorithm for lattice Boltzmann methods with application to partial differential equations Fatima Ezahra Chrit et.al. 2305.07148v2 link
2023-05-11 Exploiting Diffusion Prior for Real-World Image Super-Resolution Jianyi Wang et.al. 2305.07015v1 link
2023-05-11 A method for automated regression test in scientific computing libraries: illustration with SPHinXsys Bo Zhang et.al. 2305.06970v1 link
2023-05-11 CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model Zhen Ye et.al. 2305.06908v1 link
2023-05-11 Null-text Guidance in Diffusion Models is Secretly a Cartoon-style Creator Jing Zhao et.al. 2305.06710v1 null
2023-05-11 Evaluating Twitter's Algorithmic Amplification of Low-Trust Content: An Observational Study Giulio Corsi et.al. 2305.06125v2 link
2023-05-10 Relightify: Relightable 3D Faces from a Single Image via Diffusion Models Foivos Paraperas Papantoniou et.al. 2305.06077v1 null
2023-05-10 iEdit: Localised Text-guided Image Editing with Weak Supervision Rumeysa Bodur et.al. 2305.05947v1 null
2023-05-09 Large Language Models Humanize Technology Pratyush Kumar et.al. 2305.05576v1 null
2023-05-09 Style-A-Video: Agile Diffusion for Arbitrary Text-based Video Style Transfer Nisha Huang et.al. 2305.05464v1 link
2023-05-10 Large Language Models Need Holistically Thought in Medical Conversational QA Yixuan Weng et.al. 2305.05410v2 link
2023-05-09 The Multi-cluster Two-Wave Fading Model Juan P. Pena-Martin et.al. 2305.05342v1 null
2023-05-08 DiffuseStyleGesture: Stylized Audio-Driven Co-Speech Gesture Generation with Diffusion Models Sicheng Yang et.al. 2305.04919v1 link
2023-05-08 CaloClouds: Fast Geometry-Independent Highly-Granular Calorimeter Simulation Erik Buhmann et.al. 2305.04847v1 link
2023-05-08 A Drop of Ink may Make a Million Think: The Spread of False Information in Large Language Models Ning Bian et.al. 2305.04812v1 null
2023-05-08 Controllable Light Diffusion for Portraits David Futschik et.al. 2305.04745v1 null
2023-05-08 A Closest Point Method for Surface PDEs with Interior Boundary Conditions for Geometry Processing Nathan King et.al. 2305.04711v1 null
2023-05-08 ReGeneration Learning of Diffusion Models with Rich Prompts for Zero-Shot Image Translation Yupei Lin et.al. 2305.04651v1 null
2023-05-05 Reflection of a Diffuser in a Liquid Interface C. Silva et.al. 2305.03682v1 null
2023-05-05 Conditional Diffusion Feature Refinement for Continuous Sign Language Recognition Leming Guo et.al. 2305.03614v1 null
2023-05-05 Data Curation for Image Captioning with Text-to-Image Generative Models Wenyan Li et.al. 2305.03610v1 link
2023-05-04 Personalize Segment Anything Model with One Shot Renrui Zhang et.al. 2305.03048v1 link
2023-05-05 Capacity Bounds for Vertically-Drifted First Arrival Position Channels under a Second-Moment Constraint Yun-Feng Lo et.al. 2305.02706v2 null
2023-05-03 Nonlocal gravity wave turbulence in presence of condensate A. O. Korotkevich et.al. 2305.01930v1 null
2023-05-04 DiffFacto: Controllable Part-Based 3D Point Cloud Generation with Cross Diffusion Kiyohiro Nakayama et.al. 2305.01921v2 null
2023-05-04 The Impacts of Dimensionality, Diffusion, and Directedness on Intrinsic Cross-Model Simulation in Tile-Based Self-Assembly Daniel Hader et.al. 2305.01877v2 null
2023-05-03 Multimodal Data Augmentation for Image Captioning using Diffusion Models Changrong Xiao et.al. 2305.01855v1 link
2023-05-02 Unpaired Downscaling of Fluid Flows with Diffusion Bridges Tobias Bischoff et.al. 2305.01822v1 link
2023-05-02 Multimodal Procedural Planning via Dual Text-Image Prompting Yujie Lu et.al. 2305.01795v1 link
2023-05-02 DiffuSum: Generation Enhanced Extractive Summarization with Diffusion Haopeng Zhang et.al. 2305.01735v1 link
2023-05-02 ContactArt: Learning 3D Interaction Priors for Category-level Articulated Object and Hand Poses Estimation Zehao Zhu et.al. 2305.01618v1 null
2023-05-02 Adopting AI: How Familiarity Breeds Both Trust and Contempt Michael C. Horowitz et.al. 2305.01405v1 null
2023-05-02 Long-Term Rhythmic Video Soundtracker Jiashuo Yu et.al. 2305.01319v1 link
2023-05-02 DreamPaint: Few-Shot Inpainting of E-Commerce Items for Virtual Try-On without 3D Modeling Mehmet Saygin Seyfioglu et.al. 2305.01257v1 null
2023-05-02 Solving Inverse Problems with Score-Based Generative Priors learned from Noisy Data Asad Aali et.al. 2305.01166v1 null
2023-05-02 Geometric Latent Diffusion Models for 3D Molecule Generation Minkai Xu et.al. 2305.01140v1 link
2023-05-01 Fractional and tempered fractional models for Reynolds-averaged Navier-Stokes equations Pavan Pranjivan Mehta et.al. 2305.00770v1 null
2023-05-01 Diffusion Models for Time Series Applications: A Survey Lequan Lin et.al. 2305.00624v1 null
2023-04-30 Class-Balancing Diffusion Models Yiming Qin et.al. 2305.00562v1 link
2023-04-30 Towards Computational Architecture of Liberty: A Comprehensive Survey on Deep Learning for Generating Virtual Architecture in the Metaverse Anqi Wang et.al. 2305.00510v1 null
2023-04-28 Scaling regimes in rapidly rotating thermal convection at extreme Rayleigh numbers Jiaxing Song et.al. 2304.14854v1 null
2023-04-28 Simplified models of diffusion in radially-symmetric geometries Luke P. Filippini et.al. 2304.14632v1 link
2023-04-28 MUDiff: Unified Diffusion for Complete Molecule Generation Chenqing Hua et.al. 2304.14621v1 null
2023-04-28 Robust Gaussian Process Regression method for efficient reaction pathway optimization: application to surface processes Wei Fang et.al. 2304.14596v1 null
2023-04-28 SceneGenie: Scene Graph Guided Diffusion Models for Image Synthesis Azade Farshad et.al. 2304.14573v1 null
2023-04-27 It is all about where you start: Text-to-image generation with seed selection Dvir Samuel et.al. 2304.14530v1 link
2023-04-27 Putting People in Their Place: Affordance-Aware Human Insertion into Scenes Sumith Kulal et.al. 2304.14406v1 link
2023-04-27 Motion-Conditioned Diffusion Model for Controllable Video Synthesis Tsai-Shien Chen et.al. 2304.14404v1 null
2023-04-27 Maximizing Model Generalization for Manufacturing with Self-Supervised Learning and Federated Learning Matthew Russell et.al. 2304.14398v1 null
2023-04-27 Functional Diffusion Maps María Barroso et.al. 2304.14378v1 link
2023-04-27 LDPC Decoders Prefer More Reliable Parity Bits: Unequal Data Protection Over BSC Beyza Dabak et.al. 2304.14278v1 null
2023-04-27 DataComp: In search of the next generation of multimodal datasets Samir Yitzhak Gadre et.al. 2304.14108v1 link
2023-04-26 Heuristic Barycenter Modeling of Fully Absorbing Receivers in Diffusive Molecular Communication Channels Fardad Vakilipoor et.al. 2304.13640v1 null
2023-04-26 Identifying the structure patterns to govern the performance of localization in regulating innovation diffusion Leyang Xue et.al. 2304.13608v1 null
2023-04-26 Bifractality of fractal scale-free networks Jun Yamamoto et.al. 2304.13438v1 null
2023-04-26 Training-Free Location-Aware Text-to-Image Synthesis Jiafeng Mao et.al. 2304.13427v1 null
2023-04-25 The Score-Difference Flow for Implicit Generative Modeling Romann M. Weber et.al. 2304.12906v1 null
2023-04-25 Latent diffusion models for generative precipitation nowcasting with accurate uncertainty quantification Jussi Leinonen et.al. 2304.12891v1 link
2023-04-25 Contrastive Energy Prediction for Exact Energy-Guided Diffusion Sampling in Offline Reinforcement Learning Cheng Lu et.al. 2304.12824v1 link
2023-04-25 A Binary Annular Phase Mask to Regulate Spherical Aberration and Allow Super-Localization in Single-Particle Tracking over Extended Depth-of-Focus Quentin Gresil et.al. 2304.12774v1 null
2023-04-25 Effect of trap states, ion migration and interfaces on carrier transport in single crystal, polycrystalline and thick film devices of halide perovskites CH $_3$NH$_3$PbX$_3$ (X= I, Br, Cl) Mohd Warish et.al. 2304.12701v1 null
2023-04-24 Analyzing the neutron and $γ$ -ray emission properties of an americium-beryllium tagged neutron source Hiroshi Ito et.al. 2304.12153v1 null
2023-04-24 Efficient Halftoning via Deep Reinforcement Learning Haitian Jiang et.al. 2304.12152v1 null
2023-04-24 Variational Diffusion Auto-encoder: Deep Latent Variable Model with Unconditional Diffusion Prior Georgios Batzolis et.al. 2304.12141v1 null
2023-04-24 Customized Load Profiles Synthesis for Electricity Customers Based on Conditional Diffusion Models Zhenyi Wang et.al. 2304.12076v1 null
2023-04-24 Improving Synthetically Generated Image Detection in Cross-Concept Settings Pantelis Dogoulis et.al. 2304.12053v1 link
2023-04-21 BoDiffusion: Diffusing Sparse Observations for Full-Body Human Motion Synthesis Angela Castillo et.al. 2304.11118v1 null
2023-04-21 Improved Diffusion-based Image Colorization via Piggybacked Models Hanyuan Liu et.al. 2304.11105v1 null
2023-04-21 Perturbatively corrected ring-polymer instanton theory for accurate tunneling splittings Joseph E. Lawrence et.al. 2304.10963v1 null
2023-04-20 Farm3D: Learning Articulated 3D Animals by Distilling 2D Diffusion Tomas Jakab et.al. 2304.10535v1 null
2023-04-20 Nerfbusters: Removing Ghostly Artifacts from Casually Captured NeRFs Frederik Warburg et.al. 2304.10532v1 link
2023-04-20 Collaborative Diffusion for Multi-Modal Face Generation and Editing Ziqi Huang et.al. 2304.10530v1 link
2023-04-20 Prediction of the evolution of the nuclear reactor core parameters using artificial neural network Krzysztof Palmi et.al. 2304.10337v1 null
2023-04-20 Avoiding methane emission rate underestimates when using the divergence method Clayton Roberts et.al. 2304.10303v1 null
2023-04-20 Not Only Generative Art: Stable Diffusion for Content-Style Disentanglement in Art Analysis Yankun Wu et.al. 2304.10278v1 link
2023-04-19 Irregular dependence on Stokes number and non-ergodic transport of heavy inertial particles in steady laminar flows Anu V. S. Nath et.al. 2304.09804v1 null
2023-04-19 NeuralField-LDM: Scene Generation with Hierarchical Latent Diffusion Models Seung Wook Kim et.al. 2304.09787v1 null
2023-04-19 Signatures of heterogeneity in the statistical structure of target state aligned ensembles Nicolas Lenner et.al. 2304.09719v1 null
2023-04-18 Monte-Carlo method for incompressible fluid flows past obstacles Vladislav Cherepanov et.al. 2304.09152v1 null
2023-04-18 On the seed population of solar energetic particles in the inner heliosphere Nicolas Wijsen et.al. 2304.09098v1 null
2023-04-18 Construction of coarse-grained molecular dynamics with many-body non-Markovian memory Liyao Lyu et.al. 2304.09044v1 null
2023-04-18 Look ATME: The Discriminator Mean Entropy Needs Attention Edgardo Solano-Carrillo et.al. 2304.09024v1 link
2023-04-18 UPGPT: Universal Diffusion Model for Person Image Generation, Editing and Pose Transfer Soon Yau Cheong et.al. 2304.08870v1 link
2023-04-17 Text2Performer: Text-Driven Human Video Generation Yuming Jiang et.al. 2304.08483v1 link
2023-04-18 Latent-Shift: Latent Diffusion with Temporal Shift for Efficient Text-to-Video Generation Jie An et.al. 2304.08477v2 null
2023-04-17 Synthetic Data from Diffusion Models Improves ImageNet Classification Shekoofeh Azizi et.al. 2304.08466v1 null
2023-04-17 MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image Synthesis and Editing Mingdeng Cao et.al. 2304.08465v1 link
2023-04-17 OVTrack: Open-Vocabulary Multiple Object Tracking Siyuan Li et.al. 2304.08408v1 null
2023-04-17 Refusion: Enabling Large-Size Realistic Image Restoration with Latent-Space Diffusion Models Ziwei Luo et.al. 2304.08291v1 link
2023-04-17 Solving stiff ordinary differential equations using physics informed neural networks (PINNs): simple recipes to improve training of vanilla-PINNs Hubert Baty et.al. 2304.08289v1 link
2023-04-14 A Comparative Study on Generative Models for High Resolution Solar Observation Imaging Mehdi Cherti et.al. 2304.07169v1 link
2023-04-14 Towards Controllable Diffusion Models via Reward-Guided Exploration Hengtong Zhang et.al. 2304.07132v1 null
2023-04-14 Delta Denoising Score Amir Hertz et.al. 2304.07090v1 null
2023-04-14 Memory Efficient Diffusion Probabilistic Models via Patch-based Generation Shinei Arakawa et.al. 2304.07087v1 null
2023-04-14 DCFace: Synthetic Face Generation with Dual Condition Diffusion Model Minchul Kim et.al. 2304.07060v1 link
2023-04-14 A Diffusion model for POI recommendation Yifang Qin et.al. 2304.07041v1 link
2023-04-13 Expressive Text-to-Image Generation with Rich Text Songwei Ge et.al. 2304.06720v1 null
2023-04-13 Single-Stage Diffusion NeRF: A Unified Approach to 3D Generation and Reconstruction Hansheng Chen et.al. 2304.06714v1 link
2023-04-13 DiffusionRig: Learning Personalized Priors for Facial Appearance Editing Zheng Ding et.al. 2304.06711v1 link
2023-04-13 Learning Controllable 3D Diffusion Models from Single-view Images Jiatao Gu et.al. 2304.06700v1 null
2023-04-13 DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-Efficient Fine-Tuning Enze Xie et.al. 2304.06648v1 null
2023-04-12 Continual Diffusion: Continual Customization of Text-to-Image Diffusion with C-LoRA James Seale Smith et.al. 2304.06027v1 null
2023-04-12 DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion Johanna Karras et.al. 2304.06025v1 null
2023-04-12 Probabilistic Human Mesh Recovery in 3D Scenes from Egocentric Views Siwei Zhang et.al. 2304.06024v1 link
2023-04-12 SpectralDiff: Hyperspectral Image Classification with Spectral-Spatial Diffusion Models Ning Chen et.al. 2304.05961v1 link
2023-04-12 Diffusion models with location-scale noise Alexia Jolicoeur-Martineau et.al. 2304.05907v1 null
2023-04-12 Cancer-Net BCa-S: Breast Cancer Grade Prediction using Volumetric Deep Radiomic Features from Synthetic Correlated Diffusion Imaging Chi-en Amy Tai et.al. 2304.05899v1 link
2023-04-11 HRS-Bench: Holistic, Reliable and Scalable Benchmark for Text-to-Image Models Eslam Mohamed Bakr et.al. 2304.05390v1 link
2023-04-11 Diffusion Models for Constrained Domains Nic Fishman et.al. 2304.05364v1 link
2023-04-11 Multi-scale Fusion Fault Diagnosis Method Based on Two-Dimensionaliztion Sequence in Complex Scenarios Weiyang Jin et.al. 2304.05198v1 null
2023-04-10 A Cheaper and Better Diffusion Language Model with Soft-Masked Noise Jiaao Chen et.al. 2304.04746v1 link
2023-04-10 Ambiguous Medical Image Segmentation using Diffusion Models Aimon Rahman et.al. 2304.04745v1 link
2023-04-10 Sequential Recommendation with Diffusion Models Hanwen Du et.al. 2304.04541v1 null
2023-04-07 Compressed Regression over Adaptive Networks Marco Carpentiero et.al. 2304.03638v1 null
2023-04-07 Exploring Collaborative Distributed Diffusion-Based AI-Generated Content (AIGC) in Wireless Networks Hongyang Du et.al. 2304.03446v1 link
2023-04-06 RoSteALS: Robust Steganography using Autoencoder Latent Space Tu Bui et.al. 2304.03400v1 link
2023-04-06 Diffusion Models as Masked Autoencoders Chen Wei et.al. 2304.03283v1 null
2023-04-06 Inst-Inpaint: Instructing to Remove Objects with Diffusion Models Ahmet Burak Yildirim et.al. 2304.03246v1 link
2023-04-06 Face Animation with an Attribute-Guided Diffusion Model Bohan Zeng et.al. 2304.03199v1 link
2023-04-06 SketchFFusion: Sketch-guided image editing with diffusion model Weihang Mao et.al. 2304.03174v1 null
2023-04-05 Taming Encoder for Zero Fine-tuning Image Customization with Text-to-Image Diffusion Models Xuhui Jia et.al. 2304.02642v1 null
2023-04-05 GenPhys: From Physical Processes to Generative Models Ziming Liu et.al. 2304.02637v1 null
2023-04-05 An atlas of the heterogeneous viscoelastic brain with local power-law attenuation synthesised using Prony-series Oisin Morrison et.al. 2304.02610v1 null
2023-04-05 Generative Novel View Synthesis with 3D-Aware Diffusion Models Eric R. Chan et.al. 2304.02602v1 null
2023-04-05 Diffusion across a concentration step: Strongly nonmonotonic evolution into thermodynamic equilibrium Hans R. Moser et.al. 2304.02557v1 null
2023-04-04 viz2viz: Prompt-driven stylized visualization generation using a diffusion model Jiaqi Wu et.al. 2304.01919v1 null
2023-04-04 PODIA-3D: Domain Adaptation of 3D Generative Model Across Large Domain Gap Using Pose-Preserved Text-to-Image Diffusion Gwanghyun Kim et.al. 2304.01900v1 null
2023-04-04 Trace and Pace: Controllable Pedestrian Animation via Guided Trajectory Diffusion Davis Rempe et.al. 2304.01893v1 null
2023-04-04 Quantitative perfusion and water transport time model from multi b-value diffusion magnetic resonance imaging validated against neutron capture microspheres M. Liu et.al. 2304.01888v1 null
2023-04-04 Adaptive learning of effective dynamics: Adaptive real-time, online modeling for complex systems Ivica Kičić et.al. 2304.01732v1 link
2023-04-03 Learning to Read Braille: Bridging the Tactile Reality Gap with Diffusion Models Carolina Higuera et.al. 2304.01182v1 link
2023-04-03 ReMoDiffuse: Retrieval-Augmented Motion Diffusion Model Mingyuan Zhang et.al. 2304.01116v1 link
2023-04-03 ViT-DAE: Transformer-driven Diffusion Autoencoder for Histopathology Image Analysis Xuan Xu et.al. 2304.01053v1 null
2023-04-03 DreamAvatar: Text-and-Shape Guided 3D Human Avatar Generation via Diffusion Models Yukang Cao et.al. 2304.00916v1 link
2023-03-31 $\infty$ -Diff: Infinite Resolution Diffusion with Subsampled Mollified States Sam Bond-Taylor et.al. 2303.18242v1 link
2023-03-31 A Closer Look at Parameter-Efficient Tuning in Diffusion Models Chendong Xiang et.al. 2303.18181v1 link
2023-03-31 One-shot Unsupervised Domain Adaptation with Personalized Diffusion Models Yasser Benigmim et.al. 2303.18080v1 link
2023-03-30 AvatarCraft: Transforming Text into Neural Human Avatars with Parameterized Shape and Pose Control Ruixiang Jiang et.al. 2303.17606v1 link
2023-03-30 Token Merging for Fast Stable Diffusion Daniel Bolya et.al. 2303.17604v1 link
2023-03-30 Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models Wen Wang et.al. 2303.17599v1 link
2023-03-30 Consistent View Synthesis with Pose-Guided Diffusion Models Hung-Yu Tseng et.al. 2303.17598v1 null
2023-03-30 Forget-Me-Not: Learning to Forget in Text-to-Image Diffusion Models Eric Zhang et.al. 2303.17591v1 link
2023-03-30 DDP: Diffusion Model for Dense Visual Prediction Yuanfeng Ji et.al. 2303.17559v1 link
2023-03-30 DAE-Talker: High Fidelity Speech-Driven Talking Face Generation with Diffusion Autoencoder Chenpng Du et.al. 2303.17550v1 null
2023-03-30 PAIR-Diffusion: Object-Level Image Editing with Structure-and-Appearance Paired Diffusion Models Vidit Goel et.al. 2303.17546v1 link
2023-03-29 Physics-Driven Diffusion Models for Impact Sound Synthesis from Videos Kun Su et.al. 2303.16897v1 null
2023-03-30 MDP: A Generalized Framework for Text-Guided Image Editing by Manipulating the Diffusion Path Qian Wang et.al. 2303.16765v2 link
2023-03-29 4D Facial Expression Diffusion Model Kaifeng Zou et.al. 2303.16611v1 link
2023-03-29 WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models Konstantina Nikolaidou et.al. 2303.16576v1 link
2023-03-29 Your Diffusion Model is Secretly a Zero-Shot Classifier Alexander C. Li et.al. 2303.16203v2 link
2023-03-28 Visual Chain-of-Thought Diffusion Models William Harvey et.al. 2303.16187v1 link
2023-03-28 Diffusion Maps for Group-Invariant Manifolds Paulina Hoyos et.al. 2303.16169v1 null
2023-03-28 Novel View Synthesis of Humans using Differentiable Rendering Guillaume Rochette et.al. 2303.15880v1 link
2023-03-27 The Stable Signature: Rooting Watermarks in Latent Diffusion Models Pierre Fernandez et.al. 2303.15435v1 link
2023-03-27 Anti-DreamBooth: Protecting users from personalized text-to-image synthesis Thanh Van Le et.al. 2303.15433v1 link
2023-03-27 Debiasing Scores and Prompts of 2D Diffusion for Robust Text-to-3D Generation Susung Hong et.al. 2303.15413v1 link
2023-03-27 Training-free Style Transfer Emerges from h-space in Diffusion models Jaeseok Jeong et.al. 2303.15403v1 null
2023-03-27 Exploring Continual Learning of Diffusion Models Michał Zając et.al. 2303.15342v1 null
2023-03-27 Diffusion Models for Memory-efficient Processing of 3D Medical Images Florentin Bieder et.al. 2303.15288v1 link
2023-03-27 Text-to-Image Diffusion Models are Zero-Shot Classifiers Kevin Clark et.al. 2303.15233v1 null
2023-03-24 Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior Junshu Tang et.al. 2303.14184v1 link
2023-03-24 MindDiffuser: Controlled Image Reconstruction from Human Brain Activity with Semantic and Structural Diffusion Yizhuo Lu et.al. 2303.14139v1 null
2023-03-24 CIFAKE: Image Classification and Explainable Identification of AI-Generated Synthetic Images Jordan J. Bird et.al. 2303.14126v1 null
2023-03-24 Electron transport measurements in liquid xenon with Xenoscope, a large-scale DARWIN demonstrator L. Baudis et.al. 2303.13963v1 null
2023-03-23 Ablating Concepts in Text-to-Image Diffusion Models Nupur Kumari et.al. 2303.13516v1 link
2023-03-23 ReVersion: Diffusion-Based Relation Inversion from Images Ziqi Huang et.al. 2303.13495v1 link
2023-03-23 Scaling laws of two-dimensional incompressible turbulent transport D. I. Palade et.al. 2303.13457v1 null
2023-03-23 Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators Levon Khachatryan et.al. 2303.13439v1 link
2023-03-23 Medical diffusion on a budget: textual inversion for medical image generation Bram de Wilde et.al. 2303.13430v1 null
2023-03-23 DDT: A Diffusion-Driven Transformer-based Framework for Human Mesh Recovery from a Video Ce Zheng et.al. 2303.13397v1 null
2023-03-23 Audio Diffusion Model for Speech Synthesis: A Survey on Text To Speech and Speech Enhancement in Generative AI Chenshuang Zhang et.al. 2303.13336v1 null
2023-03-23 Decentralized Adversarial Training over Graphs Ying Cao et.al. 2303.13326v1 null
2023-03-23 Fourier Diffusion Models: A Method to Control MTF and NPS in Score-Based Stochastic Image Generation Matthew Tivnan et.al. 2303.13285v1 null
2023-03-22 Diffuse-Denoise-Count: Accurate Crowd-Counting with Diffusion Models Yasiru Ranasinghe et.al. 2303.12790v1 link
2023-03-22 Instruct-NeRF2NeRF: Editing 3D Scenes with Instructions Ayaan Haque et.al. 2303.12789v1 null
2023-03-22 FeatureNeRF: Learning Generalizable NeRFs by Distilling Foundation Models Jianglong Ye et.al. 2303.12786v1 null
2023-03-22 Effect of gamma radiation on electrical properties of diffusive memristor devices D. P. Pattnaik et.al. 2303.12762v1 null
2023-03-22 Pix2Video: Video Editing using Image Diffusion Duygu Ceylan et.al. 2303.12688v1 link
2023-03-23 Feature-Conditioned Cascaded Video Diffusion Models for Precise Echocardiogram Synthesis Hadrien Reynaud et.al. 2303.12644v2 link
2023-03-22 A Perceptual Quality Assessment Exploration for AIGC Images Zicheng Zhang et.al. 2303.12618v1 null
2023-03-21 Vox-E: Text-guided Voxel Editing of 3D Objects Etai Sella et.al. 2303.12048v1 link
2023-03-21 Semantic Latent Space Regression of Diffusion Autoencoders for Vertebral Fracture Grading Matthias Keicher et.al. 2303.12031v1 null
2023-03-21 Numerical simulation of self-oscillating catalytic reaction in a plug-flow reactor N. V. Peskov et.al. 2303.12022v1 null
2023-03-21 3D-CLFusion: Fast Text-to-3D Rendering with Contrastive Latent Diffusion Yu-Jhe Li et.al. 2303.11938v1 null
2023-03-21 CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion Geonmo Gu et.al. 2303.11916v1 link
2023-03-21 Projections of Model Spaces for Latent Graph Inference Haitz Sáez de Ocáriz Borde et.al. 2303.11754v1 null
2023-03-20 Zero-1-to-3: Zero-shot One Image to 3D Object Ruoshi Liu et.al. 2303.11328v1 link
2023-03-20 Localizing Object-level Shape Variations with Text-to-Image Diffusion Models Or Patashnik et.al. 2303.11306v1 null
2023-03-20 SVDiff: Compact Parameter Space for Diffusion Fine-Tuning Ligong Han et.al. 2303.11305v1 link
2023-03-20 AnimeDiffusion: Anime Face Line Drawing Colorization via Diffusion Models Yu Cao et.al. 2303.11137v1 link
2023-03-17 A Recipe for Watermarking Diffusion Models Yunqing Zhao et.al. 2303.10137v1 link
2023-03-17 Data-Centric Learning from Unlabeled Graphs with Diffusion Model Gang Liu et.al. 2303.10108v1 link
2023-03-17 DialogPaint: A Dialog-based Image Editing Model Jingxuan Wei et.al. 2303.10073v1 null
2023-03-17 GlueGen: Plug and Play Multi-modal Encoders for X-to-image Generation Can Qin et.al. 2303.10056v1 link
2023-03-17 On the momentum diffusion over multiphase surfaces with meshless methods Johannes C. Joubert et.al. 2303.09978v1 null
2023-03-17 Adversarial Counterfactual Visual Explanations Guillaume Jeanneret et.al. 2303.09962v1 link
2023-03-17 Discovering mesoscopic descriptions of collective movement with neural stochastic modelling Utkarsh Pratiush et.al. 2303.09906v1 link
2023-03-16 Efficient Diffusion Training via Min-SNR Weighting Strategy Tiankai Hang et.al. 2303.09556v1 link
2023-03-16 Diffusion-HPC: Generating Synthetic Images with Realistic Humans Zhenzhen Weng et.al. 2303.09541v1 link
2023-03-17 FateZero: Fusing Attentions for Zero-shot Text-based Video Editing Chenyang Qi et.al. 2303.09535v2 link
2023-03-16 $P+$ : Extended Textual Conditioning in Text-to-Image Generation Andrey Voynov et.al. 2303.09522v1 null
2023-03-16 DiffIR: Efficient Diffusion Model for Image Restoration Bin Xia et.al. 2303.09472v1 link
2023-03-16 Unwrapping NPT simulations to calculate diffusion coefficients Jakob Tómas Bullerjahn et.al. 2303.09418v1 null
2023-03-17 DINAR: Diffusion Inpainting of Neural Textures for One-Shot Human Avatars David Svitov et.al. 2303.09375v2 link
2023-03-15 Stochastic Interpolants: A Unifying Framework for Flows and Diffusions Michael S. Albergo et.al. 2303.08797v1 null
2023-03-15 Highly Personalized Text Embedding for Image Manipulation by Stable Diffusion Inhwa Han et.al. 2303.08767v1 null
2023-03-15 Advanced Analysis of Radar Cross-Section Measurements in Reverberation Environment Corentin Charlo et.al. 2303.08751v1 null
2023-03-15 DiffusionAD: Denoising Diffusion for Anomaly Detection Hui Zhang et.al. 2303.08730v1 link
2023-03-16 ResDiff: Combining CNN and Diffusion Model for Image Super-Resolution Shuyao Shang et.al. 2303.08714v2 null
2023-03-15 Zero-Shot Contrastive Loss for Text-Guided Diffusion Image Style Transfer Serin Yang et.al. 2303.08622v1 link
2023-03-14 LayoutDM: Discrete Diffusion Model for Controllable Layout Generation Naoto Inoue et.al. 2303.08137v1 link
2023-03-14 MeshDiffusion: Score-based Generative 3D Mesh Modeling Zhen Liu et.al. 2303.08133v1 link
2023-03-14 Editing Implicit Assumptions in Text-to-Image Diffusion Models Hadas Orgad et.al. 2303.08084v1 link
2023-03-15 Interpretable ODE-style Generative Diffusion Model via Force Field Construction Weiyang Jin et.al. 2303.08063v2 null
2023-03-14 Edit-A-Video: Single Video Editing with Object-Aware Consistency Chaehun Shin et.al. 2303.07945v1 null
2023-03-15 Controllable Mesh Generation Through Sparse Latent Point Diffusion Models Zhaoyang Lyu et.al. 2303.07938v2 null
2023-03-15 Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D Generation Junyoung Seo et.al. 2303.07937v2 link
2023-03-13 Erasing Concepts from Diffusion Models Rohit Gandikota et.al. 2303.07345v1 link
2023-03-14 Parallel Vertex Diffusion for Unified Visual Grounding Zesen Cheng et.al. 2303.07216v2 null
2023-03-10 GECCO: Geometrically-Conditioned Point Diffusion Models Michał J. Tyszkiewicz et.al. 2303.05916v1 null
2023-03-10 Photon Diffusion in Microscale Solids Avijit Das et.al. 2303.05776v1 null
2023-03-10 TrojDiff: Trojan Attacks on Diffusion Models with Diverse Targets Weixin Chen et.al. 2303.05762v1 link
2023-03-10 Fast Diffusion Sampler for Inverse Problems by Geometric Decomposition Hyungjin Chung et.al. 2303.05754v1 link
2023-03-09 Scaling up GANs for Text-to-Image Synthesis Minguk Kang et.al. 2303.05511v1 null
2023-03-09 Resolving quantitative MRI model degeneracy with machine learning via training data distribution design Michele Guerreri et.al. 2303.05464v1 null
2023-03-09 3DGen: Triplane Latent Diffusion for Textured Mesh Generation Anchit Gupta et.al. 2303.05371v1 null
2023-03-09 TGDataset: a Collection of Over One Hundred Thousand Telegram Channels Massimo La Morgia et.al. 2303.05345v1 link
2023-03-09 Brain-Diffuser: Natural scene reconstruction from fMRI signals using generative latent diffusion Furkan Ozcelik et.al. 2303.05334v1 link
2023-03-08 Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models Jiarui Xu et.al. 2303.04803v1 link
2023-03-08 Multilevel Diffusion: Infinite Dimensional Score-Based Diffusion Models for Image Generation Paul Hagemann et.al. 2303.04772v1 link
2023-03-08 Video-P2P: Video Editing with Cross-attention Control Shaoteng Liu et.al. 2303.04761v1 null
2023-03-08 Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models Chenfei Wu et.al. 2303.04671v1 link
2023-03-08 Diffusing Gaussian Mixtures for Generating Categorical Data Florence Regol et.al. 2303.04635v1 link
2023-03-08 Connecting finite-time Lyapunov exponents with supersaturation and droplet dynamics in the bulk of a turbulent cloud Vladyslav Pushenko et.al. 2303.04632v1 null
2023-03-08 Maritime transportation and people mobility in the early diffusion of COVID-19 in Croatia Corentin Cot et.al. 2303.04617v1 null
2023-03-07 Diffusion Policy: Visuomotor Policy Learning via Action Diffusion Cheng Chi et.al. 2303.04137v1 null
2023-03-06 Restoration-Degradation Beyond Linear Diffusions: A Non-Asymptotic Analysis For DDIM-Type Samplers Sitan Chen et.al. 2303.03384v1 null
2023-03-06 StyO: Stylize Your Face in Only One-Shot Bonan Li et.al. 2303.03231v1 null
2023-03-03 Unleashing Text-to-Image Diffusion Models for Visual Perception Wenliang Zhao et.al. 2303.02153v1 link
2023-03-03 Multi-Agent Adversarial Training Using Diffusion Learning Ying Cao et.al. 2303.01936v1 null
2023-03-03 CONTAIN: A Community-based Algorithm for Network Immunization Özgur Coban et.al. 2303.01934v1 link
2023-03-02 Consistency Models Yang Song et.al. 2303.01469v1 link
2023-03-02 Human Motion Diffusion as a Generative Prior Yonatan Shafir et.al. 2303.01418v1 link
2023-03-02 Why (and When) does Local SGD Generalize Better than SGD? Xinran Gu et.al. 2303.01215v1 link
2023-03-01 StraIT: Non-autoregressive Generation with Stratified Image Transformer Shengju Qian et.al. 2303.00750v1 null
2023-03-01 Diffusing Graph Attention Daniel Glickman et.al. 2303.00613v1 null
2023-03-01 Level Up the Deepfake Detection: a Method to Effectively Discriminate Images Generated by GAN Architectures and Diffusion Models Luca Guarnera et.al. 2303.00608v1 null
2023-03-01 Unlimited-Size Diffusion Restoration Yinhuai Wang et.al. 2303.00354v1 link
2023-03-01 Collage Diffusion Vishnu Sarukkai et.al. 2303.00262v1 null
2023-03-01 Diffusion Probabilistic Fields Peiye Zhuang et.al. 2303.00165v1 null
2023-02-28 Phase Field Modeling of Dictyostelium Discoideum Chemotaxis Yunsong Zhang et.al. 2302.14854v1 null
2023-02-28 Monocular Depth Estimation using Diffusion Models Saurabh Saxena et.al. 2302.14816v1 null
2023-02-28 Dissolving Is Amplifying: Towards Fine-Grained Anomaly Detection Jian Shi et.al. 2302.14696v1 link
2023-02-28 Synthesizing Mixed-type Electronic Health Records using Diffusion Models Taha Ceritli et.al. 2302.14679v1 null
2023-02-28 Detecting and Optimising Team Interactions in Software Development Christian Zingg et.al. 2302.14609v1 null
2023-02-28 Can We Use Diffusion Probabilistic Models for 3D Motion Prediction? Hyemin Ahn et.al. 2302.14503v1 null
2023-02-27 Buoyancy-driven attraction of active droplets Yibo Chen et.al. 2302.14008v1 null
2023-02-27 Impact of reconstruction schemes on interpreting lattice Boltzmann results -- A study using the Taylor-Green vortex problem Jianping Meng et.al. 2302.13910v1 null
2023-02-27 Differentially Private Diffusion Models Generate Useful Synthetic Images Sahra Ghalebikesabi et.al. 2302.13861v1 null
2023-02-27 Denoising Diffusion Samplers Francisco Vargas et.al. 2302.13834v1 null
2023-02-24 Modulating Pretrained Diffusion Models for Multimodal Image Synthesis Cusuh Ham et.al. 2302.12764v1 null
2023-02-24 Physical interactions promote Turing patterns Lucas Menou et.al. 2302.12521v1 null
2023-02-24 Flow instability and momentum exchange in separation control by a synthetic jet Yoshiaki Abe et.al. 2302.12496v1 null
2023-02-24 Unsupervised Discovery of Semantic Latent Directions in Diffusion Models Yong-Hyun Park et.al. 2302.12469v1 null
2023-02-23 To the Noise and Back: Diffusion for Shared Autonomy Takuma Yoneda et.al. 2302.12244v1 null
2023-02-23 DiffusioNeRF: Regularizing Neural Radiance Fields with Denoising Diffusion Models Jamie Wynn et.al. 2302.12231v1 link
2023-02-23 Designing an Encoder for Fast Personalization of Text-to-Image Models Rinon Gal et.al. 2302.12228v1 null
2023-02-23 Metric-oriented Speech Enhancement using Diffusion Probabilistic Model Chen Chen et.al. 2302.11989v1 null
2023-02-22 Uncovering Bias in Face Generation Models Cristian Muñoz et.al. 2302.11562v1 null
2023-02-22 Reduce, Reuse, Recycle: Compositional Generation with Energy-Based Diffusion Models and MCMC Yilun Du et.al. 2302.11552v1 link
2023-02-22 Scaling Robot Learning with Semantically Imagined Experience Tianhe Yu et.al. 2302.11550v1 null
2023-02-22 Aligned Diffusion Schrödinger Bridges Vignesh Ram Somnath et.al. 2302.11419v1 link
2023-02-22 Entity-Level Text-Guided Image Manipulation Yikai Wang et.al. 2302.11383v1 link
2023-02-22 An agent-based model of the 2020 international policy diffusion in response to the COVID-19 pandemic with particle filter Yannick Oswald et.al. 2302.11277v1 link
2023-02-21 Provable Copyright Protection for Generative Models Nikhil Vyas et.al. 2302.10870v1 null
2023-02-21 Learning 3D Photography Videos via Self-supervised Diffusion on Single Images Xiaodong Wang et.al. 2302.10781v1 null
2023-02-21 On Calibrating Diffusion Probabilistic Models Tianyu Pang et.al. 2302.10688v1 link
2023-02-21 $PC^2$ : Projection-Conditioned Point Cloud Diffusion for Single-Image 3D Reconstruction Luke Melas-Kyriazi et.al. 2302.10668v1 link
2023-02-21 RealFusion: 360° Reconstruction of Any Object from a Single Image Luke Melas-Kyriazi et.al. 2302.10663v1 null
2023-02-21 Diffusion Models and Semi-Supervised Learners Benefit Mutually with Few Labels Zebin You et.al. 2302.10586v1 link
2023-02-20 Towards Universal Fake Image Detectors that Generalize Across Generative Models Utkarsh Ojha et.al. 2302.10174v1 link
2023-02-20 Cross-domain Compositing with Pretrained Diffusion Models Roy Hachnochi et.al. 2302.10167v1 link
2023-02-20 NerfDiff: Single-image View Synthesis with NeRF-guided Distillation from 3D-aware Diffusion Jiatao Gu et.al. 2302.10109v1 null
2023-02-20 DINOISER: Diffused Conditional Sequence Learning by Manipulating Noises Jiasheng Ye et.al. 2302.10025v1 link
2023-02-17 Consistent Diffusion Models: Mitigating Sampling Drift by Learning to be Consistent Giannis Daras et.al. 2302.09057v1 link
2023-02-17 MiDi: Mixed Graph and 3D Denoising Diffusion for Molecule Generation Clement Vignac et.al. 2302.09048v1 link
2023-02-17 LDFA: Latent Diffusion Face Anonymization for Self-driving Applications Marvin Klemp et.al. 2302.08931v1 null
2023-02-17 Multi-unit Auction over a Social Network Yuan Fang et.al. 2302.08924v1 null
2023-02-17 Unraveling the Variations of the Society of England and Wales through Diffusion Maps Analysis on Census 2011 Gezhi Xiu et.al. 2302.08701v1 null
2023-02-16 Text-driven Visual Synthesis with Latent Diffusion Prior Ting-Hsuan Liao et.al. 2302.08510v1 null
2023-02-16 T2I-Adapter: Learning Adapters to Dig out More Controllable Ability for Text-to-Image Diffusion Models Chong Mou et.al. 2302.08453v1 link
2023-02-16 Explicit Diffusion of Gaussian Mixture Model Based Image Priors Martin Zach et.al. 2302.08411v1 null
2023-02-16 Boundary Guided Mixing Trajectory for Semantic Control with Diffusion Models Ye Zhu et.al. 2302.08357v1 link
2023-02-15 Dataset Interfaces: Diagnosing Model Failures Using Controllable Counterfactual Generation Joshua Vendrow et.al. 2302.07865v1 link
2023-02-15 Denoising Diffusion Probabilistic Models for Robust Image Super-Resolution in the Wild Hshmat Sahak et.al. 2302.07864v1 null
2023-02-15 Data Forensics in Diffusion Models: A Systematic Analysis of Membership Privacy Derui Zhu et.al. 2302.07801v1 null
2023-02-15 Video Probabilistic Diffusion Models in Projected Latent Space Sihyun Yu et.al. 2302.07685v1 null
2023-02-14 Where to Diffuse, How to Diffuse, and How to Get Back: Automated Learning for Multivariate Diffusions Raghav Singhal et.al. 2302.07261v1 null
2023-02-14 Score Approximation, Estimation and Distribution Recovery of Diffusion Models on Low-Dimensional Data Minshuo Chen et.al. 2302.07194v1 null
2023-02-14 Universal Guidance for Diffusion Models Arpit Bansal et.al. 2302.07121v1 link
2023-02-14 Differential privacy diffusion auction of homogeneous items Fengjuan Jia et.al. 2302.07072v1 null
2023-02-14 Direct numerical simulations of the Taylor-Green Vortex interacting with a hydrogen diffusion flame: Reynolds number and non-unity Lewis number effects Yifan Xu et.al. 2302.07006v1 null
2023-02-13 Raising the Cost of Malicious AI-Powered Image Editing Hadi Salman et.al. 2302.06588v1 link
2023-02-13 Preconditioned Score-based Generative Models Li Zhang et.al. 2302.06504v1 link
2023-02-13 Technical Note: PDE-constrained Optimization Formulation for Tumor Growth Model Calibration Baoshan Liang et.al. 2302.06445v1 null
2023-02-13 ContrasInver: Voxel-wise Contrastive Semi-supervised Learning for Seismic Inversion Yimin Dou et.al. 2302.06441v1 null
2023-02-13 Interplay between advective, diffusive, and active barriers in Rayleigh-Bénard flow Nikolas Aksamit et.al. 2302.06319v1 null
2023-02-10 Example-Based Sampling with Diffusion Models Bastien Doignies et.al. 2302.05116v1 null
2023-02-09 UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models Wenliang Zhao et.al. 2302.04867v1 link
2023-02-09 RelightableHands: Efficient Neural Relighting of Articulated Hand Models Shun Iwase et.al. 2302.04866v1 null
2023-02-09 Is This Loss Informative? Speeding Up Textual Inversion with Deterministic Objective Evaluation Anton Voronov et.al. 2302.04841v1 link
2023-02-09 Better Diffusion Models Further Improve Adversarial Training Zekai Wang et.al. 2302.04638v1 link
2023-02-09 Adversarial Example Does Good: Preventing Painting Imitation from Diffusion Models via Adversarial Examples Chumeng Liang et.al. 2302.04578v1 link
2023-02-08 PFGM++: Unlocking the Potential of Physics-Inspired Generative Models Yilun Xu et.al. 2302.04265v1 link
2023-02-08 GLAZE: Protecting Artists from Style Mimicry by Text-to-Image Models Shawn Shan et.al. 2302.04222v1 null
2023-02-08 Policy Evaluation in Decentralized POMDPs with Belief Sharing Mert Kayaalp et.al. 2302.04151v1 link
2023-02-08 Dimensional lattice Boltzmann method for transport phenomena simulation without conversion to lattice units Ivan Talão Martins et.al. 2302.04120v1 null
2023-02-07 Long Horizon Temperature Scaling Andy Shih et.al. 2302.03686v1 link
2023-02-07 Hard Prompts Made Easy: Gradient-Based Discrete Optimization for Prompt Tuning and Discovery Yuxin Wen et.al. 2302.03668v1 link
2023-02-07 HumanMAC: Masked Motion Completion for Human Motion Prediction Ling-Hao Chen et.al. 2302.03665v1 link
2023-02-07 Graph Generation with Destination-Driven Diffusion Mixture Jaehyeong Jo et.al. 2302.03596v1 link
2023-02-06 Zero-shot Image-to-Image Translation Gaurav Parmar et.al. 2302.03027v1 link
2023-02-06 Structure and Content-Guided Video Synthesis with Diffusion Models Patrick Esser et.al. 2302.03011v1 null
2023-02-03 AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners Zhixuan Liang et.al. 2302.01877v1 link
2023-02-03 TEXTure: Text-Guided Texturing of 3D Shapes Elad Richardson et.al. 2302.01721v1 link
2023-02-03 Learning End-to-End Channel Coding with Diffusion Models Muah Kim et.al. 2302.01714v1 null
2023-02-03 A Lipschitz Bandits Approach for Continuous Hyperparameter Optimization Yasong Feng et.al. 2302.01539v1 null
2023-02-02 Dreamix: Video Diffusion Models are General Video Editors Eyal Molad et.al. 2302.01329v1 null
2023-02-02 Are Diffusion Models Vulnerable to Membership Inference Attacks? Jinhao Duan et.al. 2302.01316v1 link
2023-02-01 Stable Target Field for Reduced Variance Score Estimation in Diffusion Models Yilun Xu et.al. 2302.00670v1 link
2023-01-31 Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models Hila Chefer et.al. 2301.13826v1 link
2023-01-30 Extracting Training Data from Diffusion Models Nicholas Carlini et.al. 2301.13188v1 null
2023-01-30 Shape-aware Text-driven Layered Video Editing Yao-Chih Lee et.al. 2301.13173v1 null
2023-01-30 GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis Ming Tao et.al. 2301.12959v1 link
2023-01-30 ERA-Solver: Error-Robust Adams Solver for Fast Sampling of Diffusion Probabilistic Models Shengmeng Li et.al. 2301.12935v1 null
2023-01-30 PromptMix: Text-to-image diffusion models enhance the performance of lightweight networks Arian Bakhtiarnia et.al. 2301.12914v1 null
2023-01-27 Moûsai: Text-to-Music Generation with Long-Context Latent Diffusion Flavio Schneider et.al. 2301.11757v1 link
2023-01-27 Diffusion Models as Artists: Are we Closing the Gap between Humans and Machines? Victor Boutin et.al. 2301.11722v1 link
2023-01-26 simple diffusion: End-to-end diffusion for high resolution images Emiel Hoogeboom et.al. 2301.11093v1 null
2023-01-26 On the Importance of Noise Scheduling for Diffusion Models Ting Chen et.al. 2301.10972v1 null
2023-01-25 Imitating Human Behaviour with Diffusion Models Tim Pearce et.al. 2301.10677v1 link
2023-01-24 Bipartite Graph Diffusion Model for Human Interaction Generation Baptiste Chopin et.al. 2301.10134v1 link
2023-01-24 DiffMotion: Speech-Driven Gesture Synthesis Using Denoising Diffusion Model Fan Zhang et.al. 2301.10047v1 link
2023-01-24 Membership Inference of Diffusion Models Hailong Hu et.al. 2301.09956v1 link
2023-01-23 LEGO-Net: Learning Regular Rearrangements of Objects in Rooms Qiuhong Anna Wei et.al. 2301.09629v1 null
2023-01-23 Evaluation of Light Collection from Highly Scattering Media using Wavelength-Shifting Fibers Andrew Wilhelm et.al. 2301.09608v1 null
2023-01-23 StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis Axel Sauer et.al. 2301.09515v1 link
2023-01-23 DIFFormer: Scalable (Graph) Transformers Induced by Energy Constrained Diffusion Qitian Wu et.al. 2301.09474v1 link
2023-01-19 Dif-Fusion: Towards High Color Fidelity in Infrared and Visible Image Fusion with Diffusion Models Jun Yue et.al. 2301.08072v1 null
2023-01-18 Targeted Image Reconstruction by Sampling Pre-trained Diffusion Model Jiageng Zheng et.al. 2301.07557v1 null
2023-01-17 GLIGEN: Open-Set Grounded Text-to-Image Generation Yuheng Li et.al. 2301.07093v1 link
2023-01-13 In BLOOM: Creativity and Affinity in Artificial Lyrics and Art Evan Crothers et.al. 2301.05402v1 link
2023-01-12 Guiding Text-to-Image Diffusion Model Towards Grounded Generation Ziyi Li et.al. 2301.05221v1 null
2023-01-12 Thompson Sampling with Diffusion Generative Prior Yu-Guan Hsieh et.al. 2301.05182v1 null

(back to top)

sketch

Publish Date Title Authors PDF Code
2024-04-02 Sketch3D: Style-Consistent Guidance for Sketch-to-3D Generation Wangguandong Zheng et.al. 2404.01843v1 null
2024-04-02 FashionEngine: Interactive Generation and Editing of 3D Clothed Humans Tao Hu et.al. 2404.01655v1 null
2024-04-01 Categorical semiotics: Foundations for Knowledge Integration Carlos Leandro et.al. 2404.01526v1 null
2024-04-01 Can Biases in ImageNet Models Explain Generalization? Paul Gavrikov et.al. 2404.01509v1 link
2024-04-02 GDA: Generalized Diffusion for Robust Test-time Adaptation Yun-Yun Tsai et.al. 2404.00095v2 null
2024-03-29 Optimal Communication for Classic Functions in the Coordinator Model and Beyond Hossein Esfandiari et.al. 2403.20307v1 null
2024-03-29 Sketch-to-Architecture: Generative AI-aided Architectural Design Pengzhi Li et.al. 2403.20186v1 null
2024-03-28 Dealing with Missing Modalities in Multimodal Recommendation: a Feature Propagation-based Approach Daniele Malitesta et.al. 2403.19841v1 null
2024-03-28 TASR: A Novel Trust-Aware Stackelberg Routing Algorithm to Mitigate Traffic Congestion Doris E. M. Brown et.al. 2403.19831v1 null
2024-03-26 Neural Attributed Community Search at Billion Scale Jianwei Wang et.al. 2403.18874v1 null
2024-03-27 A Path Towards Legal Autonomy: An interoperable and explainable approach to extracting, transforming, loading and computing legal information using large language models, expert systems and Bayesian networks Axel Constant et.al. 2403.18537v1 null
2024-03-27 U-Sketch: An Efficient Approach for Sketch to Image Diffusion Models Ilias Mitsouras et.al. 2403.18425v1 null
2024-03-27 ECNet: Effective Controllable Text-to-Image Diffusion Models Sicheng Li et.al. 2403.18417v1 null
2024-03-26 Search and Society: Reimagining Information Access for Radical Futures Bhaskar Mitra et.al. 2403.17901v1 null
2024-03-26 ExpressEdit: Video Editing with Natural Language and Sketching Bekzat Tilekbay et.al. 2403.17693v1 null
2024-03-26 Equipping Sketch Patches with Context-Aware Positional Encoding for Graphic Sketch Representation Sicong Zang et.al. 2403.17525v1 null
2024-03-25 On Policy Reuse: An Expressive Language for Representing and Executing General Policies that Call Other Policies Blai Bonet et.al. 2403.16824v1 null
2024-03-25 CodeS: Natural Language to Code Repository via Multi-Layer Sketch Daoguang Zan et.al. 2403.16443v1 link
2024-03-24 Combined Task and Motion Planning Via Sketch Decompositions (Extended Version with Supplementary Material) Magí Dalmau-Moreno et.al. 2403.16277v1 null
2024-03-22 Efficiently Estimating Mutual Information Between Attributes Across Tables Aécio Santos et.al. 2403.15553v1 null
2024-03-22 Fourier Transform-based Estimators for Data Sketches Seth Pettie et.al. 2403.15366v1 null
2024-03-25 Multimodal-Conditioned Latent Diffusion Models for Fashion Image Editing Alberto Baldrati et.al. 2403.14828v2 link
2024-03-21 Object-Centric Domain Randomization for 3D Shape Reconstruction in the Wild Junhyeong Cho et.al. 2403.14539v1 null
2024-03-21 External Knowledge Enhanced 3D Scene Generation from Sketch Zijie Wu et.al. 2403.14121v1 null
2024-03-20 Towards an extension of Fault Trees in the Predictive Maintenance Scenario Roberta De Fazio et.al. 2403.13785v1 null
2024-03-25 Diagrammatic Instructions to Specify Spatial Objectives and Constraints with Applications to Mobile Base Placement Qilin Sun et.al. 2403.12465v2 null
2024-03-18 Towards a Theory of Pragmatic Information Edward D. Weinberger et.al. 2403.12324v1 null
2024-03-17 Stylized Face Sketch Extraction via Generative Prior with Limited Data Kwan Yun et.al. 2403.11263v1 link
2024-03-16 RETINAQA : A Knowledge Base Question Answering Model Robust to both Answerable and Unanswerable Questions Prayushi Faldu et.al. 2403.10849v1 null
2024-03-15 Animate Your Motion: Turning Still Images into Dynamic Videos Mingxiao Li et.al. 2403.10179v1 null
2024-03-14 What Sketch Explainability Really Means for Downstream Tasks Hmrishav Bandyopadhyay et.al. 2403.09480v1 null
2024-03-14 SketchINR: A First Look into Sketches as Implicit Neural Representations Hmrishav Bandyopadhyay et.al. 2403.09344v1 null
2024-03-14 Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset Hugo Laurençon et.al. 2403.09029v1 null
2024-03-13 ARtVista: Gateway To Empower Anyone Into Artist Trong-Vu Hoang et.al. 2403.08876v1 null
2024-03-13 HAIFIT: Human-Centered AI for Fashion Image Translation Jianan Jiang et.al. 2403.08651v1 link
2024-03-13 Sketch2Manga: Shaded Manga Screening from Sketch with Diffusion Models Jian Lin et.al. 2403.08266v1 null
2024-03-12 It's All About Your Sketch: Democratising Sketch Control in Diffusion Models Subhadeep Koley et.al. 2403.07234v1 link
2024-03-12 You'll Never Walk Alone: A Sketch and Text Duet for Fine-Grained Image Retrieval Subhadeep Koley et.al. 2403.07222v1 null
2024-03-12 Text-to-Image Diffusion Models are Great Sketch-Photo Matchmakers Subhadeep Koley et.al. 2403.07214v1 null
2024-03-11 How to Handle Sketch-Abstraction in Sketch-Based Image Retrieval? Subhadeep Koley et.al. 2403.07203v1 null
2024-03-11 Enhancing Image Caption Generation Using Reinforcement Learning with Human Feedback Adarsh N L et.al. 2403.06735v1 null
2024-03-08 Data-Dependent LSH for the Earth Mover's Distance Rajesh Jayaram et.al. 2403.05041v1 null
2024-03-07 A challenge in A(G)I, cybernetics revived in the Ouroboros Model as one algorithm for all thinking Knud Thomsen et.al. 2403.04292v1 null
2024-03-06 NoiseCollage: A Layout-Aware Text-to-Image Diffusion Model Based on Noise Cropping and Merging Takahiro Shirakawa et.al. 2403.03485v1 link
2024-03-07 DLP-GAN: learning to draw modern Chinese landscape photos with generative adversarial network Xiangquan Gui et.al. 2403.03456v2 null
2024-03-05 SmartSantander: IoT Experimentation over a Smart City Testbed Luis Sanchez et.al. 2403.03196v1 null
2024-03-05 CoGenesis: A Framework Collaborating Large and Small Language Models for Secure Context-Aware Instruction Following Kaiyan Zhang et.al. 2403.03129v1 null
2024-03-05 RT-Sketch: Goal-Conditioned Imitation Learning from Hand-Drawn Sketches Priya Sundaresan et.al. 2403.02709v1 null
2024-03-02 Euclidean distance compression via deep random features Brett Leroux et.al. 2403.01327v1 null
2024-02-29 CoMeT: Count-Min-Sketch-based Row Tracking to Mitigate RowHammer at Low Cost F. Nisa Bostanci et.al. 2402.18769v1 link
2024-02-28 DynaWarp -- Efficient, large-scale log storage and retrieval Julian Reichinger et.al. 2402.18355v1 null
2024-02-28 Block and Detail: Scaffolding Sketch-to-Image Generation Vishnu Sarukkai et.al. 2402.18116v1 null
2024-02-27 Decremental $(1+ε)$ -Approximate Maximum Eigenvector: Dynamic Power Method Deeksha Adil et.al. 2402.17929v1 null
2024-02-27 Surgment: Segmentation-enabled Semantic Search and Creation of Visual Question and Feedback to Support Video-Based Surgery Learning Jingying Wang et.al. 2402.17903v1 null
2024-02-27 CAD-SIGNet: CAD Language Inference from Point Clouds using Layer-wise Sketch Instance Guided Attention Mohammad Sadil Khan et.al. 2402.17678v1 null
2024-02-27 CustomSketching: Sketch Concept Extraction for Sketch-based Image Synthesis and Editing Chufeng Xiao et.al. 2402.17624v1 null
2024-02-27 Equivariant ideals of polynomials Arka Ghosh et.al. 2402.17604v1 null
2024-02-25 Convolution and Cross-Correlation of Count Sketches Enables Fast Cardinality Estimation of Multi-Join Queries Mike Heddes et.al. 2402.15953v1 link
2024-02-23 Genie: Generative Interactive Environments Jake Bruce et.al. 2402.15391v1 null
2024-02-22 Semantic Image Synthesis with Unconditional Generator Jungwoo Chae et.al. 2402.14395v1 null
2024-02-21 Sketching AI Concepts with Capabilities and Examples: AI Innovation in the Intensive Care Unit Nur Yildirim et.al. 2402.13437v1 null
2024-02-20 Quantitative causality, causality-guided scientific discovery, and causal machine learning X. San Liang et.al. 2402.13427v1 null
2024-02-20 Almost-Tight Bounds on Preserving Cuts in Classes of Submodular Hypergraphs Sanjeev Khanna et.al. 2402.13151v1 null
2024-02-17 Be Persistent: Towards a Unified Solution for Mitigating Shortcuts in Deep Learning Hadi M. Dolatabadi et.al. 2402.11237v1 null
2024-02-17 Automated Optimization of Parameterized Data-Plane Programs with Parasol Mary Hogan et.al. 2402.11155v1 null
2024-02-13 Sampling Space-Saving Set Sketches Homin K. Lee et.al. 2402.08604v1 link
2024-02-13 One-to-many Reconstruction of 3D Geometry of cultural Artifacts using a synthetically trained Generative Model Thomas Pöllabauer et.al. 2402.08310v1 null
2024-02-13 Epistemic Power, Objectivity and Gender in AI Ethics Labor: Legitimizing Located Complaints David Gray Widder et.al. 2402.08171v1 null
2024-02-13 Randomized Algorithms for Symmetric Nonnegative Matrix Factorization Koby Hayashi et.al. 2402.08134v1 null
2024-02-10 Guided Sketch-Based Program Induction by Search Gradients Ahmad Ayaz Amin et.al. 2402.06990v1 null
2024-02-09 Squidgets: Sketch-based Widget Design and Direct Manipulation of 3D Scene Joonho Kim et.al. 2402.06795v1 null
2024-02-08 InkSight: Offline-to-Online Handwriting Conversion by Learning to Read and Write Blagoj Mitrevski et.al. 2402.05804v1 null
2024-02-08 A Concept for Reconstructing Stucco Statues from historic Sketches using synthetic Data only Thomas Pöllabauer et.al. 2402.05593v1 null
2024-02-06 Gradient Sketches for Training Data Attribution and Studying the Loss Landscape Andrea Schioppa et.al. 2402.03994v1 null
2024-02-06 3Doodle: Compact Abstraction of Objects with 3D Strokes Changwoon Choi et.al. 2402.03690v1 null
2024-02-05 Computing Generic Fibres of Polynomial Ideals with FGLM and Hensel Lifting Jérémy Berthomieu et.al. 2402.03144v1 null
2024-02-03 Zero-shot sketch-based remote sensing image retrieval based on multi-level and attention-guided tokenization Bo Yang et.al. 2402.02141v1 link
2024-02-02 Solitons, dispersive shock waves and Noel Fredrick Smyth Saleh Baqer et.al. 2402.01332v1 null
2024-02-01 Deep Robot Sketching: An application of Deep Q-Learning Networks for human-like sketching Raul Fernandez-Fernandez et.al. 2402.00676v1 null
2024-02-01 High-Quality Medical Image Generation from Free-hand Sketch Quan Huu Cap et.al. 2402.00353v1 null
2024-01-31 On The Power of Subtle Expressive Cues in the Perception of Human Affects Ezgi Dede et.al. 2401.18013v1 null
2024-02-04 Fine-Grained Zero-Shot Learning: Advances, Challenges, and Prospects Jingcai Guo et.al. 2401.17766v2 link
2024-01-31 Estimating Diffusion Degree on Graph Streams Vinit Ramesh Gore et.al. 2401.17611v1 null
2024-01-31 Topology-Aware Latent Diffusion for 3D Shape Generation Jiangbei Hu et.al. 2401.17603v1 null
2024-01-29 FPGA Technology Mapping Using Sketch-Guided Program Synthesis Gus Henry Smith et.al. 2401.16526v1 null
2024-01-29 Bridging Generative and Discriminative Models for Unified Visual Perception with Diffusion Priors Shiyin Dong et.al. 2401.16459v1 null
2024-01-25 Incremental Proof Development in Dafny with Module-Based Induction Son Ho et.al. 2401.16233v1 null
2024-01-26 Sketch and Refine: Towards Fast and Accurate Lane Detection Chao Chen et.al. 2401.14729v1 link
2024-01-27 Sketch2NeRF: Multi-view Sketch-guided Text-to-3D Generation Minglin Chen et.al. 2401.14257v2 null
2024-01-22 PatternPortrait: Draw Me Like One of Your Scribbles Sabine Wieluch et.al. 2401.13001v1 null
2024-01-22 Automated Completion of Statements and Proofs in Synthetic Geometry: an Approach based on Constraint Solving Salwa Tabet Gonzalez et.al. 2401.11898v1 null
2024-01-18 Sketch-Guided Constrained Decoding for Boosting Blackbox Large Language Models without Logit Access Saibo Geng et.al. 2401.09967v1 null
2024-01-21 Towards Identifiable Unsupervised Domain Translation: A Diversified Distribution Matching Approach Sagar Shrestha et.al. 2401.09671v2 null
2024-01-12 Masked Attribute Description Embedding for Cloth-Changing Person Re-identification Chunlei Peng et.al. 2401.05646v2 link
2024-01-11 DrawTalking: Building Interactive Worlds by Sketching and Speaking Karl Toby Rosenberg et.al. 2401.05631v1 null
2024-01-10 Modality-Aware Representation Learning for Zero-shot Sketch-based Image Retrieval Eunyi Lyou et.al. 2401.04860v1 null
2024-01-09 Content-Conditioned Generation of Stylized Free hand Sketches Jiajun Liu et.al. 2401.04739v1 null
2024-01-09 Representative Feature Extraction During Diffusion Process for Sketch Extraction with One Example Kwan Yun et.al. 2401.04362v1 null
2024-01-08 Flowmind2Digital: The First Comprehensive Flowmind Recognition and Conversion Approach Huanyu Liu et.al. 2401.03742v1 link
2024-01-05 FedNS: A Fast Sketching Newton-Type Algorithm for Federated Learning Jian Li et.al. 2401.02734v1 link
2024-01-02 ColorizeDiffusion: Adjustable Sketch Colorization with Reference Image and Text Dingkun Yan et.al. 2401.01456v1 link
2024-01-01 Free-form Shape Modeling in XR: A Systematic Review Shounak Chatterjee et.al. 2401.00924v1 null
2024-01-01 DiffMorph: Text-less Image Morphing with Diffusion Models Shounak Chatterjee et.al. 2401.00739v1 null
2023-12-31 SynCDR : Training Cross Domain Retrieval Models with Synthetic Data Samarth Mishra et.al. 2401.00420v1 link
2023-12-31 Multi-Granularity Representation Learning for Sketch-based Dynamic Face Image Retrieval Liang Wang et.al. 2401.00371v1 link
2023-12-28 A randomized algorithm to solve reduced rank operator regression Giacomo Turri et.al. 2312.17348v1 link
2024-01-03 SVGDreamer: Text Guided SVG Generation with Diffusion Model Ximing Xing et.al. 2312.16476v2 link
2023-12-22 Generative AI and the History of Architecture Joern Ploennigs et.al. 2312.15106v1 null
2023-12-22 A Modular Approach to Metatheoretic Reasoning for Extensible Languages Dawn Michaelson et.al. 2312.14374v1 null
2023-12-21 On the Hardness of Analyzing Quantum Programs Quantitatively Martin Avanzini et.al. 2312.13657v1 null
2023-12-18 Open Vocabulary Semantic Scene Sketch Understanding Ahmed Bourouis et.al. 2312.12463v1 null
2023-12-19 Sketch Vision: Artificial Intelligence with Sight for Imagination Demircan Tas et.al. 2312.12270v1 null
2023-12-19 Brush Your Text: Synthesize Any Scene Text on Images via Diffusion Model Lingjun Zhang et.al. 2312.12232v1 link
2023-12-19 CreativeConnect: Supporting Reference Recombination for Graphic Design Ideation with Generative AI DaEun Choi et.al. 2312.11949v1 null
2023-12-16 Symmetrical Bidirectional Knowledge Alignment for Zero-Shot Sketch-Based Image Retrieval Decheng Liu et.al. 2312.10320v1 link
2023-12-15 Sketch and shift: a robust decoder for compressive clustering Ayoub Belhadji et.al. 2312.09940v1 null
2023-12-15 Structural Information Guided Multimodal Pre-training for Vehicle-centric Perception Xiao Wang et.al. 2312.09812v1 link
2023-12-14 Matching Noisy Keys for Obfuscation Charlie Dickens et.al. 2312.08981v1 null
2023-12-14 Solving Dense Linear Systems Faster than via Preconditioning Michał Dereziński et.al. 2312.08893v1 null
2023-12-13 Enhance Sketch Recognition's Explainability via Semantic Component-Level Parsing Guangming Zhu et.al. 2312.07875v1 link
2023-12-12 Improved Frequency Estimation Algorithms with and without Predictions Anders Aamand et.al. 2312.07535v1 null
2023-12-09 BARET : Balanced Attention based Real image Editing driven by Target-text Inversion Yuming Qiao et.al. 2312.05482v1 null
2023-12-07 Optimal Multi-Pass Lower Bounds for MST in Dynamic Streams Sepehr Assadi et.al. 2312.04674v1 null
2023-12-07 Deep3DSketch: 3D modeling from Free-hand Sketches with View- and Structural-Aware Adversarial Training Tianrun Chen et.al. 2312.04435v1 null
2023-12-07 DemoCaricature: Democratising Caricature Generation with a Rough Sketch Dar-Yen Chen et.al. 2312.04364v1 null
2023-12-07 Doodle Your 3D: From Abstract Freehand Sketches to Precise 3D Shapes Hmrishav Bandyopadhyay et.al. 2312.04043v1 null
2023-12-06 CAFE: Towards Compact, Adaptive, and Fast Embedding for Large-scale Recommendation Models Hailin Zhang et.al. 2312.03256v1 link
2023-12-05 SEVA: Leveraging sketches to evaluate alignment between human and machine visual abstraction Kushin Mukherjee et.al. 2312.03035v1 link
2023-12-08 FreestyleRet: Retrieving Images from Style-Diversified Queries Hao Li et.al. 2312.02428v2 link
2023-12-04 CLIPDrawX: Primitive-based Explanations for Text Guided Sketch Synthesis Nityanand Mathur et.al. 2312.02345v1 null
2023-12-03 Uncertainty-biased molecular dynamics for learning uniformly accurate interatomic potentials Viktor Zaverkin et.al. 2312.01416v1 null
2023-11-30 Sketch Input Method Editor: A Comprehensive Dataset and Methodology for Systematic Input Recognition Guangming Zhu et.al. 2311.18254v1 link
2023-11-29 Analyzing Query Optimizer Performance in the Presence and Absence of Cardinality Estimates Asoke Datta et.al. 2311.17293v1 null
2023-11-28 Time- and Communication-Efficient Overlay Network Construction via Gossip Fabien Dufoulon et.al. 2311.17115v1 null
2023-11-28 SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models Yuwei Guo et.al. 2311.16933v1 null
2023-11-28 ContextSeg: Sketch Semantic Segmentation by Querying the Context with Attention Jiawei Wang et.al. 2311.16682v1 null
2023-11-28 Text-Driven Image Editing via Learnable Regions Yuanze Lin et.al. 2311.16432v1 link
2023-11-27 MAST: Model-Agnostic Sparsified Training Yury Demidovich et.al. 2311.16086v1 link
2023-11-26 Sketch Video Synthesis Yudian Zheng et.al. 2311.15306v1 link
2023-11-25 A unified framework for learning with nonlinear model classes from arbitrary linear samples Ben Adcock et.al. 2311.14886v1 null
2023-11-24 Data-to-Text Bilingual Generation Guy Lapalme et.al. 2311.14808v1 link
2023-11-24 One Pass Streaming Algorithm for Super Long Token Attention Approximation in Sublinear Space Raghav Addanki et.al. 2311.14652v1 null
2023-11-21 Breathing Life Into Sketches Using Text-to-Video Priors Rinon Gal et.al. 2311.13608v1 null
2023-11-22 Adaptive Sampling for Deep Learning via Efficient Nonparametric Proxies Shabnam Daghaghi et.al. 2311.13583v1 null
2023-11-21 From Concept to Manufacturing: Evaluating Vision-Language Models for Engineering Design Cyril Picard et.al. 2311.12668v1 null
2023-11-19 AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort Wen Wang et.al. 2311.11243v1 null
2023-11-17 Scaling TabPFN: Sketching and Feature Selection for Tabular Prior-Data Fitted Networks Benjamin Feuer et.al. 2311.10609v1 null
2023-11-09 Chain of Images for Intuitively Reasoning Fanxu Meng et.al. 2311.09241v1 link
2023-11-14 Language and Sketching: An LLM-driven Interactive Multimodal Multitask Robot Navigation Framework Weiqin Zu et.al. 2311.08244v1 null
2023-11-13 Fast and Space-Efficient Parallel Algorithms for Influence Maximization Letong Wang et.al. 2311.07554v1 link
2023-11-13 Sketch-based Video Object Segmentation: Benchmark and Analysis Ruolin Yang et.al. 2311.07261v1 null
2023-11-09 General Policies, Subgoal Structure, and Planning Width Blai Bonet et.al. 2311.05490v1 null
2023-11-09 Control3D: Towards Controllable Text-to-3D Generation Yang Chen et.al. 2311.05461v1 null
2023-11-08 Prompt Sketching for Large Language Models Luca Beurer-Kellner et.al. 2311.04954v1 null
2023-11-07 DeepPatent2: A Large-Scale Benchmarking Corpus for Technical Drawing Understanding Kehinde Ajayi et.al. 2311.04098v1 link
2023-11-06 Sketching methods with small window guarantee using minimum decycling sets Guillaume Marçais et.al. 2311.03592v1 link
2023-11-05 Sketching Multidimensional Time Series for Fast Discord Mining Chin-Chia Michael Yeh et.al. 2311.03393v1 null
2023-11-03 Neural Collage Transfer: Artistic Reconstruction via Material Manipulation Ganghun Lee et.al. 2311.02202v1 link
2023-11-06 RT-Trajectory: Robotic Task Generalization via Hindsight Trajectory Sketches Jiayuan Gu et.al. 2311.01977v2 null
2023-11-03 Hardness of Low Rank Approximation of Entrywise Transformed Matrix Products Tamas Sarlos et.al. 2311.01960v1 null
2023-11-03 Towards Concept-Aware Large Language Models Chen Shani et.al. 2311.01866v1 link
2023-11-07 inkn'hue: Enhancing Manga Colorization from Multiple Priors with Alignment Multi-Encoder VAE Tawin Jiramahapokee et.al. 2311.01804v2 link
2023-10-31 Progress and outlook on advanced fly scans based on Mamba Peng-Cheng Li et.al. 2310.20106v1 link
2023-10-30 The Expressibility of Polynomial based Attention Scheme Zhao Song et.al. 2310.20051v1 null
2023-10-29 Sketching Algorithms for Sparse Dictionary Learning: PTAS and Turnstile Streaming Gregory Dexter et.al. 2310.19068v1 null
2023-10-29 Customize StyleGAN with One Hand Sketch Shaocong Zhang et.al. 2310.18949v1 null
2023-10-28 Deep3DSketch+: Obtaining Customized 3D Model by Single Free-Hand Sketch through Deep Learning Ying Zang et.al. 2310.18609v1 null
2023-10-27 Deep3DSketch++: High-Fidelity 3D Modeling from Single Free-hand Sketches Ying Zang et.al. 2310.18178v1 null
2023-10-27 Reality3DSketch: Rapid 3D Modeling of Objects from Single Freehand Sketches Tianrun Chen et.al. 2310.18148v1 null
2023-10-27 On General Language Understanding David Schlangen et.al. 2310.18038v1 null
2023-10-27 Sketching and Streaming for Dictionary Compression Ruben Becker et.al. 2310.17980v1 null
2023-10-26 Skill-Mix: a Flexible and Expandable Family of Evaluations for AI models Dingli Yu et.al. 2310.17567v1 null
2023-10-24 Emergent Communication in Interactive Sketch Question Answering Zixing Lei et.al. 2310.15597v1 link
2023-10-24 Fast multiplication of random dense matrices with fixed sparse matrices Tianyu Liang et.al. 2310.15419v1 link
2023-10-18 A Comprehensive Survey on Vector Database: Storage and Retrieval Technique, Challenge Yikun Han et.al. 2310.11703v1 null
2023-10-17 Matrix Compression via Randomized Low Rank and Low Precision Factorization Rajarshi Saha et.al. 2310.11028v1 link
2023-10-16 HairCLIPv2: Unifying Hair Editing via Proxy Feature Blending Tianyi Wei et.al. 2310.10651v1 link
2023-10-16 Visual Data-Type Understanding does not emerge from Scaling Vision-Language Models Vishaal Udandarao et.al. 2310.08577v2 link
2023-10-12 Visualizing a Nondeterministic to Deterministic Finite-State Machine Transformation Tijana Minic et.al. 2310.08248v1 link
2023-10-11 On $(1+\varepsilon)$ -Approximate Flow Sparsifiers Yu Chen et.al. 2310.07857v1 null
2023-10-10 SketchBodyNet: A Sketch-Driven Multi-faceted Decoder Network for 3D Human Reconstruction Fei Wang et.al. 2310.06577v1 link
2023-10-15 HyperLips: Hyper Control Lips with High Resolution Decoder for Talking Face Generation Yaosen Chen et.al. 2310.05720v3 link
2023-10-09 Logic-guided Deep Reinforcement Learning for Stock Trading Zhiming Li et.al. 2310.05551v1 null
2023-10-08 Transforming Pixels into a Masterpiece: AI-Powered Art Restoration using a Novel Distributed Denoising CNN (DDCNN) Sankar B. et.al. 2310.05270v1 null
2023-10-06 Hanging in there: Prenatal origins of antigravity homeostasis in humans Nicholas M. Wilkinson et.al. 2310.04168v1 null
2023-10-06 Deterministic Clustering in High Dimensional Spaces: Sketches and Approximation Vincent Cohen-Addad et.al. 2310.04076v1 null
2023-10-05 Matrix Completion from One-Bit Dither Samples Arian Eamaz et.al. 2310.03224v1 null
2023-10-04 Streaming Euclidean $k$-median and $k$-means with $o(\log n)$ Space Vincent Cohen-Addad et.al. 2310.02882v1 null
2023-10-04 On the tilt of the Earth's polar axis ( $κλιμα$ ): Some 'impressionist' remarks V. Courtillot et.al. 2310.02768v1 null
2023-10-03 View-Independent Adjoint Light Tracing for Lighting Design Optimization Lukas Lipp et.al. 2310.02043v1 null
2023-10-03 Randomized Dimension Reduction with Statistical Guarantees Yijun Dong et.al. 2310.01739v1 null
2023-10-02 PolySketchFormer: Fast Transformers via Sketches for Polynomial Kernels Praneeth Kacham et.al. 2310.01655v1 null
2023-09-29 Toward Operationalizing Pipeline-aware ML Fairness: A Research Agenda for Developing Practical Guidelines and Tools Emily Black et.al. 2309.17337v1 null
2023-09-28 Sketch2CADScript: 3D Scene Reconstruction from 2D Sketch using Visual Transformer and Rhino Grasshopper Hong-Bin Yang et.al. 2309.16850v1 null
2023-09-28 Multi-Modal Financial Time-Series Retrieval Through Latent Space Projections Tom Bamford et.al. 2309.16741v1 null
2023-09-28 Language models in molecular discovery Nikita Janakarajan et.al. 2309.16235v1 null
2023-10-01 Sampling Methods for Inner Product Sketching Majid Daliri et.al. 2309.16157v2 link
2023-09-27 Fast Locality Sensitive Hashing with Theoretical Guarantee Zongyuan Tan et.al. 2309.15479v1 null
2023-09-25 Guess & Sketch: Language Model Guided Transpilation Celine Lee et.al. 2309.14396v1 null
2023-09-22 Deep3DSketch+: Rapid 3D Modeling from Single Free-hand Sketches Tianrun Chen et.al. 2309.13006v1 null
2023-09-22 Visualization According to Statisticians: An Interview Study on the Role of Visualization for Inferential Statistics Eric Newburger et.al. 2309.12684v1 null
2023-09-22 Towards medhub: A Self-Service Platform for Analysts and Physicians Markus Höhn et.al. 2309.11234v2 null
2023-09-20 An Empirical Study of Malicious Code In PyPI Ecosystem Wenbo Guo et.al. 2309.11021v1 link
2023-09-19 An overview of some mathematical techniques and problems linking 3D vision to 3D printing Emiliano Cristiani et.al. 2309.10549v1 null
2023-09-19 Learning Orbitally Stable Systems for Diagrammatically Teaching Weiming Zhi et.al. 2309.10298v1 null
2023-09-18 Completeness Thresholds for Memory Safety: Unbounded Guarantees via Bounded Proofs (Extended Abstract) Tobias Reinhard et.al. 2309.09731v1 null
2023-09-18 Applying Security Testing Techniques to Automotive Engineering Irdin Pekaric et.al. 2309.09647v1 null
2023-09-15 Active Learning for Fine-Grained Sketch-Based Image Retrieval Himanshu Thakur et.al. 2309.08743v1 null
2023-09-15 Beyond Domain Gap: Exploiting Subjectivity in Sketch-Based Person Retrieval Kejun Lin et.al. 2309.08372v1 link
2023-09-14 Landscape-Sketch-Step: An AI/ML-Based Metaheuristic for Surrogate Optimization Problems Rafael Monteiro et.al. 2309.07936v1 link
2023-09-12 Grounded Language Acquisition From Object and Action Imagery James Robert Kubricht et.al. 2309.06335v1 null
2023-09-12 OmniSketch: Efficient Multi-Dimensional High-Velocity Stream Analytics with Arbitrary Predicates Wieger R. Punter et.al. 2309.06051v1 null
2023-09-12 GA-Sketching: Shape Modeling from Multi-View Sketching with Geometry-Aligned Deep Implicit Functions Jie Zhou et.al. 2309.05946v1 link
2023-09-11 Photodetachment dynamics using nonlocal dicrete-state-in-continuum model Martin Čížek et.al. 2309.05830v1 null
2023-09-10 Streaming Semidefinite Programs: $O(\sqrt{n})$ Passes, Small Space and Fast Runtime Zhao Song et.al. 2309.05135v1 null
2023-09-08 Receiving an algorithmic recommendation based on documentary filmmaking techniques Samuel Gantier et.al. 2309.04184v1 null
2023-09-07 Learning from Demonstration via Probabilistic Diagrammatic Teaching Weiming Zhi et.al. 2309.03835v1 null
2023-09-07 Adjacency Sketches in Adversarial Environments Moni Naor et.al. 2309.03728v1 null
2023-09-06 An Evaluation of Software Sketches Roy Friedman et.al. 2309.03045v1 null
2023-09-03 Business Process Text Sketch Automation Generation Using Large Language Model Rui Zhu et.al. 2309.01071v1 null
2023-09-02 Online Adaptive Mahalanobis Distance Estimation Lianke Qin et.al. 2309.01030v1 null
2023-09-01 Randomized Polar Codes for Anytime Distributed Machine Learning Burak Bartan et.al. 2309.00682v1 null
2023-09-01 Human-Inspired Facial Sketch Synthesis with Dynamic Adaptation Fei Gao et.al. 2309.00216v1 link
2023-08-31 Terrain Diffusion Network: Climatic-Aware Terrain Generation with Geological Sketch Guidance Zexin Hu et.al. 2308.16725v1 null
2023-08-30 Surrogate-based Autotuning for Randomized Sketching Algorithms in Regression Problems Younghyun Cho et.al. 2308.15720v1 null
2023-08-27 SketchDreamer: Interactive Text-Augmented Creative Sketch Ideation Zhiyu Qu et.al. 2308.14191v1 link
2023-08-25 WorldSmith: Iterative and Expressive Prompting for World Building with a Generative AI Hai Dang et.al. 2308.13355v1 null
2023-08-25 Bridging the Gap: Fine-to-Coarse Sketch Interpolation Network for High-Quality Animation Sketch Inbetweening Jiaming Shen et.al. 2308.13273v1 null
2023-08-21 Geo-Sketcher: Rapid 3D Geological Modeling using Geological and Topographic Map Sketches Ronan Amorim et.al. 2308.12152v1 null
2023-08-24 Bayesian Learning for Dynamic Target Localization with Human-provided Spatial Information Min-Won Seo et.al. 2308.11839v2 null
2023-08-22 MatFuse: Controllable Material Generation with Diffusion Models Giuseppe Vecchio et.al. 2308.11408v1 link
2023-08-22 Minwise-Independent Permutations with Insertion and Deletion of Features Rameshwar Pratap et.al. 2308.11240v1 null
2023-08-28 Large Language Models for Software Engineering: A Systematic Literature Review Xinyi Hou et.al. 2308.10620v2 null
2023-08-16 Freedom of Speech and AI Output Eugene Volokh et.al. 2308.08673v1 null
2023-08-16 Painter: Teaching Auto-regressive Language Models to Draw Sketches Reza Pourreza et.al. 2308.08520v1 null
2023-08-15 Inversion-by-Inversion: Exemplar-based Sketch-to-Photo Synthesis via Stochastic Differential Equations without Training Ximing Xing et.al. 2308.07665v1 link
2023-08-11 Masked-Attention Diffusion Guidance for Spatially Controlling Text-to-Image Generation Yuki Endo et.al. 2308.06027v1 link
2023-08-11 Uncertainty-Aware Cross-Modal Transfer Network for Sketch-Based 3D Shape Retrieval Yiyang Cai et.al. 2308.05948v1 null
2023-08-20 The Fast and the Private: Task-based Dataset Search Zezhou Huang et.al. 2308.05637v2 null
2023-08-12 LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation Leigang Qu et.al. 2308.05095v2 null
2023-08-10 Apple Vision Pro for Healthcare: "The Ultimate Display"? -- Entering the Wonderland of Precision Jan Egger et.al. 2308.04313v3 null
2023-08-08 Iterative Sketching for Secure Coded Regression Neophytos Charalambides et.al. 2308.04185v1 null
2023-08-06 Gradient Coding through Iterative Block Leverage Score Sampling Neophytos Charalambides et.al. 2308.03096v1 null
2023-08-05 Sketch and Text Guided Diffusion Model for Colored Point Cloud Generation Zijie Wu et.al. 2308.02874v1 null
2023-08-07 SoK: The Ghost Trilemma S. Mukherjee et.al. 2308.02202v2 null
2023-08-07 BEVControl: Accurately Controlling Street-view Elements with Multi-perspective Consistency via BEV Sketch Layout Kairui Yang et.al. 2308.01661v3 null
2023-08-03 PPI-NET: End-to-End Parametric Primitive Inference Liang Wang et.al. 2308.01521v1 null
2023-08-01 Neural approximation of Wasserstein distance via a universal architecture for symmetric and factorwise group invariant functions Samantha Chen et.al. 2308.00273v1 null
2023-08-01 CONSTRUCT: A Program Synthesis Approach for Reconstructing Control Algorithms from Embedded System Binaries in Cyber-Physical Systems Ali Shokri et.al. 2308.00250v1 null
2023-07-30 RealityCanvas: Augmented Reality Sketching for Embedded and Responsive Scribble Animation Effects Zhijie Xia et.al. 2307.16116v1 link
2023-07-25 Federated Heavy Hitter Recovery under Linear Sketching Adria Gascon et.al. 2307.13347v1 null
2023-07-24 Learning Dense Correspondences between Photos and Sketches Xuanchen Lu et.al. 2307.12967v1 null
2023-07-18 Semi-supervised Cycle-GAN for face photo-sketch translation in the wild Chaofeng Chen et.al. 2307.10281v1 null
2023-07-14 Volumetric Wireframe Parsing from Neural Attraction Fields Nan Xue et.al. 2307.10206v1 link
2023-07-17 Multi-Domain Learning with Modulation Adapters Ekaterina Iakovleva et.al. 2307.08528v1 null
2023-07-16 InkSight: Leveraging Sketch Interaction for Documenting Chart Findings in Computational Notebooks Yanna Lin et.al. 2307.07922v1 null
2023-07-13 Connectivity Labeling for Multiple Vertex Failures Merav Parter et.al. 2307.06276v2 null
2023-07-10 Some Preliminary Steps Towards Metaverse Logic Antonio L. Furtado et.al. 2307.05574v1 null
2023-07-11 A "Game of Like" : Online Social Network Sharing As Strategic Interaction Emmanuel J. Genot et.al. 2307.05063v1 null
2023-07-11 Diffusion idea exploration for art generation Nikhil Verma et.al. 2307.04978v1 null
2023-07-08 Sketch-A-Shape: Zero-Shot Sketch-to-3D Shape Generation Aditya Sanghi et.al. 2307.03869v1 null
2023-07-06 Wireless Multi-Agent Generative AI: From Connected Intelligence to Collective Intelligence Hang Zou et.al. 2307.02757v1 null
2023-07-04 Text + Sketch: Image Compression at Ultra Low Rates Eric Lei et.al. 2307.01944v1 link
2023-07-03 Digital Twin-Empowered Communications: A New Frontier of Wireless Networks Lina Bariah et.al. 2307.00973v1 null
2023-07-04 SketchMetaFace: A Learning-based Sketching Interface for High-fidelity 3D Character Face Modeling Zhongjin Luo et.al. 2307.00804v2 null
2023-06-27 Cartesian institutions with evidence: Data and system modelling with diagrammatic constraints and generalized sketches Zinovy Diskin et.al. 2306.16284v1 null
2023-06-26 Towards Optimal Effective Resistance Estimation Rajat Vadiraj Dwaraknath et.al. 2306.14820v1 null
2023-06-26 DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models Ximing Xing et.al. 2306.14685v1 link
2023-06-25 ALBUS: a Probabilistic Monitoring Algorithm to Counter Burst-Flood Attacks Simon Scherrer et.al. 2306.14328v1 null
2023-06-24 Full Automation of Goal-driven LLM Dialog Threads with And-Or Recursors and Refiner Oracles Paul Tarau et.al. 2306.14077v1 link
2023-06-21 PrivSketch: A Private Sketch-based Frequency Estimation Protocol for Data Streams Ying Li et.al. 2306.12144v1 null
2023-06-20 Computing a human-like reaction time metric from stable recurrent vision models Lore Goetschalckx et.al. 2306.11582v1 null
2023-06-23 3D VR Sketch Guided 3D Shape Prototyping and Exploration Ling Luo et.al. 2306.10830v2 link
2023-06-19 Shape Guided Gradient Voting for Domain Generalization Jiaqi Xu et.al. 2306.10809v1 null
2023-06-15 Private Federated Frequency Estimation: Adapting to the Hardness of the Instance Jingfeng Wu et.al. 2306.09396v1 null
2023-06-15 Conditional Human Sketch Synthesis with Explicit Abstraction Control Dar-Yen Chen et.al. 2306.09274v1 null
2023-06-15 Behaviorally Typed State Machines in TypeScript for Heterogeneous Swarms Roland Kuhn et.al. 2306.09068v1 link
2023-06-15 Interleaving Pre-Trained Language Models and Large Language Models for Zero-Shot NL2SQL Generation Zihui Gu et.al. 2306.08891v1 link
2023-06-14 Zero-Shot 3D Shape Sketch View Similarity and Retrieval Gianluca Berardi et.al. 2306.08541v1 null
2023-06-14 Probing the unfolded configurations of a $β$ -hairpin using sketch-map Albert Ardevol et.al. 2306.08429v1 null
2023-06-14 CLIPXPlore: Coupled CLIP and Shape Spaces for 3D Shape Exploration Jingyu Hu et.al. 2306.08226v1 null
2023-06-13 AniFaceDrawing: Anime Portrait Exploration during Your Sketching Zhengyu Huang et.al. 2306.07476v1 null
2023-06-15 Strokes2Surface: Recovering Curve Networks From 4D Architectural Design Sketches S. Rasoulzadeh et.al. 2306.07220v2 link
2023-06-11 Learning the Positions in CountSketch Yi Li et.al. 2306.06611v1 null
2023-06-09 SENS: Sketch-based Implicit Neural Shape Modeling Alexandre Binninger et.al. 2306.06088v1 null
2023-06-09 Sketch2Stress: Sketching with Structural Stress Awareness Deng Yu et.al. 2306.05911v1 null
2023-06-09 Sketch Beautification: Learning Part Beautification and Structure Refinement for Sketches of Man-made Objects Deng Yu et.al. 2306.05832v1 null
2023-06-05 Tracking Evolving labels using Cone based Oracles Aditya Acharya et.al. 2306.03306v1 null
2023-06-09 Explicit Construction of q-ary 2-deletion Correcting Codes with Low Redundancy Shu Liu et.al. 2306.02868v2 null
2023-06-06 VideoComposer: Compositional Video Synthesis with Motion Controllability Xiang Wang et.al. 2306.02018v2 null
2023-06-07 Cross Modal Data Discovery over Structured and Unstructured Data Lakes Mohamed Y. Eltabakh et.al. 2306.00932v2 link
2023-06-01 Towards Interactive Image Inpainting via Sketch Refinement Chang Liu et.al. 2306.00407v1 link
2023-06-01 Faster Robust Tensor Power Method for Arbitrary Order Yichuan Deng et.al. 2306.00406v1 null
2023-05-31 Knowledge Base Question Answering for Space Debris Queries Paul Darm et.al. 2305.19734v1 link
2023-05-30 A Recipe for Efficient SBIR Models: Combining Relative Triplet Loss with Batch Normalization and Knowledge Distillation Omar Seddati et.al. 2305.18988v1 null
2023-05-30 DiffSketching: Sketch Control Image Synthesis with Diffusion Models Qiang Wang et.al. 2305.18812v1 link
2023-05-30 Generalization Bounds for Magnitude-Based Pruning via Sparse Matrix Sketching Etash Kumar Guha et.al. 2305.18789v1 null
2023-05-29 Controllable Text-to-Image Generation with GPT-4 Tianjun Zhang et.al. 2305.18583v1 null
2023-05-29 ANPL: Compiling Natural Programs with Interactive Decomposition Di Huang et.al. 2305.18498v1 link
2023-05-30 TaleCrafter: Interactive Story Visualization with Multiple Characters Yuan Gong et.al. 2305.18247v2 link
2023-05-27 Pruning at Initialization -- A Sketching Perspective Noga Bar et.al. 2305.17559v1 null
2023-05-27 On the Noise Sensitivity of the Randomized SVD Elad Romanov et.al. 2305.17435v1 link
2023-05-26 BIG-C: a Multimodal Multi-Purpose Dataset for Bemba Claytone Sikasote et.al. 2305.17202v1 link
2023-05-26 CARAMEL: A Succinct Read-Only Lookup Table via Compressed Static Functions Benjamin Coleman et.al. 2305.16545v1 null
2023-05-25 SketchOGD: Memory-Efficient Continual Learning Benjamin Wright et.al. 2305.16424v1 link
2023-05-24 DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion Models Sungnyun Kim et.al. 2305.15194v1 link
2023-05-23 Distributed CONGEST Algorithms against Mobile Adversaries Orr Fischer et.al. 2305.14300v1 null
2023-05-19 MaGIC: Multi-modality Guided Image Completion Yongsheng Yu et.al. 2305.11818v1 null
2023-05-19 MIDI-Draw: Sketching to Control Melody Generation Tashi Namgyal et.al. 2305.11605v1 null
2023-05-17 Data Extraction via Semantic Regular Expression Synthesis Qiaochu Chen et.al. 2305.10401v1 null
2023-05-15 Scalable and Robust Tensor Ring Decomposition for Large-scale Data Yicong He et.al. 2305.09044v1 null
2023-05-15 Validity Constraints for Data Analysis Workflows Florian Schintke et.al. 2305.08409v1 null
2023-05-15 Fast and Efficient Matching Algorithm with Deadline Instances Zhao Song et.al. 2305.08353v1 null
2023-05-15 Approximation and Progressive Display of Multiverse Analyses Yang Liu et.al. 2305.08323v1 null
2023-05-11 Enabling Programming Thinking in Large Language Models Toward Code Generation Jia Li et.al. 2305.06599v1 null
2023-05-12 Searching Mobile App Screens via Text + Doodle Soumik Mohian et.al. 2305.06165v2 link
2023-05-10 Sketching the Future (STF): Applying Conditional Control Techniques to Text-to-Video Models Rohan Dhesikan et.al. 2305.05845v1 link
2023-05-09 Adapt and Align to Improve Zero-Shot Sketch-Based Image Retrieval Shiyin Dong et.al. 2305.05144v1 null
2023-05-08 Behavioural Types for Local-First Software Roland Kuhn et.al. 2305.04848v1 null
2023-05-09 Locally Attentional SDF Diffusion for Controllable 3D Shape Generation Xin-Yang Zheng et.al. 2305.04461v2 null
2023-05-08 Oblivious algorithms for the Max- $k$ AND Problem Noah G. Singer et.al. 2305.04438v1 null
2023-05-05 Towards Feminist Intersectional XAI: From Explainability to Response-Ability Goda Klumbyte et.al. 2305.03375v1 null
2023-05-04 Program Synthesis for Robot Learning from Demonstrations Noah Patton et.al. 2305.03129v1 null
2023-05-04 HAISTA-NET: Human Assisted Instance Segmentation Through Attention Muhammed Korkmaz et.al. 2305.03105v1 null
2023-05-04 Controllable Visual-Tactile Synthesis Ruihan Gao et.al. 2305.03051v1 link
2023-05-02 A Survey of Methods for Converting Unstructured Data to CSG Models Pierre-Alain Fayolle et.al. 2305.01220v1 null
2023-05-01 IndoorSim-to-OutdoorReal: Learning to Navigate Outdoors without any Outdoor Experience Joanne Truong et.al. 2305.01098v1 null
2023-05-01 Design and Evaluation of a Bioinspired Tendon-Driven 3D-Printed Robotic Eye with Active Vision Capabilities Hamid Osooli et.al. 2305.01076v1 link
2023-05-01 semantic neural model approach for face recognition from sketch Chandana Navuluri et.al. 2305.01058v1 null
2023-04-25 Bridging graph data models: RDF, RDF-star, and property graphs as directed acyclic graphs Ewout Gelling et.al. 2304.13097v1 link
2023-04-25 DualSlide: Global-to-Local Sketching Interface for Slide Content and Layout Design Jiahao Weng et.al. 2304.12506v1 null
2023-04-23 SketchXAI: A First Look at Explainability for Human Sketches Zhiyu Qu et.al. 2304.11744v1 null
2023-04-22 (Vector) Space is Not the Final Frontier: Product Search as Program Synthesis Jacopo Tagliabue et.al. 2304.11473v1 null
2023-04-21 The centaur programmer -- How Kasparov's Advanced Chess spans over to the software development of the future Pedro Alves et.al. 2304.11172v1 null
2023-04-19 StyleDEM: a Versatile Model for Authoring Terrains Simon Perche et.al. 2304.09626v1 null
2023-04-19 Sensitivity estimation for differentially private query processing Meifan Zhang et.al. 2304.09546v1 null
2023-04-19 A Protocol for Cast-as-Intended Verifiability with a Second Device Johannes Müller et.al. 2304.09456v1 null
2023-04-18 Optimal Eigenvalue Approximation via Sketching William Swartworth et.al. 2304.09281v1 null
2023-04-18 GUILGET: GUI Layout GEneration with Transformer Andrey Sobolevsky et.al. 2304.09012v1 link
2023-04-18 Coefficient Synthesis for Threshold Automata A. R. Balasubramanian et.al. 2304.08917v1 null
2023-04-18 Online fair division with arbitrary entitlements Kushagra Chatterjee et.al. 2304.08864v1 null
2023-04-17 Learning Geometry-aware Representations by Sketching Hyundo Lee et.al. 2304.08204v1 null
2023-04-15 Learned Interpolation for Better Streaming Quantile Approximation with Worst-Case Guarantees Nicholas Schiefer et.al. 2304.07652v1 null
2023-04-15 Remembering Ludwig Dmitrievich Faddeev, our lifelong partner in mathematical physics Daniel Sternheimer et.al. 2304.07577v1 null
2023-04-14 Pool Inference Attacks on Local Differential Privacy: Quantifying the Privacy Guarantees of Apple's Count Mean Sketch in Practice Andrea Gadotti et.al. 2304.07134v1 null
2023-04-14 On deterministic, constant memory triangular searches on the integer lattice J. Alfredo Cruz-Carlon et.al. 2304.07033v1 null
2023-04-13 Learning Controllable 3D Diffusion Models from Single-view Images Jiatao Gu et.al. 2304.06700v1 null
2023-04-13 On streaming approximation algorithms for constraint satisfaction problems Noah G. Singer et.al. 2304.06664v1 null
2023-04-13 Solving Tensor Low Cycle Rank Approximation Yichuan Deng et.al. 2304.06594v1 null
2023-04-12 TextANIMAR: Text-based 3D Animal Fine-Grained Retrieval Trung-Nghia Le et.al. 2304.06053v1 null
2023-04-12 SketchANIMAR: Sketch-based 3D Animal Fine-Grained Retrieval Trung-Nghia Le et.al. 2304.05731v1 null
2023-04-10 Identity-Guided Collaborative Learning for Cloth-Changing Person Reidentification Zan Gao et.al. 2304.04400v1 null
2023-04-09 On Extend-Only Directed Posets and Derived Byzantine-Tolerant Replicated Data Types (Extended Version) Florian Jacob et.al. 2304.04318v1 null
2023-04-07 ChiroDiff: Modelling chirographic data with Diffusion Models Ayan Das et.al. 2304.03785v1 null
2023-04-06 SketchFFusion: Sketch-guided image editing with diffusion model Weihang Mao et.al. 2304.03174v1 null
2023-04-06 LSketch: A Label-Enabled Graph Stream Sketch Toward Time-Sensitive Queries Yiling Zeng et.al. 2304.02897v1 null
2023-04-05 Tracing and Visualizing Human-ML/AI Collaborative Processes through Artifacts of Data Work Jennifer Rogers and et.al. 2304.02699v1 null
2023-04-05 Beyond Summarization: Designing AI Support for Real-World Expository Writing Tasks Zejiang Shen et.al. 2304.02623v1 null
2023-04-05 Optimal Sketching Bounds for Sparse Linear Regression Tung Mai et.al. 2304.02261v1 null
2023-04-05 LogoNet: a fine-grained network for instance-level logo sketch retrieval Binbin Feng et.al. 2304.02214v1 link
2023-04-04 Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing Alberto Baldrati et.al. 2304.02051v1 link
2023-04-02 Sketch-based Video Object Localization Sangmin Woo et.al. 2304.00450v1 link
2023-03-31 Almost Linear Constant-Factor Sketching for $\ell_1$ and Logistic Regression Alexander Munteanu et.al. 2304.00051v1 link
2023-03-30 If At First You Don't Succeed: Test Time Re-ranking for Zero-shot, Cross-domain Retrieval Finlay G. C. Hudson et.al. 2303.17703v1 null
2023-03-30 Methods and advancement of content-based fashion image retrieval: A Review Amin Muhammad Shoib et.al. 2303.17371v1 null
2023-03-29 Sketch-an-Anchor: Sub-epoch Fast Model Adaptation for Zero-shot Sketch-based Image Retrieval Leo Sampaio Ferraz Ribeiro et.al. 2303.16769v1 null
2023-03-28 Visual Chain-of-Thought Diffusion Models William Harvey et.al. 2303.16187v1 link
2023-03-27 What Can Human Sketches Do for Object Detection? Pinaki Nath Chowdhury et.al. 2303.15149v1 null
2023-03-25 Zero-Shot Everything Sketch-Based Image Retrieval, and in Explainable Style Fengyin Lin et.al. 2303.14348v1 link
2023-03-24 Feature Space Sketching for Logistic Regression Gregory Dexter et.al. 2303.14284v1 null
2023-03-24 Exploiting Unlabelled Photos for Stronger Fine-Grained SBIR Aneeshan Sain et.al. 2303.13779v1 null
2023-03-24 The First Computer Program Raúl Rojas et.al. 2303.13740v1 null
2023-03-28 CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained or Not Aneeshan Sain et.al. 2303.13440v3 null
2023-03-23 Defining Quality Requirements for a Trustworthy AI Wildflower Monitoring Platform Petra Heck et.al. 2303.13151v1 null
2023-03-22 Evaluation of Sketch-Based and Semantic-Based Modalities for Mockup Generation Tommaso Calò et.al. 2303.12709v1 null
2023-03-22 An Extended Study of Human-like Behavior under Adversarial Training Paul Gavrikov et.al. 2303.12669v1 link
2023-03-24 RaBit: Parametric Modeling of 3D Biped Cartoon Characters with a Topological-consistent Dataset Zhongjin Luo et.al. 2303.12564v2 null
2023-03-21 Roots and Requirements for Collaborative AI Mark Stefik et.al. 2303.12040v1 null
2023-03-23 Sketch2Saliency: Learning to Detect Salient Objects from Human Drawings Ayan Kumar Bhunia et.al. 2303.11502v2 null
2023-03-20 Automatic Measures for Evaluating Generative Design Methods for Architects Eric Yeh et.al. 2303.11483v1 null
2023-03-20 Picture that Sketch: Photorealistic Image Generation from Abstract Sketches Subhadeep Koley et.al. 2303.11162v1 null
2023-03-20 On the Maximal Independent Sets of $k$ -mers with the Edit Distance Leran Ma et.al. 2303.10926v1 link
2023-03-19 SKED: Sketch-guided Text-based 3D Editing Aryan Mikaeili et.al. 2303.10735v1 null
2023-03-19 Trainable Projected Gradient Method for Robust Fine-tuning Junjiao Tian et.al. 2303.10720v1 link
2023-03-19 EduVis: Workshop on Visualization Education, Literacy, and Activities Mandy Keck et.al. 2303.10708v1 null
2023-03-19 SECAD-Net: Self-Supervised CAD Reconstruction by Learning Sketch-Extrude Operations Pu Li et.al. 2303.10613v1 link
2023-03-17 PersonalTailor: Personalizing 2D Pattern Design from 3D Garment Point Clouds Anran Qi et.al. 2303.09695v1 null
2023-03-15 Query-guided Attention in Vision Transformers for Localizing Objects Using a Single Sketch Aditay Tripathi et.al. 2303.08784v1 null
2023-03-15 RIS-Enabled Smart Wireless Environments: Deployment Scenarios, Network Architecture, Bandwidth and Area of Influence George C. Alexandropoulos et.al. 2303.08505v1 null
2023-03-14 Data-Free Sketch-Based Image Retrieval Abhra Chaudhuri et.al. 2303.07775v1 link
2023-03-13 Can Workers Meaningfully Consent to Workplace Wellbeing Technologies? Shreya Chowdhary et.al. 2303.07242v1 null
2023-03-13 An Improved Sample Complexity for Rank-1 Matrix Sensing Yichuan Deng et.al. 2303.06895v1 null
2023-03-10 StyleGANEX: StyleGAN-Based Manipulation Beyond Cropped Aligned Faces Shuai Yang et.al. 2303.06146v1 link
2023-03-08 Sketching with Spherical Designs for Noisy Data Fitting on Spheres Shao-Bo Lin et.al. 2303.04550v1 null
2023-03-08 Models of symbol emergence in communication: a conceptual review and a guide for avoiding local minima Julian Zubek et.al. 2303.04544v1 null
2023-03-07 Introspective Cross-Attention Probing for Lightweight Transfer of Pre-trained Models Yonatan Dukler et.al. 2303.04105v1 null
2023-03-06 Data Portraits: Recording Foundation Model Training Data Marc Marone et.al. 2303.03919v1 null
2023-03-07 Sketch-based Medical Image Retrieval Kazuma Kobayashi et.al. 2303.03633v1 null
2023-03-06 Model Sketching: Centering Concepts in Early-Stage Machine Learning Model Design Michelle S. Lam et.al. 2303.02884v1 link
2023-03-05 Text2Face: A Multi-Modal 3D Face Model Will Rowan et.al. 2303.02688v1 null
2023-03-03 Graph-based Extreme Feature Selection for Multi-class Classification Tasks Shir Friedman et.al. 2303.01792v1 null
2023-03-02 Coresets for Clustering in Geometric Intersection Graphs Sayan Bandyapadhyay et.al. 2303.01400v1 null
2023-03-01 Sketch2Cloth: Sketch-based 3D Garment Generation with Unsigned Distance Fields Yi He et.al. 2303.00167v1 null
2023-02-26 Towards Human-Bot Collaborative Software Architecting with ChatGPT Aakash Ahmad et.al. 2302.14600v1 link
2023-02-28 On-the-Fly Communication-and-Computing for Distributed Tensor Decomposition Over MIMO Channels Xu Chen et.al. 2302.14297v1 null
2023-02-27 Capstone: A Capability-based Foundation for Trustless Secure Memory Access (Extended Version) Jason Zhijingcheng Yu et.al. 2302.13863v1 null
2023-02-27 Evaluation of Automatically Constructed Word Meaning Explanations Marie Stará et.al. 2302.13625v1 null
2023-02-26 Scalable Weight Reparametrization for Efficient Transfer Learning Byeonggeun Kim et.al. 2302.13435v1 null
2023-02-24 Modulating Pretrained Diffusion Models for Multimodal Image Synthesis Cusuh Ham et.al. 2302.12764v1 null
2023-02-23 Using Colors and Sketches to Count Subgraphs in a Streaming Graph Shirin Handjani et.al. 2302.12210v1 null
2023-02-24 A Scalable Space-efficient In-database Interpretability Framework for Embedding-based Semantic SQL Queries Prabhakar Kudva et.al. 2302.12178v2 null
2023-02-22 A Reference Architecture for Observability and Compliance of Cloud Native Applications William Pourmajidi et.al. 2302.11617v1 null
2023-02-20 Ontology-aware Network for Zero-shot Sketch-based Image Retrieval Haoxiang Zhang et.al. 2302.10040v1 null
2023-02-22 Composer: Creative and Controllable Image Synthesis with Composable Conditions Lianghua Huang et.al. 2302.09778v2 link
2023-02-16 Rejecting Cognitivism: Computational Phenomenology for Deep Learning Pierre Beckmann et.al. 2302.09071v1 null
2023-02-14 DiffFaceSketch: High-Fidelity Face Image Synthesis with Sketch-Guided Latent Diffusion Model Yichen Peng et.al. 2302.06908v1 link
2023-02-14 Text-Guided Scene Sketch-to-Photo Synthesis AprilPyone MaungMaung et.al. 2302.06883v1 null
2023-02-14 Make Your Brief Stroke Real and Stereoscopic: 3D-Aware Simplified Sketch to Portrait Generation Yasheng Sun et.al. 2302.06857v1 null
2023-02-13 SkCoder: A Sketch-based Approach for Automatic Code Generation Jia Li et.al. 2302.06144v1 link
2023-02-13 Learning to Scale Temperature in Masked Self-Attention for Image Inpainting Xiang Zhou et.al. 2302.06130v1 null
2023-02-11 An Evaluation Algorithm for Datalog with Equality Martin E. Bidlingmaier et.al. 2302.05792v1 link
2023-02-11 Sketch Less Face Image Retrieval: A New Challenge Dawei Dai et.al. 2302.05576v1 link
2023-02-10 MaskSketch: Unpaired Structure-guided Masked Image Generation Dina Bashkirova et.al. 2302.05496v1 link
2023-02-10 Count-min sketch with variable number of hash functions: an experimental study Éric Fusy et.al. 2302.05245v1 null
2023-02-10 Fast Gumbel-Max Sketch and its Applications Yuanming Zhang et.al. 2302.05176v1 null
2023-02-09 Projection-free Online Exp-concave Optimization Dan Garber et.al. 2302.04859v1 null
2023-02-09 Locally consistent decomposition of strings with applications to edit distance sketching Sudatta Bhattacharya et.al. 2302.04475v1 null
2023-02-06 Sketching Robot Programs On the Fly David Porfirio et.al. 2302.03088v1 null
2023-02-05 Leaving Reality to Imagination: Robust Classification via Generated Datasets Hritik Bansal et.al. 2302.02503v1 link
2023-02-04 An Effective and Differentially Private Protocol for Secure Distributed Cardinality Estimation Pinghui Wang et.al. 2302.02158v1 null
2023-02-04 Sketch-Flip-Merge: Mergeable Sketches for Private Distinct Counting Jonathan Hehir et.al. 2302.02056v1 null
2023-02-01 A Nearly-Optimal Bound for Fast Regression with $\ell_\infty$ Guarantee Zhao Song et.al. 2302.00248v1 null
2023-01-31 FLAME: A small language model for spreadsheet formulas Harshit Joshi et.al. 2301.13779v1 null
2023-01-30 Streaming Anomaly Detection Siddharth Bhatia et.al. 2301.13199v1 link
2023-01-29 BERT-based Authorship Attribution on the Romanian Dataset called ROST Sanda-Maria Avram et.al. 2301.12500v1 null
2023-01-26 Synesthetic Dice: Sensors, Actuators, And Mappings Albrecht Kurze et.al. 2301.11436v1 null
2023-01-26 Cut and Learn for Unsupervised Object Detection and Instance Segmentation Xudong Wang et.al. 2301.11320v1 link
2023-01-25 Reflective Artificial Intelligence Peter R. Lewis et.al. 2301.10823v1 null
2023-01-25 Distilling Text into Circuits Vincent Wang-Mascianica et.al. 2301.10595v1 null
2023-01-24 Capacity Analysis of Vector Symbolic Architectures Kenneth L. Clarkson et.al. 2301.10352v1 null
2023-01-20 Improving Sketch Colorization using Adversarial Segmentation Consistency Samet Hicsonmez et.al. 2301.08590v1 link
2023-01-19 On Finite Blocklength Lossy Source Coding Lin Zhou et.al. 2301.07871v1 null
2023-01-17 Vision Based Machine Learning Algorithms for Out-of-Distribution Generalisation Hamza Riaz et.al. 2301.06975v1 null
2023-01-17 Distribution Aligned Feature Clustering for Zero-Shot Sketch-Based Image Retrieval Yuchen Wu et.al. 2301.06685v1 null
2023-01-16 A Distributed Palette Sparsification Theorem Maxime Flin et.al. 2301.06457v1 null
2023-01-14 Weighted Minwise Hashing Beats Linear Sketching for Inner Product Estimation Aline Bessa et.al. 2301.05811v1 null
2023-01-06 Better Differentially Private Approximate Histograms and Heavy Hitters using the Misra-Gries Sketch Christian Janos Lebeda et.al. 2301.02457v1 null
2023-01-03 EQUI-VOCAL: Synthesizing Queries for Compositional Video Events from Limited User Interactions [Technical Report] Enhao Zhang et.al. 2301.00929v1 link
2023-01-17 Algorithms for Massive Data -- Lecture Notes Nicola Prezza et.al. 2301.00754v2 null
2022-12-28 Modular termination verification with a higher-order concurrent separation logic (Intermediate report) Justus Fasse et.al. 2212.14126v1 null
2022-12-22 A Domain-Extensible Compiler with Controllable Automation of Optimisations Thomas Koehler et.al. 2212.12035v1 null

(back to top)

3D reconstruction

Publish Date Title Authors PDF Code
2024-04-03 Neural Radiance Fields with Torch Units Bingnan Ni et.al. 2404.02617v1 null
2024-04-03 TCLC-GS: Tightly Coupled LiDAR-Camera Gaussian Splatting for Surrounding Autonomous Driving Scenes Cheng Zhao et.al. 2404.02410v1 null
2024-04-03 APC2Mesh: Bridging the gap from occluded building façades to full 3D models Perpetual Hope Akwensi et.al. 2404.02391v1 null
2024-04-01 Neural Implicit Representation for Building Digital Twins of Unknown Articulated Objects Yijia Weng et.al. 2404.01440v1 link
2024-04-01 NVINS: Robust Visual Inertial Navigation Fused with NeRF-augmented Camera Pose Regressor and Uncertainty Quantification Juyeop Han et.al. 2404.01400v1 null
2024-04-01 FPGA-Accelerated Correspondence-free Point Cloud Registration with PointNet Features Keisuke Sugiura et.al. 2404.01237v1 null
2024-04-02 Few-shot point cloud reconstruction and denoising via learned Guassian splats renderings and fine-tuned diffusion features Pietro Bonazzi et.al. 2404.01112v2 null
2024-03-30 DiffHuman: Probabilistic Photorealistic 3D Reconstruction of Humans Akash Sengupta et.al. 2404.00485v1 null
2024-03-30 3DGSR: Implicit Surface Reconstruction with 3D Gaussian Splatting Xiaoyang Lyu et.al. 2404.00409v1 null
2024-03-29 Sparse Views, Near Light: A Practical Paradigm for Uncalibrated Point-light Photometric Stereo Mohammed Brahimi et.al. 2404.00098v1 null
2024-03-29 NeSLAM: Neural Implicit Mapping and Self-Supervised Feature Tracking With Depth Completion and Denoising Tianchen Deng et.al. 2403.20034v1 link
2024-03-28 CoherentGS: Sparse Novel View Synthesis with Coherent 3D Gaussians Avinash Paliwal et.al. 2403.19495v1 null
2024-03-30 Total-Decom: Decomposed 3D Scene Reconstruction with Minimal Interaction Xiaoyang Lyu et.al. 2403.19314v2 link
2024-03-28 Neural Fields for 3D Tracking of Anatomy and Surgical Instruments in Monocular Laparoscopic Video Clips Beerend G. A. Gerats et.al. 2403.19265v1 null
2024-04-01 WALT3D: Generating Realistic Training Data from Time-Lapse Imagery for Reconstructing Dynamic Objects under Occlusion Khiem Vuong et.al. 2403.19022v2 null
2024-03-29 Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction Qiuhong Shen et.al. 2403.18795v2 null
2024-03-29 SplatFace: Gaussian Splat Face Reconstruction Leveraging an Optimizable Surface Jiahao Luo et.al. 2403.18784v2 null
2024-03-27 Breaking the Limitations with Sparse Inputs by Variational Frameworks (BLIss) in Terahertz Super-Resolution 3D Reconstruction Yiyao Zhang et.al. 2403.18776v1 link
2024-03-27 SAT-NGP : Unleashing Neural Graphics Primitives for Fast Relightable Transient-Free 3D reconstruction from Satellite Imagery Camille Billouard et.al. 2403.18711v1 null
2024-03-26 EgoLifter: Open-world 3D Segmentation for Egocentric Perception Qiao Gu et.al. 2403.18118v1 null
2024-03-25 Creating a Digital Twin of Spinal Surgery: A Proof of Concept Jonas Hein et.al. 2403.16736v1 null
2024-03-25 Spike-NeRF: Neural Radiance Field Based On Spike Camera Yijia Guo et.al. 2403.16410v1 null
2024-03-25 Elite360D: Towards Efficient 360 Depth Estimation via Semantic- and Distance-Aware Bi-Projection Fusion Hao Ai et.al. 2403.16376v1 null
2024-03-24 latentSplat: Autoencoding Variational Gaussians for Fast Generalizable 3D Reconstruction Christopher Wewer et.al. 2403.16292v1 null
2024-03-23 UPNeRF: A Unified Framework for Monocular 3D Object Reconstruction and Pose Estimation Yuliang Guo et.al. 2403.15705v1 null
2024-03-22 FastCAD: Real-Time CAD Retrieval and Alignment from Scans and Videos Florian Langer et.al. 2403.15161v1 null
2024-03-22 Recent Trends in 3D Reconstruction of General Non-Rigid Scenes Raza Yunus et.al. 2403.15064v1 null
2024-03-21 Hyperspectral Neural Radiance Fields Gerry Chen et.al. 2403.14839v1 null
2024-03-21 GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation Yinghao Xu et.al. 2403.14621v1 link
2024-03-21 Isotropic Gaussian Splatting for Real-Time Radiance Field Rendering Yuanhao Gong et.al. 2403.14244v1 null
2024-03-21 Leveraging Thermal Modality to Enhance Reconstruction in Low-Light Conditions Jiacong Xu et.al. 2403.14053v1 null
2024-03-20 T-Pixel2Mesh: Combining Global and Local Transformer for 3D Mesh Generation from a Single Image Shijie Zhang et.al. 2403.13663v1 null
2024-03-20 MULAN-WC: Multi-Robot Localization Uncertainty-aware Active NeRF with Wireless Coordination Weiying Wang et.al. 2403.13348v1 null
2024-03-19 GVGEN: Text-to-3D Generation with Volumetric Representation Xianglong He et.al. 2403.12957v1 null
2024-03-19 PostoMETRO: Pose Token Enhanced Mesh Transformer for Robust 3D Human Mesh Recovery Wendi Yang et.al. 2403.12473v1 null
2024-03-18 LN3Diff: Scalable Latent Neural Fields Diffusion for Speedy 3D Generation Yushi Lan et.al. 2403.12019v1 null
2024-03-18 GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image Xiao Fu et.al. 2403.12013v1 null
2024-03-18 SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image using Latent Video Diffusion Vikram Voleti et.al. 2403.12008v1 null
2024-03-18 GNeRP: Gaussian-guided Neural Reconstruction of Reflective Objects with Noisy Polarization Priors LI Yang et.al. 2403.11899v1 null
2024-03-18 OpenOcc: Open Vocabulary 3D Scene Reconstruction via Occupancy Representation Haochen Jiang et.al. 2403.11796v1 null
2024-03-18 Fed3DGS: Scalable 3D Gaussian Splatting with Federated Learning Teppei Suzuki et.al. 2403.11460v1 link
2024-03-18 BAGS: Building Animatable Gaussian Splatting from a Monocular Video with Diffusion Priors Tingyang Zhang et.al. 2403.11427v1 null
2024-03-17 Creating Seamless 3D Maps Using Radiance Fields Sai Tarun Sathyan et.al. 2403.11364v1 null
2024-03-17 Recent Advances in 3D Gaussian Splatting Tong Wu et.al. 2403.11134v1 null
2024-03-17 Omni-Recon: Towards General-Purpose Neural Radiance Fields for Versatile 3D Applications Yonggan Fu et.al. 2403.11131v1 null
2024-03-16 Ctrl123: Consistent Novel View Synthesis via Closed-Loop Transcription Hongxiang Zhao et.al. 2403.10953v1 null
2024-03-15 SCILLA: SurfaCe Implicit Learning for Large Urban Area, a volumetric hybrid solution Hala Djeghim et.al. 2403.10344v1 null
2024-03-15 FDGaussian: Fast Gaussian Splatting from Single Image via Geometric-aware Diffusion Model Qijun Feng et.al. 2403.10242v1 null
2024-03-15 Den-SOFT: Dense Space-Oriented Light Field DataseT for 6-DOF Immersive Experience Xiaohang Yu et.al. 2403.09973v1 null
2024-03-14 MARVIS: Motion & Geometry Aware Real and Virtual Image Segmentation Jiayi Wu et.al. 2403.09850v1 link
2024-03-14 Relaxing Accurate Initialization Constraint for 3D Gaussian Splatting Jaewoo Jung et.al. 2403.09413v1 link
2024-03-13 3DFIRES: Few Image 3D REconstruction for Scenes with Hidden Surface Linyi Jin et.al. 2403.08768v1 null
2024-03-13 Refractive COLMAP: Refractive Structure-from-Motion Revisited Mengkun She et.al. 2403.08640v1 null
2024-03-12 Q-SLAM: Quadric Representations for Monocular SLAM Chensheng Peng et.al. 2403.08125v1 null
2024-03-11 Bayesian Diffusion Models for 3D Shape Reconstruction Haiyang Xu et.al. 2403.06973v1 null
2024-03-08 DITTO: Dual and Integrated Latent Topologies for Implicit 3D Reconstruction Jaehyeok Shim et.al. 2403.05005v1 null
2024-03-11 Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed Yifan Wang et.al. 2403.04765v2 null
2024-03-08 Finding Waldo: Towards Efficient Exploration of NeRF Scene Spaces Evangelos Skartados et.al. 2403.04508v2 null
2024-03-07 CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoors Object Detection from Multi-view Images Guanlin Shen et.al. 2403.04198v1 link
2024-03-05 Pooling Image Datasets With Multiple Covariate Shift and Imbalance Sotirios Panagiotis Chytas et.al. 2403.02598v1 null
2024-03-04 TripoSR: Fast 3D Object Reconstruction from a Single Image Dmitry Tochilkin et.al. 2403.02151v1 link
2024-03-03 A Novel Dynamic Light-Section 3D Reconstruction Method for Wide-Range Sensing Mengjuan Chen et.al. 2403.01374v1 null
2024-03-08 G3DR: Generative 3D Reconstruction in ImageNet Pradyumna Reddy et.al. 2403.00939v2 link
2024-03-01 DISORF: A Distributed Online NeRF Training and Rendering Framework for Mobile Robots Chunlin Li et.al. 2403.00228v1 null
2024-03-05 VEnvision3D: A Synthetic Perception Dataset for 3D Multi-Task Model Research Jiahao Zhou et.al. 2402.19059v2 null
2024-02-27 Sora Generates Videos with Stunning Geometrical Consistency Xuanyi Li et.al. 2402.17403v1 null
2024-02-27 CharNeRF: 3D Character Generation from Concept Art Eddy Chu et.al. 2402.17115v1 null
2024-02-26 DreamUp3D: Object-Centric Generative Models for Single-View 3D Scene Understanding and Real-to-Sim Transfer Yizhe Wu et.al. 2402.16308v1 null
2024-02-25 GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction Xiao Chen et.al. 2402.16174v1 null
2024-02-24 A Generative Machine Learning Model for Material Microstructure 3D Reconstruction and Performance Evaluation Yilin Zheng et.al. 2402.15815v1 null
2024-02-22 Cameras as Rays: Pose Estimation via Ray Diffusion Jason Y. Zhang et.al. 2402.14817v1 null
2024-02-22 Workspace Analysis for Laparoscopic Rectal Surgery : A Preliminary Study Alexandra Thomieres et.al. 2402.14386v1 null
2024-02-22 MVD $^2$ : Efficient Multiview 3D Reconstruction for Multiview Diffusion Xin-Yang Zheng et.al. 2402.14253v1 null
2024-02-20 MVDiffusion++: A Dense High-resolution Multi-view Diffusion Model for Single or Sparse-view 3D Object Reconstruction Shitao Tang et.al. 2402.12712v1 null
2024-02-25 A Robust Error-Resistant View Selection Method for 3D Reconstruction Shaojie Zhang et.al. 2402.11431v2 null
2024-02-17 Dense Matchers for Dense Tracking Tomáš Jelínek et.al. 2402.11287v1 null
2024-02-17 DiffPoint: Single and Multi-view Point Cloud Reconstruction with ViT Based Diffusion Model Yu Feng et.al. 2402.11241v1 null
2024-02-15 Evaluating NeRFs for 3D Plant Geometry Reconstruction in Field Conditions Muhammad Arbab Arshad et.al. 2402.10344v1 null
2024-02-15 GES: Generalized Exponential Splatting for Efficient Radiance Field Rendering Abdullah Hamdi et.al. 2402.10128v1 link
2024-02-14 PC-NeRF: Parent-Child Neural Radiance Fields Using Sparse LiDAR Frames in Autonomous Driving Environments Xiuzhong Hu et.al. 2402.09325v1 link
2024-02-14 DUDF: Differentiable Unsigned Distance Fields with Hyperbolic Scaling Miguel Fainstein et.al. 2402.08876v1 null
2024-02-13 IM-3D: Iterative Multiview Diffusion and Reconstruction for High-Quality 3D Generation Luke Melas-Kyriazi et.al. 2402.08682v1 null
2024-02-20 Camera Calibration through Geometric Constraints from Rotation and Projection Matrices Muhammad Waleed et.al. 2402.08437v2 link
2024-02-09 Neural Rendering based Urban Scene Reconstruction for Autonomous Driving Shihao Shen et.al. 2402.06826v1 null
2024-02-07 Carousel phase retrieval algorithm for 3D coherent X-ray diffraction imaging Fangzhou Ai et.al. 2402.05283v1 link
2024-02-06 EscherNet: A Generative Model for Scalable View Synthesis Xin Kong et.al. 2402.03908v1 link
2024-02-09 MoD-SLAM: Monocular Dense Mapping for Unbounded 3D Scene Reconstruction Heng Zhou et.al. 2402.03762v2 null
2024-02-05 Denoising Diffusion via Image-Based Rendering Titas Anciukevicius et.al. 2402.03445v1 null
2024-02-02 Di-NeRF: Distributed NeRF for Collaborative Learning with Unknown Relative Poses Mahboubeh Asadi et.al. 2402.01485v1 null
2024-02-02 DeepAAT: Deep Automated Aerial Triangulation for Fast UAV-based Mapping Zequan Chen et.al. 2402.01134v1 link
2024-02-01 Enhanced fringe-to-phase framework using deep learning Won-Hoe Kim et.al. 2402.00977v1 null
2024-02-01 Diffusion-based Light Field Synthesis Ruisheng Gao et.al. 2402.00575v1 null
2024-01-31 Local Feature Matching Using Deep Learning: A Survey Shibiao Xu et.al. 2401.17592v1 link
2024-01-30 Self-Supervised Representation Learning for Nerve Fiber Distribution Patterns in 3D-PLI Alexander Oberstrass et.al. 2401.17207v1 null
2024-01-30 Physical Priors Augmented Event-Based 3D Reconstruction Jiaxu Wang et.al. 2401.17121v1 link
2024-01-30 OmniSCV: An Omnidirectional Synthetic Image Generator for Computer Vision Bruno Berenguel-Baeta et.al. 2401.17061v1 link
2024-01-29 Domain adaptation strategies for 3D reconstruction of the lumbar spine using real fluoroscopy data Sascha Jecklin et.al. 2401.16027v1 null
2024-01-29 2L3: Lifting Imperfect Generated 2D Images into Accurate 3D Yizheng Chen et.al. 2401.15841v1 null
2024-01-28 Multi-Person 3D Pose Estimation from Multi-View Uncalibrated Depth Cameras Yu-Jhe Li et.al. 2401.15616v1 null
2024-01-26 3D Reconstruction and New View Synthesis of Indoor Environments based on a Dual Neural Radiance Field Zhenyu Bao et.al. 2401.14726v1 link
2024-01-25 TIFu: Tri-directional Implicit Function for High-Fidelity 3D Character Reconstruction Byoungsung Lim et.al. 2401.14565v1 null
2024-01-25 Range-Agnostic Multi-View Depth Estimation With Keyframe Selection Andrea Conti et.al. 2401.14401v1 link
2024-01-25 pix2gestalt: Amodal Segmentation by Synthesizing Wholes Ege Ozguroglu et.al. 2401.14398v1 link
2024-01-25 GauU-Scene: A Scene Reconstruction Benchmark on Large Scale 3D Reconstruction Dataset Using Gaussian Splatting Butian Xiong et.al. 2401.14032v1 null
2024-01-24 EndoGaussians: Single View Dynamic Gaussian Splatting for Deformable Endoscopic Tissues Reconstruction Yangsen Chen et.al. 2401.13352v1 null
2024-01-23 IRIS: Inverse Rendering of Indoor Scenes from Low Dynamic Range Images Zhi-Hao Lin et.al. 2401.12977v1 null
2024-01-21 A Survey on African Computer Vision Datasets, Topics and Researchers Abdul-Hakeem Omotayo et.al. 2401.11617v1 null
2024-01-21 Multi-View Neural 3D Reconstruction of Micro-/Nanostructures with Atomic Force Microscopy Shuo Chen et.al. 2401.11541v1 link
2024-01-21 Deformable Endoscopic Tissues Reconstruction with Gaussian Splatting Lingting Zhu et.al. 2401.11535v1 link
2024-01-17 POE: Acoustic Soft Robotic Proprioception for Omnidirectional End-effectors Uksang Yoo et.al. 2401.09382v1 null
2024-01-16 Learning Implicit Representation for Reconstructing Articulated Objects Hao Zhang et.al. 2401.08809v1 null
2024-01-20 Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis Zhenhui Ye et.al. 2401.08503v2 null
2024-01-16 S3M: Semantic Segmentation Sparse Mapping for UAVs with RGB-D Camera Thanh Nguyen Canh et.al. 2401.08134v1 null
2024-01-12 3D Reconstruction of Interacting Multi-Person in Clothing from a Single Image Junuk Cha et.al. 2401.06415v1 null
2024-01-12 SD-MVS: Segmentation-Driven Deformation Multi-View Stereo with Spherical Refinement and EM optimization Zhenlong Yuan et.al. 2401.06385v1 null
2024-01-12 Surgical-DINO: Adapter Learning of Foundation Models for Depth Estimation in Endoscopic Surgery Beilei Cui et.al. 2401.06013v2 link
2024-01-10 Structure from Duplicates: Neural Inverse Graphics from a Pile of Objects Tianhang Cheng et.al. 2401.05236v1 link
2024-01-07 RHOBIN Challenge: Reconstruction of Human Object Interaction Xianghui Xie et.al. 2401.04143v1 null
2024-01-08 AGG: Amortized Generative 3D Gaussians for Single Image to 3D Dejia Xu et.al. 2401.04099v1 null
2024-01-08 A Survey on 3D Gaussian Splatting Guikun Chen et.al. 2401.03890v1 null
2024-01-03 S3Net: Innovating Stereo Matching and Semantic Segmentation with a Single-Branch Semantic Stereo Network in Satellite Epipolar Imagery Qingyuan Yang et.al. 2401.01643v1 link
2023-12-29 Informative Rays Selection for Few-Shot Neural Radiance Fields Marco Orsingher et.al. 2312.17561v1 null
2023-12-28 Toward Semantic Scene Understanding for Fine-Grained 3D Modeling of Plants Mohamad Qadri et.al. 2312.17110v1 null
2023-12-28 Learning Spatially Collaged Fourier Bases for Implicit Neural Representation Jason Chun Lok Li et.al. 2312.17018v1 null
2023-12-27 In-Hand 3D Object Reconstruction from a Monocular RGB Video Shijian Jiang et.al. 2312.16425v1 null
2023-12-24 SUNDIAL: 3D Satellite Understanding through Direct, Ambient, and Complex Lighting Decomposition Nikhil Behari et.al. 2312.16215v1 null
2023-12-24 A theory of volumetric representations for opaque solids Bailey Miller et.al. 2312.15406v1 null
2023-12-22 Pola4All: survey of polarimetric applications and an open-source toolkit to analyze polarization Joaquin Rodriguez et.al. 2312.14697v1 link
2023-12-22 Scalable 3D Reconstruction From Single Particle X-Ray Diffraction Images Based on Online Machine Learning Jay Shenoy et.al. 2312.14432v1 null
2023-12-21 PlatoNeRF: 3D Reconstruction in Plato's Cave via Single-View Two-Bounce Lidar Tzofi Klinghoffer et.al. 2312.14239v1 null
2023-12-21 3D Pose Estimation of Two Interacting Hands from a Monocular Event Camera Christen Millerdurai et.al. 2312.14157v1 null
2023-12-21 DUSt3R: Geometric 3D Vision Made Easy Shuzhe Wang et.al. 2312.14132v1 link
2023-12-21 Anatomical basis of sex differences in human post-myocardial infarction ECG phenotypes identified by novel automated torso-cardiac 3D reconstruction Hannah J. Smith et.al. 2312.13976v1 null
2023-12-21 SyncDreamer for 3D Reconstruction of Endangered Animal Species with NeRF and NeuS Ahmet Haydar Ornek et.al. 2312.13832v1 null
2023-12-21 Visual Tomography: Physically Faithful Volumetric Models of Partially Translucent Objects David Nakath et.al. 2312.13494v1 null
2023-12-20 UniSDF: Unifying Neural Representations for High-Fidelity 3D Reconstruction of Complex Scenes with Reflections Fangjinhua Wang et.al. 2312.13285v1 null
2023-12-20 Splatter Image: Ultra-Fast Single-View 3D Reconstruction Stanislaw Szymanowicz et.al. 2312.13150v1 link
2023-12-21 pixelSplat: 3D Gaussian Splats from Image Pairs for Scalable Generalizable 3D Reconstruction David Charatan et.al. 2312.12337v2 link
2023-12-19 EVI-SAM: Robust, Real-time, Tightly-coupled Event-Visual-Inertial State Estimation and 3D Dense Mapping Weipeng Guan et.al. 2312.11911v1 link
2023-12-17 Primitive-based 3D Human-Object Interaction Modelling and Programming Siqi Liu et.al. 2312.10714v1 null
2023-12-16 Triplane Meets Gaussian Splatting: Fast and Generalizable Single-View 3D Reconstruction with Transformers Zi-Xin Zou et.al. 2312.09147v2 null
2023-12-14 Living Scenes: Multi-object Relocalization and Reconstruction in Changing 3D Environments Liyuan Zhu et.al. 2312.09138v1 null
2023-12-14 Scene 3-D Reconstruction System in Scattering Medium Zhuoyifan Zhang et.al. 2312.09005v1 null
2023-12-11 Gaussian Splatting SLAM Hidenobu Matsuki et.al. 2312.06741v1 null
2023-12-10 UNeR3D: Versatile and Scalable 3D RGB Point Cloud Generation from 2D Images in Unsupervised Reconstruction Hongbin Lin et.al. 2312.06706v1 null
2023-12-10 SuperPrimitive: Scene Reconstruction at a Primitive Level Kirill Mazur et.al. 2312.05889v1 null
2023-12-11 Nuvo: Neural UV Mapping for Unruly 3D Representations Pratul P. Srinivasan et.al. 2312.05283v1 null
2023-12-08 Fine Dense Alignment of Image Bursts through Camera Pose and Depth Estimation Bruno Lecouat et.al. 2312.05190v1 null
2023-12-08 SuperNormal: Neural Surface Reconstruction via Multi-View Normal Integration Xu Cao et.al. 2312.04803v1 null
2023-12-07 FitDiff: Robust monocular 3D facial shape and reflectance estimation using Diffusion Models Stathis Galanakis et.al. 2312.04465v1 null
2023-12-06 Gaussian-Flow: 4D Reconstruction with Dynamic 3D Gaussian Particle Youtian Lin et.al. 2312.03431v1 null
2023-12-06 Evaluating the point cloud of individual trees generated from images based on Neural Radiance fields (NeRF) method Hongyu Huang et.al. 2312.03372v1 null
2023-12-06 RING-NeRF: A Versatile Architecture based on Residual Implicit Neural Grids Doriand Petit et.al. 2312.03357v1 null
2023-12-05 ReconFusion: 3D Reconstruction with Diffusion Priors Rundi Wu et.al. 2312.02981v1 null
2023-12-05 R3D-SWIN:Use Shifted Window Attention for Single-View 3D Reconstruction Chenhuan Li et.al. 2312.02725v1 null
2023-12-05 DreaMo: Articulated 3D Reconstruction From A Single Casual Video Tao Tu et.al. 2312.02617v1 null
2023-12-05 Prompt2NeRF-PIL: Fast NeRF Generation via Pretrained Implicit Latent Jianmeng Liu et.al. 2312.02568v1 null
2023-12-03 Slice3D: Multi-Slice, Occlusion-Revealing, Single View 3D Reconstruction Yizhi Wang et.al. 2312.02221v1 null
2023-12-04 Steerers: A framework for rotation equivariant keypoint descriptors Georg Bökman et.al. 2312.02152v1 link
2023-12-04 iMatching: Imperative Correspondence Learning Zitong Zhan et.al. 2312.02141v1 null
2023-12-04 Light Field Imaging in the Restrictive Object Space based on Flexible Angular Plane Ping Zhou et.al. 2312.01761v1 null
2023-12-02 RNb-NeuS: Reflectance and Normal-based Multi-View 3D Reconstruction Baptiste Brument et.al. 2312.01215v1 link
2023-12-05 Self-Evolving Neural Radiance Fields Jaewoo Jung et.al. 2312.01003v2 link
2023-12-01 NeuSG: Neural Implicit Surface Reconstruction with 3D Gaussian Splatting Guidance Hanlin Chen et.al. 2312.00846v1 null
2023-12-01 UAVs and Birds: Enhancing Short-Range Navigation through Budgerigar Flight Studies Md. Mahmudur Rahman et.al. 2312.00597v1 null
2023-11-30 Learning One-Shot 4D Head Avatar Synthesis using Synthetic Data Yu Deng et.al. 2311.18729v1 null
2023-11-30 Multi-task learning with cross-task consistency for improved depth estimation in colonoscopy Pedro Esteban Chavarrias Solano et.al. 2311.18664v1 null
2023-11-30 HOLD: Category-agnostic 3D Reconstruction of Interacting Hands and Objects from Video Zicong Fan et.al. 2311.18448v1 link
2023-11-29 Volumetric Cloud Field Reconstruction Jacob Lin et.al. 2311.17657v1 null
2023-11-30 REF $^2$ -NeRF: Reflection and Refraction aware Neural Radiance Field Wooseok Kim et.al. 2311.17116v2 link
2023-11-28 Multi-Scale 3D Gaussian Splatting for Anti-Aliased Rendering Zhiwen Yan et.al. 2311.17089v1 null
2023-11-28 Surf-D: High-Quality Surface Generation for Arbitrary Topologies using Diffusion Models Zhengming Yu et.al. 2311.17050v1 null
2023-11-28 Gradient-based Local Next-best-view Planning for Improved Perception of Targeted Plant Nodes Akshay K. Burusa et.al. 2311.16759v1 null
2023-11-28 RGBGrasp: Image-based Object Grasping by Capturing Multiple Views during Robot Arm Movement with Neural Radiance Fields Chang Liu et.al. 2311.16592v1 null
2023-11-28 Rethinking Directional Integration in Neural Radiance Fields Congyue Deng et.al. 2311.16504v1 null
2023-11-27 Weakly-Supervised 3D Reconstruction of Clothed Humans via Normal Maps Jane Wu et.al. 2311.16042v1 null
2023-11-27 SiTH: Single-view Textured Human Reconstruction with Image-Conditioned Diffusion Hsuan-I Ho et.al. 2311.15855v1 null
2023-11-26 Obj-NeRF: Extract Object NeRFs from Multi-view Images Zhiyi Li et.al. 2311.15291v1 null
2023-11-25 Multi-task Planar Reconstruction with Feature Warping Guidance Luan Wei et.al. 2311.14981v1 link
2023-11-24 RSB-Pose: Robust Short-Baseline Binocular 3D Human Pose Estimation with Occlusion Handling Xiaoyue Wan et.al. 2311.14242v1 null
2023-11-23 GigaPose: Fast and Robust Novel Object Pose Estimation via One Correspondence Van Nguyen Nguyen et.al. 2311.14155v1 link
2023-11-23 MonoNav: MAV Navigation via Monocular Depth Estimation and Reconstruction Nathaniel Simon et.al. 2311.14100v1 null
2023-11-23 DRIFu: Differentiable Rendering and Implicit Function-based Single-View 3D Reconstruction Zijian Kuang et.al. 2311.13199v2 link
2023-11-22 Differentiable Radio Frequency Ray Tracing for Millimeter-Wave Sensing Xingyu Chen et.al. 2311.13182v1 null
2023-11-21 Physics-guided Shape-from-Template: Monocular Video Perception through Neural Surrogate Models David Stotko et.al. 2311.12796v1 link
2023-11-20 Mixing-Denoising Generalizable Occupancy Networks Amine Ouasfi et.al. 2311.12125v1 null
2023-11-23 PF-LRM: Pose-Free Large Reconstruction Model for Joint Pose and Shape Prediction Peng Wang et.al. 2311.12024v2 null
2023-11-19 GaussianDiffusion: 3D Gaussian Splatting for Denoising Diffusion Probabilistic Models with Structured Noise Xinhai Li et.al. 2311.11221v1 null
2023-11-18 LOSTU: Fast, Scalable, and Uncertainty-Aware Triangulation Sébastien Henry et.al. 2311.11171v1 null
2023-11-18 Invariant-based Mapping of Space During General Motion of an Observer Juan D. Yepes et.al. 2311.11130v1 null
2023-11-16 DSR-Diff: Depth Map Super-Resolution with Diffusion Model Yuan Shi et.al. 2311.09919v1 null
2023-11-18 EvaSurf: Efficient View-Aware Implicit Textured Surface Reconstruction on Mobile Devices Jingnan Gao et.al. 2311.09806v2 null
2023-11-14 DynamicSurf: Dynamic Neural RGB-D Surface Reconstruction with an Optimizable Feature Grid Mirgahney Mohamed et.al. 2311.08159v1 null
2023-11-13 $L_0$-Sampler: An $L_{0}$ Model Guided Volume Sampling for NeRF Liangchen Li et.al. 2311.07044v1 null
2023-11-11 3DFusion, A real-time 3D object reconstruction pipeline based on streamed instance segmented data Xi Sun et.al. 2311.06659v1 null
2023-11-09 ConRad: Image Constrained Radiance Fields for 3D Generation from a Single Image Senthil Purushwalkam et.al. 2311.05230v1 null
2023-11-08 Implicit Neural Representations for Breathing-compensated Volume Reconstruction in Robotic Ultrasound Aorta Screening Yordanka Velikova et.al. 2311.04999v1 null
2023-11-08 LRM: Large Reconstruction Model for Single Image to 3D Yicong Hong et.al. 2311.04400v1 null
2023-11-07 High-fidelity 3D Reconstruction of Plants using Neural Radiance Field Kewei Hu et.al. 2311.04154v1 null
2023-11-07 DeepPatent2: A Large-Scale Benchmarking Corpus for Technical Drawing Understanding Kehinde Ajayi et.al. 2311.04098v1 link
2023-11-05 MuSHRoom: Multi-Sensor Hybrid Room Dataset for Joint 3D Reconstruction and Novel View Synthesis Xuqian Ren et.al. 2311.02778v1 null
2023-11-05 Fast Point-cloud to Mesh Reconstruction for Deformable Object Tracking Elham Amin Mansour et.al. 2311.02749v1 null
2023-11-05 IPVNet: Learning Implicit Point-Voxel Features for Open-Surface 3D Reconstruction Mohammad Samiul Arshad et.al. 2311.02552v1 link
2023-11-02 CADSim: Robust and Scalable in-the-wild 3D Reconstruction for Controllable Sensor Simulation Jingkang Wang et.al. 2311.01447v1 null
2023-11-02 Look at Robot Base Once: Hand-Eye Calibration with Point Clouds of Robot Base Leveraging Learning-Based 3D Vision Leihui Li et.al. 2311.01335v1 link
2023-11-02 Joint 3D Shape and Motion Estimation from Rolling Shutter Light-Field Images Hermes McGriff et.al. 2311.01292v1 link
2023-11-01 Single-view 3D Scene Reconstruction with High-fidelity Shape and Texture Yixin Chen et.al. 2311.00457v1 null
2023-10-31 Deep Compressed Learning for 3D Seismic Inversion Maayan Gelboim et.al. 2311.00107v1 null
2023-10-31 Refined Equivalent Pinhole Model for Large-scale 3D Reconstruction from Spaceborne CCD Imagery Hong Danyang et.al. 2310.20117v1 null
2023-10-29 3DMiner: Discovering Shapes from Large-Scale Unannotated Image Datasets Ta-Ying Cheng et.al. 2310.19188v1 null
2023-10-25 Open-NeRF: Towards Open Vocabulary NeRF Decomposition Hao Zhang et.al. 2310.16383v1 null
2023-10-23 Novel-View Acoustic Synthesis from 3D Reconstructed Rooms Byeongjoo Ahn et.al. 2310.15130v1 link
2023-10-23 Interaction-Driven Active 3D Reconstruction with Object Interiors Zihao Yan et.al. 2310.14700v1 null
2023-10-23 VQ-NeRF: Vector Quantization Enhances Implicit Neural Representations Yiying Yang et.al. 2310.14487v1 null
2023-10-22 A Quantitative Evaluation of Dense 3D Reconstruction of Sinus Anatomy from Monocular Endoscopic Video Jan Emily Mangulabnan et.al. 2310.14364v1 null
2023-10-20 Single-view 3D reconstruction via inverse procedural modeling Albert Garifullin et.al. 2310.13373v1 null
2023-10-20 UE4-NeRF:Neural Radiance Field for Real-Time Rendering of Large-Scale Scene Jiaming Gu et.al. 2310.13263v1 null
2023-10-19 Real space iterative reconstruction for vector tomography (RESIRE-V) Minh Pham et.al. 2310.12513v1 link
2023-10-18 ShapeGraFormer: GraFormer-Based Network for Hand-Object Reconstruction from a Single Depth Map Ahmed Tawfik Aboukhadra et.al. 2310.11811v1 null
2023-10-17 Learning Neural Implicit through Volume Rendering with Attentive Depth Fusion Priors Pengchong Hu et.al. 2310.11598v1 null
2023-10-17 Field Robot for High-throughput and High-resolution 3D Plant Phenotyping Felix Esser et.al. 2310.11516v1 null
2023-10-16 In-Situ Single Particle Reconstruction Reveals 3D Evolution of PtNi Nanocatalysts During Heating Yi-Chi Wang et.al. 2310.10253v1 null
2023-10-15 Tabletop Transparent Scene Reconstruction via Epipolar-Guided Optical Flow with Monocular Depth Completion Prior Xiaotong Chen et.al. 2310.09956v1 null
2023-10-15 CBARF: Cascaded Bundle-Adjusting Neural Radiance Fields from Imperfect Camera Poses Hongyu Fu et.al. 2310.09776v1 null
2023-10-12 Implicit Shape and Appearance Priors for Few-Shot Full Head Reconstruction Pol Caselles et.al. 2310.08784v1 null
2023-10-13 PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm Haoyi Zhu et.al. 2310.08586v2 link
2023-10-12 Consistent123: Improve Consistency for One Image to 3D Object Synthesis Haohan Weng et.al. 2310.08092v1 null
2023-10-10 SketchBodyNet: A Sketch-Driven Multi-faceted Decoder Network for 3D Human Reconstruction Fei Wang et.al. 2310.06577v1 link
2023-10-08 Experiences with CAMRE: Single-Device Collaborative Adaptive Mixed Reality Environment Hung-Jui Guo et.al. 2310.04996v1 null
2023-10-02 PC-NeRF: Parent-Child Neural Radiance Fields under Partial Sensor Data Loss in Autonomous Driving Environments Xiuzhong Hu et.al. 2310.00874v1 link
2023-10-01 Enabling Neural Radiance Fields (NeRF) for Large-scale Aerial Images -- A Multi-tiling Approaching and the Geometry Assessment of NeRF Ningli Xu et.al. 2310.00530v1 null
2023-09-29 3D Reconstruction in Noisy Agricultural Environments: A Bayesian Optimization Perspective for View Planning Athanasios Bacharis et.al. 2310.00145v1 null
2023-09-29 Effect of structure-based training on 3D localization precision and quality Armin Abdehkakha et.al. 2309.17265v1 null
2023-09-28 Sketch2CADScript: 3D Scene Reconstruction from 2D Sketch using Visual Transformer and Rhino Grasshopper Hong-Bin Yang et.al. 2309.16850v1 null
2023-09-29 3D Reconstruction with Generalizable Neural Fields using Scene Priors Yang Fu et.al. 2309.15164v2 null
2023-09-26 Combining optical diffraction tomography with imaging flow cytometry for characterizing morphology, hemoglobin content, and membrane deformability of live red blood cells Yu-Hsiang Chang et.al. 2309.15131v1 null
2023-09-26 PHRIT: Parametric Hand Representation with Implicit Template Zhisheng Huang et.al. 2309.14916v1 null
2023-09-26 Unsupervised Reconstruction of 3D Human Pose Interactions From 2D Poses Alone Peter Hardy et.al. 2309.14865v1 null
2023-09-26 3D Density-Gradient based Edge Detection on Neural Radiance Fields (NeRFs) for Geometric Reconstruction Miriam Jäger et.al. 2309.14800v1 null
2023-09-23 MP-MVS: Multi-Scale Windows PatchMatch and Planar Prior Multi-View Stereo Rongxuan Tan et.al. 2309.13294v1 link
2023-09-22 NeRRF: 3D Reconstruction and View Synthesis for Transparent and Specular Objects with Neural Refractive-Reflective Fields Xiaoxue Chen et.al. 2309.13039v1 link
2023-09-25 Language-driven Object Fusion into Neural Radiance Fields with Pose-Conditioned Dataset Updates Ka Chun Shum et.al. 2309.11281v2 link
2023-09-19 PLVS: A SLAM System with Points, Lines, Volumetric Mapping, and 3D Incremental Segmentation Luigi Freda et.al. 2309.10896v1 link
2023-09-19 SHOWMe: Benchmarking Object-agnostic Hand-Object 3D Reconstruction Anilkumar Swamy et.al. 2309.10748v1 null
2023-09-18 Improving Neural Indoor Surface Reconstruction with Mask-Guided Adaptive Consistency Constraints Xinyi Yu et.al. 2309.09739v1 null
2023-09-18 Robust Geometry-Preserving Depth Estimation Using Differentiable Rendering Chi Zhang et.al. 2309.09724v1 null
2023-09-17 Uncertainty-aware 3D Object-Level Mapping with Deep Shape Priors Ziwei Liao et.al. 2309.09118v1 null
2023-09-13 Exploiting Multiple Priors for Neural 3D Indoor Reconstruction Federico Lincetto et.al. 2309.07021v1 null
2023-09-12 Semantic and Articulated Pedestrian Sensing Onboard a Moving Vehicle Maria Priisalu et.al. 2309.06313v1 null
2023-09-11 A survey on real-time 3D scene reconstruction with SLAM methods in embedded systems Quentin Picard et.al. 2309.05349v1 null
2023-09-07 A Food Package Recognition and Sorting System Based on Structured Light and Deep Learning Xuanzhi Liu et.al. 2309.03704v1 null
2023-09-06 SADIR: Shape-Aware Diffusion Models for 3D Image Reconstruction Nivetha Jayakumar et.al. 2309.03335v1 null
2023-09-06 Sparse 3D Reconstruction via Object-Centric Ray Sampling Llukman Cerkezi et.al. 2309.03008v1 link
2023-09-06 Multi-log grasping using reinforcement learning and virtual visual servoing Erik Wallin et.al. 2309.02997v1 null
2023-09-06 LightNeuS: Neural Surface Reconstruction in Endoscopy using Illumination Decline Víctor M. Batlle et.al. 2309.02777v1 null
2023-09-05 GO-SLAM: Global Optimization for Consistent 3D Instant Reconstruction Youmin Zhang et.al. 2309.02436v1 link
2023-09-05 Doppelgangers: Learning to Disambiguate Images of Similar Structures Ruojin Cai et.al. 2309.02420v1 link
2023-09-05 TiAVox: Time-aware Attenuation Voxels for Sparse-view 4D DSA Reconstruction Zhenghong Zhou et.al. 2309.02318v1 null
2023-09-05 Iterative Superquadric Recomposition of 3D Objects from Multiple Views Stephan Alaniz et.al. 2309.02102v1 link
2023-09-01 Dense Voxel 3D Reconstruction Using a Monocular Event Camera Haodong Chen et.al. 2309.00385v1 null
2023-08-24 Improving NeRF Quality by Progressive Camera Placement for Unrestricted Navigation in Complex Environments Georgios Kopanas et.al. 2309.00014v1 null
2023-08-29 Intensity correlation holography for remote phase sensing and 3D imaging Guillaume Thekkadath et.al. 2308.15619v1 null
2023-08-28 R3D3: Dense 3D Reconstruction of Dynamic Scenes from Multiple Cameras Aron Schmied et.al. 2308.14713v1 null
2023-08-27 Sparse3D: Distilling Multiview-Consistent Diffusion for Object Reconstruction from Sparse Views Zi-Xin Zou et.al. 2308.14078v1 null
2023-08-26 HoloPOCUS: Portable Mixed-Reality 3D Ultrasound Tracking, Reconstruction and Overlay Kian Wei Ng et.al. 2308.13823v1 null
2023-08-25 Textureless Deformable Surface Reconstruction with Invisible Markers Xinyuan Li et.al. 2308.13678v1 null
2023-08-23 ARF-Plus: Controlling Perceptual Factors in Artistic Radiance Fields for 3D Scene Stylization Wenzhao Li et.al. 2308.12452v1 null
2023-08-21 Coordinate Quantized Neural Implicit Representations for Multi-view Reconstruction Sijia Jiang et.al. 2308.11025v1 link
2023-08-19 Root Pose Decomposition Towards Generic Non-rigid 3D Reconstruction with Monocular Videos Yikai Wang et.al. 2308.10089v1 null
2023-08-19 TSAR-MVS: Textureless-aware Segmentation and Correlative Refinement Guided Multi-View Stereo Zhenlong Yuan et.al. 2308.09990v1 null
2023-08-19 A Theory of Topological Derivatives for Inverse Rendering of Geometry Ishit Mehta et.al. 2308.09865v1 null
2023-08-18 O^2-Recon: Completing 3D Reconstruction of Occluded Objects in the Scene with a Pre-trained 2D Diffusion Model Yubin Hu et.al. 2308.09591v1 link
2023-08-17 A Fusion of Variational Distribution Priors and Saliency Map Replay for Continual 3D Reconstruction Sanchar Palit et.al. 2308.08812v1 null
2023-08-17 Long-Range Grouping Transformer for Multi-View 3D Reconstruction Liying Yang et.al. 2308.08724v1 link
2023-08-16 DeDoDe: Detect, Don't Describe -- Describe, Don't Detect for Local Feature Matching Johan Edstedt et.al. 2308.08479v1 link
2023-08-17 ObjectSDF++: Improved Object-Compositional Neural Implicit Surfaces Qianyi Wu et.al. 2308.07868v2 link
2023-08-15 CCD-3DR: Consistent Conditioning in Diffusion for Single-Image 3D Reconstruction Yan Di et.al. 2308.07837v1 null
2023-08-15 Multi-view 3D Face Reconstruction Based on Flame Wenzhuo Zheng et.al. 2308.07551v1 null
2023-08-14 A One Stop 3D Target Reconstruction and multilevel Segmentation Method Jiexiong Xu et.al. 2308.06974v1 link
2023-08-11 Efficient Large-scale AUV-based Visual Seafloor Mapping Mengkun She et.al. 2308.06147v1 null
2023-08-10 PlankAssembly: Robust 3D Reconstruction from Three Orthographic Views with Learnt Shape Programs Wentao Hu et.al. 2308.05744v1 link
2023-08-10 HGDNet: A Height-Hierarchy Guided Dual-Decoder Network for Single View Building Extraction and Height Estimation Chaoran Lu et.al. 2308.05387v1 null
2023-08-07 Learning Photometric Feature Transform for Free-form Object Scan Xiang Feng et.al. 2308.03492v1 null
2023-08-04 Reconstructing Three-Dimensional Models of Interacting Humans Mihai Fieraru et.al. 2308.01854v2 link
2023-08-02 HANDAL: A Dataset of Real-World Manipulable Object Categories with Pose Annotations, Affordances, and Reconstructions Andrew Guo et.al. 2308.01477v1 null
2023-08-15 Tirtha -- An Automated Platform to Crowdsource Images and Create 3D Models of Heritage Sites Jyotirmaya Shivottam et.al. 2308.01246v2 link
2023-08-02 Stereo Visual Odometry with Deep Learning-Based Point and Line Feature Matching using an Attention Graph Neural Network Shenbagaraj Kannapiran et.al. 2308.01125v1 null
2023-08-01 Body Knowledge and Uncertainty Modeling for Monocular 3D Human Body Reconstruction Yufei Zhang et.al. 2308.00799v1 null
2023-07-31 Onboard View Planning of a Flying Camera for High Fidelity 3D Reconstruction of a Moving Actor Qingyuan Jiang et.al. 2308.00134v1 link
2023-07-21 Autonomous Electron Tomography Reconstruction with Machine Learning William Millsaps et.al. 2308.00099v1 null
2023-07-31 Towards Head Computed Tomography Image Reconstruction Standardization with Deep Learning Assisted Automatic Detection Bowen Zheng et.al. 2307.16440v1 null
2023-07-27 FS-Depth: Focal-and-Scale Depth Estimation from a Single Image in Unseen Indoor Scene Chengrui Wei et.al. 2307.14624v1 null
2023-07-27 Physically Plausible 3D Human-Scene Reconstruction from Monocular RGB Image using an Adversarial Learning Approach Sandika Biswas et.al. 2307.14570v1 null
2023-07-27 Creative Birds: Self-Supervised Single-View 3D Style Transfer Renke Wang et.al. 2307.14127v2 link
2023-07-24 CarPatch: A Synthetic Benchmark for Radiance Field Evaluation on Vehicle Components Davide Di Nucci et.al. 2307.12718v1 null
2023-07-24 VIRD: Immersive Match Video Analysis for High-Performance Badminton Coaching Tica Lin et.al. 2307.12539v1 link
2023-07-23 LIST: Learning Implicitly from Spatial Transformers for Single-View 3D Reconstruction Mohammad Samiul Arshad et.al. 2307.12194v1 link
2023-07-22 Replay: Multi-modal Multi-view Acted Videos for Casual Holography Roman Shapovalov et.al. 2307.12067v1 link
2023-07-20 SimCol3D -- 3D Reconstruction during Colonoscopy Challenge Anita Rau et.al. 2307.11261v1 link
2023-07-14 Transient Neural Radiance Fields for Lidar View Synthesis and 3D Reconstruction Anagh Malik et.al. 2307.09555v1 null
2023-07-18 NU-MCC: Multiview Compressive Coding with Neighborhood Decoder and Repulsive UDF Stefan Lionar et.al. 2307.09112v1 link
2023-07-16 Enforcing Topological Interaction between Implicit Surfaces via Uniform Sampling Hieu Le et.al. 2307.08716v1 null
2023-07-13 Bag of Views: An Appearance-based Approach to Next-Best-View Planning for 3D Reconstruction Sara Hatami Gazani et.al. 2307.05832v2 link
2023-07-11 3D detection of roof sections from a single satellite image and application to LOD2-building reconstruction Johann Lussange et.al. 2307.05409v1 null
2023-07-08 MAP-NBV: Multi-agent Prediction-guided Next-Best-View Planning for Active 3D Object Reconstruction Harnaik Dhami et.al. 2307.04004v1 null
2023-07-07 Depth Estimation Analysis of Orthogonally Divergent Fisheye Cameras with Distortion Removal Matvei Panteleev et.al. 2307.03602v1 null
2023-07-07 RGB-D Mapping and Tracking in a Plenoxel Radiance Field Andreas L. Teigen et.al. 2307.03404v1 link
2023-07-04 User-Friendly Safety Monitoring System for Manufacturing Cobots Ye-Ji Mun et.al. 2307.01886v1 null
2023-06-29 One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization Minghua Liu et.al. 2306.16928v1 link
2023-06-23 LightGlue: Local Feature Matching at Light Speed Philipp Lindenberger et.al. 2306.13643v1 link
2023-06-24 3D Reconstruction of Spherical Images based on Incremental Structure from Motion San Jiang et.al. 2306.12770v2 link
2023-06-26 Infinite Photorealistic Worlds using Procedural Generation Alexander Raistrick et.al. 2306.09310v2 null
2023-06-15 NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations Varun Jampani et.al. 2306.09109v1 link
2023-06-15 Enhancing Neural Rendering Methods with Image Augmentations Juan C. Pérez et.al. 2306.08904v1 null
2023-06-14 Learning to Predict Scene-Level Implicit 3D from Posed RGBD Data Nilesh Kulkarni et.al. 2306.08671v1 null
2023-06-13 Viewset Diffusion: (0-)Image-Conditioned 3D Generative Models from 2D Data Stanislaw Szymanowicz et.al. 2306.07881v1 null
2023-06-12 Reconstructing Heterogeneous Cryo-EM Molecular Structures by Decomposing Them into Polymer Chains Bongjin Koo et.al. 2306.07274v1 null
2023-06-10 3D reconstruction using Structure for Motion Kshitij Karnawat et.al. 2306.06360v1 link
2023-06-15 NERFBK: A High-Quality Benchmark for NERF-Based 3D Reconstruction Ali Karami et.al. 2306.06300v2 link
2023-06-12 Neural Haircut: Prior-Guided Strand-Based Hair Reconstruction Vanessa Sklyarova et.al. 2306.05872v2 link
2023-06-08 2D Supervised Monocular 3D Object Detection by Global-to-Local 3D Reconstruction Jiawei He et.al. 2306.05418v1 null
2023-06-08 Enhance-NeRF: Multiple Performance Evaluation for Neural Radiance Fields Qianqiu Tan et.al. 2306.05303v1 link
2023-06-07 BU-CVKit: Extendable Computer Vision Framework for Species Independent Tracking and Analysis Mahir Patel et.al. 2306.04736v1 null
2023-06-09 DiViNeT: 3D Reconstruction from Disparate Views via Neural Template Regularization Aditya Vora et.al. 2306.04699v2 null
2023-06-05 BeyondPixels: A Comprehensive Review of the Evolution of Neural Radiance Fields AKM Shahariar Azad Rabby et.al. 2306.03000v1 null
2023-06-05 Single-Stage 3D Geometry-Preserving Depth Estimation Model Training on Dataset Mixtures with Uncalibrated Stereo Data Nikolay Patakin et.al. 2306.02878v1 null
2023-06-05 Computational 3D topographic microscopy from terabytes of data per sample Kevin C. Zhou et.al. 2306.02634v1 null
2023-06-08 Adaptive Robotic Information Gathering via Non-Stationary Gaussian Processes Weizhe Chen et.al. 2306.01263v2 link
2023-06-01 BUOL: A Bottom-Up Framework with Occupancy-aware Lifting for Panoptic 3D Scene Reconstruction From A Single Image Tao Chu et.al. 2306.00965v1 link
2023-05-31 Humans in 4D: Reconstructing and Tracking Humans with Transformers Shubham Goel et.al. 2305.20091v1 link
2023-05-30 Template-free Articulated Neural Point Clouds for Reposable View Synthesis Lukas Uzolas et.al. 2305.19065v1 link
2023-05-29 Synfeal: A Data-Driven Simulator for End-to-End Camera Localization Daniel Coelho et.al. 2305.18260v1 link
2023-06-04 VoxDet: Voxel Learning for Novel Instance Detection Bowen Li et.al. 2305.17220v3 link
2023-05-25 Look Ma, No Hands! Agent-Environment Factorization of Egocentric Videos Matthew Chang et.al. 2305.16301v1 null
2023-05-25 Domain-Adaptive Full-Face Gaze Estimation via Novel-View-Synthesis and Feature Disentanglement Jiawei Qin et.al. 2305.16140v1 null
2023-05-25 Robust Category-Level 3D Pose Estimation from Synthetic Data Jiahao Yang et.al. 2305.16124v1 null
2023-05-25 T2TD: Text-3D Generation Model based on Prior Knowledge Guidance Weizhi Nie et.al. 2305.15753v1 null
2023-05-23 Cross3DVG: Baseline and Dataset for Cross-Dataset 3D Visual Grounding on Different RGB-D Scans Taiki Miyanishi et.al. 2305.13876v1 link
2023-05-22 A three-dimensional MR-STAT protocol for high-resolution multi-parametric quantitative MRI Hongyan Liu et.al. 2305.13022v1 null
2023-05-29 Chupa: Carving 3D Clothed Humans from Skinned Shape Priors using 2D Diffusion Probabilistic Models Byungjun Kim et.al. 2305.11870v2 link
2023-05-19 Text2NeRF: Text-Driven 3D Scene Generation with Neural Radiance Fields Jingbo Zhang et.al. 2305.11588v1 link
2023-05-19 RGB-D And Thermal Sensor Fusion: A Systematic Literature Review Martin Brenner et.al. 2305.11427v1 null
2023-05-18 Progressive Learning of 3D Reconstruction Network from 2D GAN Data Aysegul Dundar et.al. 2305.11102v1 null
2023-05-18 ConsistentNeRF: Enhancing Neural Radiance Fields with 3D Consistency for Sparse View Synthesis Shoukang Hu et.al. 2305.11031v1 link
2023-05-17 Colonoscopy Coverage Revisited: Identifying Scanning Gaps in Real-Time G. Leifman et.al. 2305.10026v1 null
2023-05-15 AutoRecon: Automated 3D Object Discovery and Reconstruction Yuang Wang et.al. 2305.08810v1 null
2023-05-11 Towards a Better Understanding of the Computer Vision Research Community in Africa Abdul-Hakeem Omotayo et.al. 2305.06773v1 null
2023-05-10 Scan2LoD3: Reconstructing semantic 3D building models at LoD3 using ray casting and Bayesian networks Olaf Wysocki et.al. 2305.06314v1 null
2023-05-08 RelPose++: Recovering 6D Poses from Sparse-view Observations Amy Lin et.al. 2305.04926v1 link
2023-05-04 UrbanBIS: a Large-scale Benchmark for Fine-grained Urban Building Instance Segmentation Guoqing Yang et.al. 2305.02627v1 null
2023-05-03 Biological Hotspot Mapping in Coral Reefs with Robotic Visual Surveys Daniel Yang et.al. 2305.02330v1 link
2023-04-30 Second-order Anisotropic Gaussian Directional Derivative Filters for Blob Detection Jie Ren et.al. 2305.00435v1 null
2023-04-29 NSLF-OL: Online Learning of Neural Surface Light Fields alongside Real-time Incremental 3D Reconstruction Yijun Yuan et.al. 2305.00282v1 null
2023-04-23 UHRNet: A Deep Learning-Based Method for Accurate 3D Reconstruction from a Single Fringe-Pattern Yixiao Wang et.al. 2304.14503v1 link
2023-04-27 Learning Articulated Shape with Keypoint Pseudo-labels from Web Images Anastasis Stathopoulos et.al. 2304.14396v1 null
2023-05-03 Combining HoloLens with Instant-NeRFs: Advanced Real-Time 3D Mobile Mapping Dennis Haitz et.al. 2304.14301v2 null
2023-04-25 Shape-Net: Room Layout Estimation from Panoramic Images Robust to Occlusion using Knowledge Distillation with 3D Shapes as Additional Inputs Mizuki Tabata et.al. 2304.12624v1 null
2023-04-24 Instant-3D: Instant Neural Radiance Field Training Towards On-Device AR/VR 3D Reconstruction Sixu Li et.al. 2304.12467v1 null
2023-04-24 Unsupervised Style-based Explicit 3D Face Reconstruction from Single Image Heng Yu et.al. 2304.12455v1 null
2023-04-24 gSDF: Geometry-Driven Signed Distance Functions for 3D Hand-Object Reconstruction Zerui Chen et.al. 2304.11970v1 null
2023-04-24 Learning Visibility Field for Detailed 3D Human Reconstruction and Relighting Ruichen Zheng et.al. 2304.11900v1 null
2023-04-24 NoiseTrans: Point Cloud Denoising with Transformers Guangzhe Hou et.al. 2304.11812v1 null
2023-04-20 A Comparative Neural Radiance Field (NeRF) 3D Analysis of Camera Poses from HoloLens Trajectories and Structure from Motion Miriam Jäger et.al. 2304.10664v1 null
2023-04-20 Reconstructing Signing Avatars From Video Using Linguistic Priors Maria-Paola Forte et.al. 2304.10482v1 null
2023-04-19 Anything-3D: Towards Single-view Anything Reconstruction in the Wild Qiuhong Shen et.al. 2304.10261v1 link
2023-04-20 A geometry-aware deep network for depth estimation in monocular endoscopy Yongming Yang et.al. 2304.10241v1 link
2023-04-19 Tetra-NeRF: Representing Neural Radiance Fields Using Tetrahedra Jonas Kulhanek et.al. 2304.09987v1 link
2023-04-20 Single-View View Synthesis with Self-Rectified Pseudo-Stereo Yang Zhou et.al. 2304.09527v2 null
2023-04-19 3 Dimensional Dense Reconstruction: A Review of Algorithms and Dataset Yangming Li et.al. 2304.09371v1 null
2023-04-18 SurfelNeRF: Neural Surfel Radiance Fields for Online Photorealistic Reconstruction of Indoor Scenes Yiming Gao et.al. 2304.08971v1 null
2023-04-17 Learning How To Robustly Estimate Camera Pose in Endoscopic Videos Michel Hayoz et.al. 2304.08023v1 link
2023-04-15 Temporally Consistent Online Depth Estimation Using Point-Based Fusion Numair Khan et.al. 2304.07435v1 link
2023-04-17 Single-Stage Diffusion NeRF: A Unified Approach to 3D Generation and Reconstruction Hansheng Chen et.al. 2304.06714v2 link
2023-04-12 SiLK -- Simple Learned Keypoints Pierre Gleize et.al. 2304.06194v1 link
2023-04-12 Dynamic Voxel Grid Optimization for High-Fidelity RGB-D Supervised Surface Reconstruction Xiangyu Xu et.al. 2304.06178v1 null
2023-04-11 EvAC3D: From Event-based Apparent Contours to 3D Models via Continuous Visual Hulls Ziyun Wang et.al. 2304.05296v1 link
2023-04-10 Neural Lens Modeling Wenqi Xian et.al. 2304.04848v1 null
2023-04-10 Evaluate Geometry of Radiance Field with Low-frequency Color Prior Qihang Fang et.al. 2304.04351v1 link
2023-04-11 Analysis of Sampling Strategies for Implicit 3D Reconstruction Q. Liu et.al. 2304.03999v2 null
2023-04-08 3D GANs and Latent Space: A comprehensive survey Satya Pratheek Tata et.al. 2304.03932v1 null
2023-04-08 Photometric Correction for Infrared Sensors Jincheng Zhang et.al. 2304.03930v1 null
2023-04-07 ALIKED: A Lighter Keypoint and Descriptor Extraction Network via Deformable Transformation Xiaoming Zhao et.al. 2304.03608v1 link
2023-04-06 Neural Fields meet Explicit Geometric Representation for Inverse Rendering of Urban Scenes Zian Wang et.al. 2304.03266v1 null
2023-04-06 DeLiRa: Self-Supervised Depth, Light, and Radiance Fields Vitor Guizilini et.al. 2304.02797v1 null
2023-04-05 Image Stabilization for Hololens Camera in Remote Collaboration Gowtham Senthil et.al. 2304.02736v1 null
2023-04-05 Real-Time Dense 3D Mapping of Underwater Environments Weihan Wang et.al. 2304.02704v1 link
2023-04-04 USTC FLICAR: A Multisensor Fusion Dataset of LiDAR-Inertial-Camera for Heavy-duty Autonomous Aerial Work Robots Ziming Wang et.al. 2304.01986v1 null
2023-04-04 End-to-End Latency Optimization of Multi-view 3D Reconstruction for Disaster Response Xiaojie Zhang et.al. 2304.01488v1 null
2023-04-04 FineRecon: Depth-aware Feed-forward Network for Detailed 3D Reconstruction Noah Stier et.al. 2304.01480v1 link
2023-04-03 One-Shot View Planning for Fast and Complete Unknown Object Reconstruction Sicong Pan et.al. 2304.00910v1 link
2023-03-31 LivePose: Online 3D Reconstruction from Monocular Video with Dynamic Camera Poses Noah Stier et.al. 2304.00054v1 link
2023-04-03 Three-dimensional coherent diffraction snapshot imaging using extreme ultraviolet radiation from a free electron laser Danny Fainozzi et.al. 2303.18166v2 null
2023-03-30 Enhanced Stable View Synthesis Nishant Jain et.al. 2303.17094v1 null
2023-03-29 AirLine: Efficient Learnable Line Detection with Local Edge Voting Xiao Lin et.al. 2303.16500v1 link
2023-03-29 Multi-View Azimuth Stereo via Tangent Space Consistency Xu Cao et.al. 2303.16447v1 link
2023-03-27 NeUDF: Learning Unsigned Distance Fields from Multi-view Images for Reconstructing Non-watertight Models Fei Hou et.al. 2303.15368v1 null
2023-03-27 TMO: Textured Mesh Acquisition of Objects with a Mobile Device by using Differentiable Rendering Jaehoon Choi et.al. 2303.15060v1 null
2023-03-26 Clean-NeRF: Reformulating NeRF to account for View-Dependent Observations Xinhang Liu et.al. 2303.14707v1 null
2023-03-25 PAniC-3D: Stylized Single-view 3D Reconstruction from Portraits of Anime Characters Shuhong Chen et.al. 2303.14587v1 link
2023-03-25 LPFF: A Portrait Dataset for Face Generators Across Large Poses Yiqian Wu et.al. 2303.14407v1 null
2023-03-24 BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects Bowen Wen et.al. 2303.14158v1 link
2023-03-24 Deformable Model Driven Neural Rendering for High-fidelity 3D Reconstruction of Human Heads Under Low-View Settings Baixin Xu et.al. 2303.13855v1 link
2023-03-24 Seeing Through the Glass: Neural 3D Reconstruction of Object Inside a Transparent Container Jinguang Tong et.al. 2303.13805v1 link
2023-03-23 SCADE: NeRFs from Space Carving with Ambiguity-Aware Depth Estimates Mikaela Angelina Uy et.al. 2303.13582v1 null
2023-03-21 Real-time volumetric rendering of dynamic humans Ignacio Rocco et.al. 2303.11898v1 null
2023-03-20 Zero-1-to-3: Zero-shot One Image to 3D Object Ruoshi Liu et.al. 2303.11328v1 link
2023-03-20 DIME-Net: Neural Network-Based Dynamic Intrinsic Parameter Rectification for Cameras with Optical Image Stabilization System Shu-Hao Yeh et.al. 2303.11307v1 null
2023-03-20 Ref-NeuS: Ambiguity-Reduced Neural Implicit Surface Learning for Multi-View Reconstruction with Reflection Wenhang Ge et.al. 2303.10840v1 link
2023-03-14 FingerSLAM: Closed-loop Unknown Object Localization and Reconstruction from Visuo-tactile Feedback Jialiang Zhao et.al. 2303.07997v1 null
2023-03-11 Normal-guided Garment UV Prediction for Human Re-texturing Yasamin Jafarian et.al. 2303.06504v1 null
2023-03-11 Just Flip: Flipped Observation Generation and Optimization for Neural Radiance Fields to Cover Unobserved View Minjae Lee et.al. 2303.06335v1 link
2023-03-10 ACR: Attention Collaboration-based Regressor for Arbitrary Two-Hand Reconstruction Zhengdi Yu et.al. 2303.05938v1 link
2023-03-10 Structural Multiplane Image: Bridging Neural View Synthesis and 3D Reconstruction Mingfang Zhang et.al. 2303.05937v1 null
2023-03-08 FastSurf: Fast Neural RGB-D Surface Reconstruction using Per-Frame Intrinsic Refinement and TSDF Fusion Prior Learning Seunghwan Lee et.al. 2303.04508v1 link
2023-03-08 Corner Detection Based on Multi-directional Gabor Filters with Multi-scales Huaqing Wang et.al. 2303.04334v1 null
2023-03-08 DroNeRF: Real-time Multi-agent Drone Pose Optimization for Computing Neural Radiance Fields Dipam Patel et.al. 2303.04322v1 null
2023-03-07 Proactive Multi-Camera Collaboration For 3D Human Pose Estimation Hai Ci et.al. 2303.03767v1 null
2023-03-06 System for 3D Acquisition and 3D Reconstruction using Structured Light for Sewer Line Inspection Johannes Künzel et.al. 2303.02978v1 null
2023-03-03 Delicate Textured Mesh Recovery from NeRF via Adaptive Surface Refinement Jiaxiang Tang et.al. 2303.02091v1 link
2023-03-09 MobileBrick: Building LEGO for 3D Reconstruction on Mobile Devices Kejie Li et.al. 2303.01932v2 link
2023-03-01 Motion Compensation via Epipolar Consistency for In-Vivo X-Ray Microscopy Mareike Thies et.al. 2303.00449v1 null
2023-02-28 3D Coronary Vessel Reconstruction from Bi-Plane Angiography using Graph Convolutional Networks Kit Mills Bransby et.al. 2302.14795v1 null
2023-02-28 Mask3D: Pre-training 2D Vision Transformers by Learning Masked 3D Priors Ji Hou et.al. 2302.14746v1 null
2023-02-27 UMIFormer: Mining the Correlations between Similar Tokens for Multi-View 3D Reconstruction Zhenwei Zhu et.al. 2302.13987v1 link
2023-02-26 Perceiving Unseen 3D Objects by Poking the Objects Linghao Chen et.al. 2302.13375v1 null
2023-02-25 SUPS: A Simulated Underground Parking Scenario Dataset for Autonomous Driving Jiawei Hou et.al. 2302.12966v1 link
2023-02-24 3D Surface Reconstruction in the Wild by Deforming Shape Priors from Synthetic Data Nicolai Häni et.al. 2302.12883v1 null
2023-02-23 View Consistency Aware Holistic Triangulation for 3D Human Pose Estimation Xiaoyue Wan et.al. 2302.11301v2 null
2023-02-23 $PC^2$ : Projection-Conditioned Point Cloud Diffusion for Single-Image 3D Reconstruction Luke Melas-Kyriazi et.al. 2302.10668v2 link
2023-02-23 RealFusion: 360° Reconstruction of Any Object from a Single Image Luke Melas-Kyriazi et.al. 2302.10663v2 null
2023-02-20 UAVStereo: A Multiple Resolution Dataset for Stereo Matching in UAV Scenarios Zhang Xiaoyi et.al. 2302.10082v1 link
2023-02-14 HR-NeuS: Recovering High-Frequency Surface Geometry via Neural Implicit Surfaces Erich Liang et.al. 2302.06793v1 null
2023-02-14 Boosted ab initio Cryo-EM 3D Reconstruction with ACE-EM Lin Yao et.al. 2302.06091v2 null
2023-02-11 3D Colored Shape Reconstruction from a Single RGB Image through Diffusion Bo Li et.al. 2302.05573v1 null
2023-02-09 3D reconstruction of spherical images: A review of techniques, applications, and prospects San Jiang et.al. 2302.04495v1 null
2023-02-09 PredRecon: A Prediction-boosted Planning Framework for Fast and High-quality Autonomous Aerial Reconstruction Chen Feng et.al. 2302.04488v1 link
2023-02-07 S4R: Self-Supervised Semantic Scene Reconstruction from RGB-D Scans Junwen Huang et.al. 2302.03640v1 null
2023-01-30 Mono-STAR: Mono-camera Scene-level Tracking and Reconstruction Haonan Chang et.al. 2301.13244v1 link
2023-01-27 A Comparison of Tiny-nerf versus Spatial Representations for 3d Reconstruction Saulo Abraham Gante et.al. 2301.11522v1 null
2023-01-25 Local Feature Extraction from Salient Regions by Feature Map Transformation Yerim Jung et.al. 2301.10413v1 null
2023-02-02 3D Reconstruction of Non-cooperative Resident Space Objects using Instant NGP-accelerated NeRF and D-NeRF Trupti Mahendrakar et.al. 2301.09060v2 null
2023-01-19 Parallelized computational 3D video microscopy of freely moving organisms at multiple gigapixels per second Kevin C. Zhou et.al. 2301.08351v1 link
2023-01-19 Multiview Compressive Coding for 3D Reconstruction Chao-Yuan Wu et.al. 2301.08247v1 link
2023-01-19 Regularizing disparity estimation via multi task learning with structured light reconstruction Alistair Weld et.al. 2301.08140v1 null
2023-01-12 Edge Preserving Implicit Surface Representation of Point Clouds Xiaogang Wang et.al. 2301.04860v1 null
2023-01-11 Elevation Estimation-Driven Building 3D Reconstruction from Single-View Remote Sensing Imagery Yongqiang Mao et.al. 2301.04581v1 null
2023-01-11 First 3D reconstruction of a blast furnace using muography Amélie Cohu et.al. 2301.04354v1 null
2023-01-04 Towards a Pipeline for Real-Time Visualization of Faces for VR-based Telepresence and Live Broadcasting Utilizing Neural Rendering Philipp Ladwig et.al. 2301.01490v1 link
2023-01-03 BS3D: Building-scale 3D Reconstruction from RGB-D Images Janne Mustaniemi et.al. 2301.01057v1 null
2022-12-31 Ponder: Point Cloud Pre-training via Neural Rendering Di Huang et.al. 2301.00157v1 null
2022-12-28 NeMo: 3D Neural Motion Fields from Multiple Video Instances of the Same Action Kuan-Chieh Wang et.al. 2212.13660v1 link
2022-12-24 Polarimetric Multi-View Inverse Rendering Jinyu Zhao et.al. 2212.12721v1 null

(back to top)

generate

Publish Date Title Authors PDF Code
2024-04-03 Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction Keyu Tian et.al. 2404.02905v1 link
2024-04-03 LidarDM: Generative LiDAR Simulation in a Generated World Vlas Zyrianov et.al. 2404.02903v1 null
2024-04-03 DeiT-LT Distillation Strikes Back for Vision Transformer Training on Long-Tailed Datasets Harsh Rangwani et.al. 2404.02900v1 link
2024-04-03 MatAtlas: Text-driven Consistent Geometry Texturing and Material Assignment Duygu Ceylan et.al. 2404.02899v1 null
2024-04-03 A Mean Field Game Model for Timely Computation in Edge Computing Systems Shubham Aggarwal et.al. 2404.02898v1 null
2024-04-03 Deep Image Composition Meets Image Forgery Eren Tahir et.al. 2404.02897v1 link
2024-04-03 ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline Yifan Xu et.al. 2404.02893v1 null
2024-04-03 PoCo: Point Context Cluster for RGBD Indoor Place Recognition Jing Liang et.al. 2404.02885v1 null
2024-04-02 Segment Any 3D Object with Language Seungjun Lee et.al. 2404.02157v1 null
2024-04-02 Dynamic Pre-training: Towards Efficient and Scalable All-in-One Image Restoration Akshay Dudhane et.al. 2404.02154v1 null
2024-04-02 GeneAvatar: Generic Expression-Aware Volumetric Head Avatar Editing from a Single Image Chong Bao et.al. 2404.02152v1 null
2024-04-02 Diffusion $^2$ : Dynamic 3D Content Generation via Score Composition of Orthogonal Diffusion Models Zeyu Yang et.al. 2404.02148v1 link
2024-04-02 Harder, Better, Faster, Stronger: Interactive Visualization for Human-Centered AI Tools Md Naimul Hoque et.al. 2404.02147v1 null
2024-04-02 Iterated Learning Improves Compositionality in Large Vision-Language Models Chenhao Zheng et.al. 2404.02145v1 null
2024-04-02 Multiparametric quantification and visualization of liver fat using ultrasound Jihye Baek et.al. 2404.02143v1 null
2024-03-29 Gecko: Versatile Text Embeddings Distilled from Large Language Models Jinhyuk Lee et.al. 2403.20327v1 null
2024-03-29 Shaving Logs via Large Sieve Inequality: Faster Algorithms for Sparse Convolution and More Ce Jin et.al. 2403.20326v1 null
2024-03-29 Structure and Dynamics of Magneto-Inertial, Differentially Rotating Laboratory Plasmas V. Valenzuela-Villaseca et.al. 2403.20321v1 null
2024-03-29 SeaBird: Segmentation in Bird's View with Dice Loss Improves Monocular 3D Detection of Large Objects Abhinav Kumar et.al. 2403.20318v1 link
2024-03-29 Convolutional Prompting meets Language Models for Continual Learning Anurag Roy et.al. 2403.20317v1 null
2024-03-29 Optimal Communication for Classic Functions in the Coordinator Model and Beyond Hossein Esfandiari et.al. 2403.20307v1 null
2024-03-28 GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative Modeling Bowen Zhang et.al. 2403.19655v1 null
2024-03-28 Detecting Image Attribution for Text-to-Image Diffusion Models in RGB and Beyond Katherine Xu et.al. 2403.19653v1 link
2024-03-28 InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction Sirui Xu et.al. 2403.19652v1 null
2024-03-28 GraspXL: Generating Grasping Motions for Diverse Objects at Scale Hui Zhang et.al. 2403.19649v1 null
2024-03-28 Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models Samuel Marks et.al. 2403.19647v1 link
2024-03-28 GANTASTIC: GAN-based Transfer of Interpretable Directions for Disentangled Image Editing in Text-to-Image Diffusion Models Yusuf Dalva et.al. 2403.19645v1 null
2024-03-27 Real Acoustic Fields: An Audio-Visual Room Acoustics Dataset and Benchmark Ziyang Chen et.al. 2403.18821v1 null
2024-03-27 MetaCap: Meta-learning Priors from Multi-View Imagery for Sparse-view Human Performance Capture and Rendering Guoxing Sun et.al. 2403.18820v1 null
2024-03-27 ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion Daniel Winter et.al. 2403.18818v1 null
2024-03-27 Garment3DGen: 3D Garment Stylization and Texture Generation Nikolaos Sarafianos et.al. 2403.18816v1 null
2024-03-27 Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models Yanwei Li et.al. 2403.18814v1 link
2024-03-27 Duolando: Follower GPT with Off-Policy Reinforcement Learning for Dance Accompaniment Li Siyao et.al. 2403.18811v1 null
2024-03-28 ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation Suraj Patni et.al. 2403.18807v2 link
2024-03-26 ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture Synthesis Muhammad Hamza Mughal et.al. 2403.17936v1 null
2024-03-26 OmniVid: A Generative Framework for Universal Video Understanding Junke Wang et.al. 2403.17935v1 link
2024-03-26 SLEDGE: Synthesizing Simulation Environments for Driving Agents with Generative Models Kashyap Chitta et.al. 2403.17933v1 null
2024-03-26 MAGIS: LLM-Based Multi-Agent Framework for GitHub Issue Resolution Wei Tao et.al. 2403.17927v1 null
2024-03-26 AID: Attention Interpolation of Text-to-Image Diffusion Qiyuan He et.al. 2403.17924v1 link
2024-03-26 The Need for Speed: Pruning Transformers with One Recipe Samir Khaki et.al. 2403.17921v1 link
2024-03-26 TC4D: Trajectory-Conditioned Text-to-4D Generation Sherwin Bahmani et.al. 2403.17920v1 null
2024-03-26 AgentStudio: A Toolkit for Building General Virtual Agents Longtao Zheng et.al. 2403.17918v1 null
2024-03-25 Exploiting Priors from 3D Diffusion Models for RGB-Based One-Shot View Planning Sicong Pan et.al. 2403.16803v1 null
2024-03-25 Iterative Refinement of Project-Level Code Context for Precise Code Generation with Compiler Feedback Zhangqian Bi et.al. 2403.16792v1 null
2024-03-25 Iso-Diffusion: Improving Diffusion Probabilistic Models Using the Isotropy of the Additive Gaussian Noise Dilum Fernando et.al. 2403.16790v1 null
2024-03-25 HPL-ESS: Hybrid Pseudo-Labeling for Unsupervised Event-based Semantic Segmentation Linglin Jing et.al. 2403.16788v1 null
2024-03-25 Creating a Digital Twin of Spinal Surgery: A Proof of Concept Jonas Hein et.al. 2403.16736v1 null
2024-03-25 Improving Diffusion Models's Data-Corruption Resistance using Scheduled Pseudo-Huber Loss Artem Khrapov et.al. 2403.16728v1 link
2024-03-22 DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data Hanrong Ye et.al. 2403.15389v1 null
2024-03-22 LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis Kevin Xie et.al. 2403.15385v1 null
2024-03-22 ThemeStation: Generating Theme-Aware 3D Assets from Few Exemplars Zhenwei Wang et.al. 2403.15383v1 null
2024-03-22 DragAPart: Learning a Part-Level Motion Prior for Articulated Objects Ruining Li et.al. 2403.15382v1 null
2024-03-22 Long-CLIP: Unlocking the Long-Text Capability of CLIP Beichen Zhang et.al. 2403.15378v1 link
2024-03-22 InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding Yi Wang et.al. 2403.15377v1 link
2024-03-22 A Modular, End-to-End Next-Generation Network Testbed: Towards a Fully Automated Network Management Platform Ali Chouman et.al. 2403.15376v1 null
2024-03-21 Zero-Shot Multi-Object Shape Completion Shun Iwase et.al. 2403.14628v1 null
2024-03-21 MVSplat: Efficient 3D Gaussian Splatting from Sparse Multi-View Images Yuedong Chen et.al. 2403.14627v1 link
2024-03-21 Simplified Diffusion Schrödinger Bridge Zhicong Tang et.al. 2403.14623v1 link
2024-03-21 GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation Yinghao Xu et.al. 2403.14621v1 link
2024-03-21 ClusteringSDF: Self-Organized Neural Implicit Surfaces for 3D Decomposition Tianhao Wu et.al. 2403.14619v1 null
2024-03-21 Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion Xiang Fan et.al. 2403.14617v1 null
2024-03-21 Hierarchical Text-to-Vision Self Supervised Alignment for Improved Histopathology Representation Learning Hasindri Watawana et.al. 2403.14616v1 link
2024-03-21 DreamReward: Text-to-3D Generation with Human Preference Junliang Ye et.al. 2403.14613v1 null
2024-03-21 Explorative Inbetweening of Time and Space Haiwen Feng et.al. 2403.14611v1 null
2024-03-20 On Pretraining Data Diversity for Self-Supervised Learning Hasan Abed Al Kader Hammoud et.al. 2403.13808v1 link
2024-03-20 Editing Massive Concepts in Text-to-Image Diffusion Models Tianwei Xiong et.al. 2403.13807v1 link
2024-03-20 Learning from Models and Data for Visual Grounding Ruozhen He et.al. 2403.13804v1 null
2024-03-20 Bounding Box Stability against Feature Dropout Reflects Detector Generalization across Environments Yang Yang et.al. 2403.13803v1 link
2024-03-20 ZigMa: Zigzag Mamba Diffusion Model Vincent Tao Hu et.al. 2403.13802v1 link
2024-03-20 Natural Language as Polices: Reasoning for Coordinate-Level Embodied Control with LLMs Yusuke Mikami et.al. 2403.13801v1 link
2024-03-20 TimeRewind: Rewinding Time with Image-and-Events Video Diffusion Jingxi Chen et.al. 2403.13800v1 null
2024-03-20 Reverse Training to Nurse the Reversal Curse Olga Golovneva et.al. 2403.13799v1 null
2024-03-20 Hierarchical NeuroSymbolic Approach for Action Quality Assessment Lauren Okamoto et.al. 2403.13798v1 null
2024-03-20 Bridge the Modality and Capacity Gaps in Vision-Language Model Selection Chao Yi et.al. 2403.13797v1 null
2024-03-19 LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression Zhuoshi Pan et.al. 2403.12968v1 link
2024-03-19 Wear-Any-Way: Manipulable Virtual Try-on via Sparse Correspondence Alignment Mengting Chen et.al. 2403.12965v1 null
2024-03-19 Negative Yields Positive: Unified Dual-Path Adapter for Vision-Language Models Ce Zhang et.al. 2403.12964v1 link
2024-03-19 FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis Linjiang Huang et.al. 2403.12963v1 link
2024-03-19 TexTile: A Differentiable Metric for Texture Tileability Carlos Rodriguez-Pardo et.al. 2403.12961v1 null
2024-03-19 FaceXFormer: A Unified Transformer for Facial Analysis Kartik Narayan et.al. 2403.12960v1 link
2024-03-19 GVGEN: Text-to-3D Generation with Volumetric Representation Xianglong He et.al. 2403.12957v1 null
2024-03-19 Abiogenesis: a possible quantum interpretation of the telepoietic conjecture Vittorio Cocchi et.al. 2403.12955v1 null
2024-03-19 Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization with Vision-Language Models Elaine Sui et.al. 2403.12952v1 link
2024-03-18 RIS-aided Single-frequency 3D Imaging by Exploiting Multi-view Image Correlations Yixuan Huang et.al. 2403.11764v1 null
2024-03-19 Full-Duplex MU-MIMO Systems with Coarse Quantization: How Many Bits Do We Need? Seunghyeong Yoo et.al. 2403.11762v2 null
2024-03-18 Why E.T. Can't Phone Home: A Global View on IP-based Geoblocking at VoWiFi Gabriel Karl Gegenhuber et.al. 2403.11759v1 null
2024-03-18 Meta-Prompting for Automating Zero-shot Visual Recognition with LLMs M. Jehanzeb Mirza et.al. 2403.11755v1 link
2024-03-18 Asymptotically Optimal Codes for $(t,s)$ -Burst Error Yubo Sun et.al. 2403.11750v1 null
2024-03-18 Embedded Named Entity Recognition using Probing Classifiers Nicholas Popovič et.al. 2403.11747v1 null
2024-03-18 Revisiting Tensor Basis Neural Networks for Reynolds stress modeling: application to plane channel and square duct flows Jiayi Cai et.al. 2403.11746v1 null
2024-03-18 Matter and cosmogenesis in Kant's Theory of the Heavens Garance Benoit et.al. 2403.11710v1 null
2024-03-18 Significant impact of light-matter strong coupling on chiral nonlinear optical effect Daichi Okada et.al. 2403.11709v1 null
2024-03-18 Generalized Multi-Source Inference for Text Conditioned Music Diffusion Models Emilian Postolache et.al. 2403.11706v1 link
2024-03-18 Virbo: Multimodal Multilingual Avatar Video Generation in Digital Marketing Juan Zhang et.al. 2403.11700v1 null
2024-03-18 Urban Scene Diffusion through Semantic Occupancy Map Junge Zhang et.al. 2403.11697v1 null
2024-03-18 Generalization error of spectral algorithms Maksim Velikanov et.al. 2403.11696v1 null
2024-03-18 Beamforming Design for Semantic-Bit Coexisting Communication System Maojun Zhang et.al. 2403.11693v1 null
2024-03-15 P-MapNet: Far-seeing Map Generator Enhanced by both SDMap and HDMap Priors Zhou Jiang et.al. 2403.10521v1 null
2024-03-15 Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives Ronghui Li et.al. 2403.10518v1 link
2024-03-15 FeatUp: A Model-Agnostic Framework for Features at Any Resolution Stephanie Fu et.al. 2403.10516v1 link
2024-03-15 A Novel Framework for Multi-Person Temporal Gaze Following and Social Gaze Prediction Anshul Gupta et.al. 2403.10511v1 null
2024-03-15 Demystifying Faulty Code with LLM: Step-by-Step Reasoning for Explainable Fault Localization Ratnadira Widyasari et.al. 2403.10507v1 null
2024-03-15 Belief Change based on Knowledge Measures Umberto Straccia et.al. 2403.10502v1 null
2024-03-14 SCP-Diff: Photo-Realistic Semantic Image Synthesis with Spatial-Categorical Joint Prior Huan-ang Gao et.al. 2403.09638v1 null
2024-03-14 GaussianGrasper: 3D Language Gaussian Splatting for Open-vocabulary Robotic Grasping Yuhang Zheng et.al. 2403.09637v1 link
2024-03-14 Dynamic Memory Compression: Retrofitting LLMs for Accelerated Inference Piotr Nawrot et.al. 2403.09636v1 null
2024-03-14 OneTracker: Unifying Visual Object Tracking with Foundation Models and Efficient Tuning Lingyi Hong et.al. 2403.09634v1 null
2024-03-14 Holo-Relighting: Controllable Volumetric Portrait Relighting from a Single Image Yiqun Mei et.al. 2403.09632v1 null
2024-03-14 3D-VLA: A 3D Vision-Language-Action Generative World Model Haoyu Zhen et.al. 2403.09631v1 null
2024-03-14 Generalized Predictive Model for Autonomous Driving Jiazhi Yang et.al. 2403.09630v1 link
2024-03-14 Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking Eric Zelikman et.al. 2403.09629v1 link
2024-03-14 Make-Your-3D: Fast and Consistent Subject-Driven 3D Content Generation Fangfu Liu et.al. 2403.09625v1 null
2024-03-14 Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering Zeyu Liu et.al. 2403.09622v1 null
2024-03-13 FastMAC: Stochastic Spectral Sampling of Correspondence Graph Yifei Zhang et.al. 2403.08770v1 link
2024-03-13 VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis Enric Corona et.al. 2403.08764v1 null
2024-03-13 A local model for the optical energy and momentum transfer in dielectric media and the microscopic origin of Abraham's force density B. Anghinoni et.al. 2403.08752v1 null
2024-03-13 iCONTRA: Toward Thematic Collection Design Via Interactive Concept Transfer Dinh-Khoi Vo et.al. 2403.08746v1 link
2024-03-12 Rethinking Generative Large Language Model Evaluation for Semantic Comprehension Fangyun Wei et.al. 2403.07872v1 null
2024-03-12 TeleMoMa: A Modular and Versatile Teleoperation System for Mobile Manipulation Shivin Dass et.al. 2403.07869v1 null
2024-03-12 Exploring Safety Generalization Challenges of Large Language Models via Code Qibing Ren et.al. 2403.07865v1 null
2024-03-12 Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation Shihao Zhao et.al. 2403.07860v1 link
2024-03-12 Fairness Feedback Loops: Training on Synthetic Data Amplifies Bias Sierra Wyllie et.al. 2403.07857v1 null
2024-03-12 Quantifying and Mitigating Privacy Risks for Tabular Generative Models Chaoyi Zhu et.al. 2403.07842v1 null
2024-03-11 A representation-learning game for classes of prediction tasks Neria Uzan et.al. 2403.06971v1 null
2024-03-11 The pitfalls of next-token prediction Gregor Bachmann et.al. 2403.06963v1 link
2024-03-11 Optimizing Latent Graph Representations of Surgical Scenes for Zero-Shot Domain Transfer Siddhant Satyanaik et.al. 2403.06953v1 null
2024-03-11 SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data Jialu Li et.al. 2403.06952v1 null
2024-03-08 Tell, Don't Show!: Language Guidance Eases Transfer Across Domains in Images and Videos Tarun Kalluri et.al. 2403.05535v1 null
2024-03-08 Tune without Validation: Searching for Learning Rate and Weight Decay on Training Sets Lorenzo Brigato et.al. 2403.05532v1 null
2024-03-08 Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context Machel Reid et.al. 2403.05530v1 null
2024-03-08 The Computational Complexity of Learning Gaussian Single-Index Models Alex Damian et.al. 2403.05529v1 null
2024-03-08 GEAR: An Efficient KV Cache Compression Recipefor Near-Lossless Generative Inference of LLM Hao Kang et.al. 2403.05527v1 link
2024-03-08 Beyond Finite Data: Towards Data-free Out-of-distribution Generalization via Extrapola Yijiang Li et.al. 2403.05523v1 null
2024-03-08 Bias-Augmented Consistency Training Reduces Biased Reasoning in Chain-of-Thought James Chua et.al. 2403.05518v1 link
2024-03-07 BloomGML: Graph Machine Learning through the Lens of Bilevel Optimization Amber Yijia Zheng et.al. 2403.04763v1 link
2024-03-07 Lifelong Intelligence Beyond the Edge using Hyperdimensional Computing Xiaofan Yu et.al. 2403.04759v1 link
2024-03-07 KnowledgeVIS: Interpreting Language Models by Comparing Fill-in-the-Blank Prompts Adam Coscia et.al. 2403.04758v1 link
2024-03-07 Preliminary Guidelines For Combining Data Integration and Visual Data Analysis Adam Coscia et.al. 2403.04757v1 link
2024-03-07 Mechanism for Decision-aware Collaborative Federated Learning: A Pitfall of Shapley Values Meng Qi et.al. 2403.04753v1 null
2024-03-07 JAX-SPH: A Differentiable Smoothed Particle Hydrodynamics Framework Artur P. Toshev et.al. 2403.04750v1 link
2024-03-07 A General Calibrated Regret Metric for Detecting and Mitigating Human-Robot Interaction Failures Kensuke Nakamura et.al. 2403.04745v1 null
2024-03-06 Backtracing: Retrieving the Cause of the Query Rose E. Wang et.al. 2403.03956v1 link
2024-03-06 3D Diffusion Policy Yanjie Ze et.al. 2403.03954v1 link
2024-03-06 Bridging Language and Items for Retrieval and Recommendation Yupeng Hou et.al. 2403.03952v1 link
2024-03-06 Can Audio Reveal Music Performance Difficulty? Insights from the Piano Syllabus Dataset Pedro Ramoneda et.al. 2403.03947v1 null
2024-03-06 Separate and Detailed Treatment of Absolute Signal and Noise Enables NMR Under Adverse Circumstances A Guinness et.al. 2403.03943v1 null
2024-03-06 The Heuristic Core: Understanding Subnetwork Generalization in Pretrained Language Models Adithya Bhaskar et.al. 2403.03942v1 link
2024-03-06 GUIDE: Guidance-based Incremental Learning with Diffusion Models Bartosz Cywiński et.al. 2403.03938v1 link
2024-03-05 LC-Tsalis-INF: Generalized Best-of-Both-Worlds Linear Contextual Bandits Masahiro Kato et.al. 2403.03219v1 null
2024-03-05 The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning Nathaniel Li et.al. 2403.03218v1 null
2024-03-05 Self-supervised 3D Patient Modeling with Multi-modal Attentive Fusion Meng Zheng et.al. 2403.03217v1 null
2024-03-05 A Safety-Critical Framework for UGVs in Complex Environments: A Data-Driven Discrepancy-Aware Approach Skylar X. Wei et.al. 2403.03215v1 null
2024-03-05 Scaling Rectified Flow Transformers for High-Resolution Image Synthesis Patrick Esser et.al. 2403.03206v1 null
2024-03-05 CLEVR-POC: Reasoning-Intensive Visual Question Answering in Partially Observable Environments Savitha Sam Abraham et.al. 2403.03203v1 null
2024-03-03 Bandit Profit-maximization for Targeted Marketing Joon Suk Huh et.al. 2403.01361v1 null
2024-03-03 ModelWriter: Text & Model-Synchronized Document Engineering Platform Ferhat Erata et.al. 2403.01359v1 null
2024-03-03 Improving Uncertainty Sampling with Bell Curve Weight Function Zan-Kai Chong et.al. 2403.01352v1 null
2024-03-03 Efficient FIR filtering with Bit Layer Multiply Accumulator Vincenzo Liguori et.al. 2403.01351v1 null
2024-03-02 ShapeBoost: Boosting Human Shape Estimation with Part-Based Parameterization and Clothing-Preserving Augmentation Siyuan Bian et.al. 2403.01345v1 null
2024-02-29 DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models Muyang Li et.al. 2402.19481v1 link
2024-02-29 Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers Tsai-Shien Chen et.al. 2402.19479v1 null
2024-02-29 Learning a Generalized Physical Face Model From Data Lingchen Yang et.al. 2402.19477v1 null
2024-02-29 The Counterfeit Conundrum: Can Code Language Models Grasp the Nuances of Their Incorrect Generations? Alex Gu et.al. 2402.19475v1 null
2024-02-29 The All-Seeing Project V2: Towards General Relation Comprehension of the Open World Weiyun Wang et.al. 2402.19474v1 link
2024-02-29 Retrieval-Augmented Generation for AI-Generated Content: A Survey Penghao Zhao et.al. 2402.19473v1 link
2024-02-29 Loose LIPS Sink Ships: Asking Questions in Battleship with Language-Informed Program Sampling Gabriel Grand et.al. 2402.19471v1 null
2024-02-29 Humanoid Locomotion as Next Token Prediction Ilija Radosavovic et.al. 2402.19469v1 null
2024-02-28 UniMODE: Unified Monocular 3D Object Detection Zhuoling Li et.al. 2402.18573v1 null
2024-02-28 Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards Haoxiang Wang et.al. 2402.18571v1 link
2024-02-28 Diffusion Language Models Are Versatile Protein Learners Xinyou Wang et.al. 2402.18567v1 null
2024-02-28 Approaching Human-Level Forecasting with Language Models Danny Halawi et.al. 2402.18563v1 null
2024-02-27 The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Shuming Ma et.al. 2402.17764v1 null
2024-02-27 Reducing Unnecessary Alerts in Pedestrian Protection Systems Based on P2V Communications Ignacio Soto et.al. 2402.17763v1 null
2024-02-27 Towards Optimal Learning of Language Models Yuxian Gu et.al. 2402.17759v1 null
2024-02-27 ADL4D: Towards A Contextually Rich Dataset for 4D Activities of Daily Living Marsil Zakour et.al. 2402.17758v1 null
2024-02-27 Evaluating Very Long-Term Conversational Memory of LLM Agents Adyasha Maharana et.al. 2402.17753v1 null
2024-02-26 Pre-training Cross-lingual Open Domain Question Answering with Large-scale Synthetic Supervision Fan Jiang et.al. 2402.16508v1 link
2024-02-26 Stochastic Conditional Diffusion Models for Semantic Image Synthesis Juyeon Ko et.al. 2402.16506v1 null
2024-02-26 SAND: Decoupling Sanitization from Fuzzing for Low Overhead Ziqiao Kong et.al. 2402.16497v1 null
2024-02-26 Intelligent Known and Novel Aircraft Recognition -- A Shift from Classification to Similarity Learning for Combat Identification Ahmad Saeed et.al. 2402.16486v1 null
2024-02-23 Seamless Human Motion Composition with Blended Positional Encodings German Barquero et.al. 2402.15509v1 link
2024-02-23 AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning Jianguo Zhang et.al. 2402.15506v1 link
2024-02-23 Co-Supervised Learning: Improving Weak-to-Strong Generalization with Hierarchical Mixture of Experts Yuejiang Liu et.al. 2402.15505v1 null
2024-02-23 Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition Chun-Hsiao Yeh et.al. 2402.15504v1 link
2024-02-23 API-BLEND: A Comprehensive Corpora for Training and Benchmarking API LLMs Kinjal Basu et.al. 2402.15491v1 null
2024-02-22 PALO: A Polyglot Large Multimodal Model for 5B People Muhammad Maaz et.al. 2402.14818v1 link
2024-02-22 Cameras as Rays: Pose Estimation via Ray Diffusion Jason Y. Zhang et.al. 2402.14817v1 null
2024-02-22 WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition Lianghui Zhu et.al. 2402.14812v1 link
2024-02-22 Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking Nikhil Prakash et.al. 2402.14811v1 null
2024-02-22 GeneOH Diffusion: Towards Generalizable Hand-Object Interaction Denoising via Denoising Diffusion Xueyi Liu et.al. 2402.14810v1 link
2024-02-22 CriticBench: Benchmarking LLMs for Critique-Correct Reasoning Zicheng Lin et.al. 2402.14809v1 link
2024-02-22 RelayAttention for Efficient Large Language Model Serving with Long System Prompts Lei Zhu et.al. 2402.14808v1 link
2024-02-22 A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public Health Nikhil Behari et.al. 2402.14807v1 null
2024-02-22 Identifying Multiple Personalities in Large Language Models with External Evaluation Xiaoyang Song et.al. 2402.14805v1 null
2024-02-21 D-Flow: Differentiating through Flows for Controlled Generation Heli Ben-Hamu et.al. 2402.14017v1 null
2024-02-21 Corrective Machine Unlearning Shashwat Goel et.al. 2402.14015v1 link
2024-02-21 Geometry-Informed Neural Networks Arturs Berzins et.al. 2402.14009v1 null
2024-02-21 OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems Chaoqun He et.al. 2402.14008v1 link
2024-02-21 Hallucinations or Attention Misdirection? The Path to Strategic Value Extraction in Business Using Large Language Models Aline Ioste et.al. 2402.14002v1 null
2024-02-21 Real-time 3D-aware Portrait Editing from a Single Image Qingyan Bai et.al. 2402.14000v1 null
2024-02-20 CounterCurate: Enhancing Physical and Semantic Visio-Linguistic Compositional Reasoning via Counterfactual Examples Jianrui Zhang et.al. 2402.13254v1 link
2024-02-20 BiMediX: Bilingual Medical Mixture of Experts LLM Sara Pieri et.al. 2402.13253v1 link
2024-02-20 Video ReCap: Recursive Captioning of Hour-Long Videos Md Mohaiminul Islam et.al. 2402.13250v1 null
2024-02-20 TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization Liyan Tang et.al. 2402.13249v1 link
2024-02-20 Are Fact-Checking Tools Reliable? An Evaluation of Google Fact Check Qiangeng Yang et.al. 2402.13244v1 null
2024-02-20 Unlocking Insights: Semantic Search in Jupyter Notebooks Lan Li et.al. 2402.13234v1 null
2024-02-20 A Touch, Vision, and Language Dataset for Multimodal Alignment Letian Fu et.al. 2402.13232v1 link
2024-02-19 FiT: Flexible Vision Transformer for Diffusion Model Zeyu Lu et.al. 2402.12376v1 link
2024-02-19 A synthetic data approach for domain generalization of NLI models Mohammad Javad Hosseini et.al. 2402.12368v1 null
2024-02-19 A Critical Evaluation of AI Feedback for Aligning Large Language Models Archit Sharma et.al. 2402.12366v1 link
2024-02-19 Almost-linear time parameterized algorithm for rankwidth via dynamic rankwidth Tuukka Korhonen et.al. 2402.12364v1 null
2024-02-19 Flip Graphs of Pseudo-Triangulations With Face Degree at Most 4 Maarten Löffler et.al. 2402.12357v1 null
2024-02-19 Graph-Based Retriever Captures the Long Tail of Biomedical Knowledge Julien Delile et.al. 2402.12352v1 null
2024-02-16 Fusion of Diffusion Weighted MRI and Clinical Data for Predicting Functional Outcome after Acute Ischemic Stroke with Deep Contrastive Learning Chia-Ling Tsai et.al. 2402.10894v1 null
2024-02-16 RLVF: Learning from Verbal Feedback without Overgeneralization Moritz Stephan et.al. 2402.10893v1 link
2024-02-16 Instruction Diversity Drives Generalization To Unseen Tasks Dylan Zhang et.al. 2402.10891v1 null
2024-02-16 When is Tree Search Useful for LLM Planning? It Depends on the Discriminator Ziru Chen et.al. 2402.10890v1 link
2024-02-16 Evaluation of EAP Usage for Authenticating Eduroam Users in 5G Networks Leonardo Azalim de Oliveira et.al. 2402.10889v1 null
2024-02-16 Explainability for Machine Learning Models: From Data Adaptability to User Perception julien Delaunay et.al. 2402.10888v1 null
2024-02-16 Reviewer2: Optimizing Review Generation Through Prompt Generation Zhaolin Gao et.al. 2402.10886v1 null
2024-02-16 3D Diffuser Actor: Policy Diffusion with 3D Scene Representations Tsung-Wei Ke et.al. 2402.10885v1 null
2024-02-15 Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation Huizhuo Yuan et.al. 2402.10210v1 null
2024-02-15 Recovering the Pre-Fine-Tuning Weights of Generative Models Eliahu Horwitz et.al. 2402.10208v1 link
2024-02-15 Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment Rui Yang et.al. 2402.10207v1 link
2024-02-15 Unlocking the Potential of Transformers in Time Series Forecasting with Sharpness-Aware Minimization and Channel-Wise Attention Romain Ilbert et.al. 2402.10198v1 link
2024-02-15 BitDelta: Your Fine-Tune May Only Be Worth One Bit James Liu et.al. 2402.10193v1 link
2024-02-15 Multi-Excitation Projective Simulation with a Many-Body Physics Inspired Inductive Bias Philip A. LeMaitre et.al. 2402.10192v1 link
2024-02-15 FedAnchor: Enhancing Federated Semi-Supervised Learning with Label Contrastive Loss for Unlabeled Clients Xinchi Qiu et.al. 2402.10191v1 null
2024-02-14 AQA-Bench: An Interactive Benchmark for Evaluating LLMs' Sequential Reasoning Ability Siwei Yang et.al. 2402.09404v1 link
2024-02-14 Reinforcement Learning from Human Feedback with Active Queries Kaixuan Ji et.al. 2402.09401v1 null
2024-02-14 Long-form evaluation of model editing Domenic Rosati et.al. 2402.09394v1 null
2024-02-14 Introduction to Physically Unclonable Fuctions: Properties and Applications M. Garcia-Bosque et.al. 2402.09386v1 null
2024-02-14 GraSSRep: Graph-Based Self-Supervised Learning for Repeat Detection in Metagenomic Assembly Ali Azizpour et.al. 2402.09381v1 link
2024-02-13 IM-3D: Iterative Multiview Diffusion and Reconstruction for High-Quality 3D Generation Luke Melas-Kyriazi et.al. 2402.08682v1 null
2024-02-13 Mitigating Object Hallucination in Large Vision-Language Models via Classifier-Free Guidance Linxi Zhao et.al. 2402.08680v1 null
2024-02-13 COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability Xingang Guo et.al. 2402.08679v1 link
2024-02-13 Graph Mamba: Towards Learning on Graphs with State Space Models Ali Behrouz et.al. 2402.08678v1 link
2024-02-13 Model Assessment and Selection under Temporal Distribution Shift Elise Han et.al. 2402.08672v1 link
2024-02-13 Rec-GPT4V: Multimodal Recommendation with Large Vision-Language Models Yuqing Liu et.al. 2402.08670v1 null
2024-02-13 Improving Generalization in Semantic Parsing by Increasing Natural Language Variation Irina Saparina et.al. 2402.08666v1 link
2024-02-12 A systematic investigation of learnability from single child linguistic input Yulu Qin et.al. 2402.07899v1 null
2024-02-12 Label-Efficient Model Selection for Text Generation Shir Ashury-Tahan et.al. 2402.07891v1 null
2024-02-12 Toward an Android Static Analysis Approach for Data Protection Mugdha Khedkar et.al. 2402.07889v1 null
2024-02-12 WildfireGPT: Tailored Large Language Model for Wildfire Analysis Yangxinyu Xie et.al. 2402.07877v1 null
2024-02-12 Policy Improvement using Language Feedback Models Victor Zhong et.al. 2402.07876v1 null
2024-02-09 Feedback Loops With Language Models Drive In-Context Reward Hacking Alexander Pan et.al. 2402.06627v1 link
2024-02-09 Understanding the Effects of Iterative Prompting on Truthfulness Satyapriya Krishna et.al. 2402.06625v1 null
2024-02-09 A two-stage algorithm in evolutionary product unit neural networks for classification Antonio J. Tallón-Ballesteros et.al. 2402.06622v1 null
2024-02-09 TIC: Translate-Infer-Compile for accurate 'text to plan' using LLMs and logical intermediate representations Sudhir Agarwal et.al. 2402.06608v1 null
2024-02-09 On the Out-Of-Distribution Generalization of Multimodal Large Language Models Xingxuan Zhang et.al. 2402.06599v1 null
2024-02-09 CigaR: Cost-efficient Program Repair with LLMs Dávid Hidvégi et.al. 2402.06598v1 link
2024-02-09 Understanding the Weakness of Large Language Model Agents within a Complex Android Environment Mingzhe Xing et.al. 2402.06596v1 link
2024-02-08 InstaGen: Enhancing Object Detection by Training on Synthetic Dataset Chengjian Feng et.al. 2402.05937v1 null
2024-02-08 SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models Peng Gao et.al. 2402.05935v1 link
2024-02-08 Time Series Diffusion in the Frequency Domain Jonathan Crabbé et.al. 2402.05933v1 link
2024-02-08 WebLINX: Real-World Website Navigation with Multi-Turn Dialogue Xing Han Lù et.al. 2402.05930v1 link
2024-02-08 An Interactive Agent Foundation Model Zane Durante et.al. 2402.05929v1 null
2024-02-08 Sharp Rates in Dependent Learning Theory: Avoiding Sample Size Deflation for the Square Loss Ingvar Ziemann et.al. 2402.05928v1 null
2024-02-07 Image captioning for Brazilian Portuguese using GRIT model Rafael Silva de Alencar et.al. 2402.05106v1 null
2024-02-07 You Can REST Now: Automated Specification Inference and Black-Box Testing of RESTful APIs with Large Language Models Alix Decrop et.al. 2402.05102v1 null
2024-02-07 Hydragen: High-Throughput LLM Inference with Shared Prefixes Jordan Juravsky et.al. 2402.05099v1 null
2024-02-07 On diffusion models for amortized inference: Benchmarking and improving stochastic control and sampling Marcin Sendera et.al. 2402.05098v1 link
2024-02-07 Language-Based Augmentation to Address Shortcut Learning in Object Goal Navigation Dennis Hoftijzer et.al. 2402.05090v1 null
2024-02-07 Hyperspectral acquisition with ScanImage at the single pixel level: Application to time domain coherent Raman imaging Samuel Metais et.al. 2402.05086v1 null
2024-02-06 Linear-time Minimum Bayes Risk Decoding with Reference Aggregation Jannis Vamvas et.al. 2402.04251v1 link
2024-02-06 CAST: Clustering Self-Attention using Surrogate Tokens for Efficient Transformers Adjorn van Engelenhoven et.al. 2402.04239v1 null
2024-02-06 CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations Ji Qi et.al. 2402.04236v1 link
2024-02-06 **Role of spontaneously generated

Releases

No releases published

Packages

No packages published

Languages