Table of Contents
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-04-03 | Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction | Keyu Tian et.al. | 2404.02905v1 | link |
2024-04-03 | LidarDM: Generative LiDAR Simulation in a Generated World | Vlas Zyrianov et.al. | 2404.02903v1 | null |
2024-04-03 | MatAtlas: Text-driven Consistent Geometry Texturing and Material Assignment | Duygu Ceylan et.al. | 2404.02899v1 | null |
2024-04-03 | On the Scalability of Diffusion-based Text-to-Image Generation | Hao Li et.al. | 2404.02883v1 | null |
2024-04-03 | Fast Diffusion Model For Seismic Data Noise Attenuation | Junheng Peng et.al. | 2404.02767v1 | null |
2024-04-03 | Cross-Attention Makes Inference Cumbersome in Text-to-Image Diffusion Models | Wentian Zhang et.al. | 2404.02747v1 | link |
2024-04-03 | InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation | Haofan Wang et.al. | 2404.02733v1 | link |
2024-04-03 | Harnessing the Power of Large Vision Language Models for Synthetic Image Detection | Mamadou Keita et.al. | 2404.02726v1 | null |
2024-04-02 | Diffusion |
Zeyu Yang et.al. | 2404.02148v1 | link |
2024-04-02 | WcDT: World-centric Diffusion Transformer for Traffic Scene Generation | Chen Yang et.al. | 2404.02082v1 | link |
2024-04-02 | AUTODIFF: Autoregressive Diffusion Modeling for Structure-based Drug Design | Xinze Li et.al. | 2404.02003v1 | null |
2024-04-02 | Bi-LORA: A Vision-Language Approach for Synthetic Image Detection | Mamadou Keita et.al. | 2404.01959v1 | null |
2024-03-29 | Relation Rectification in Diffusion Model | Yinwei Wu et.al. | 2403.20249v1 | null |
2024-03-29 | Graph Neural Aggregation-diffusion with Metastability | Kaiyuan Cui et.al. | 2403.20221v1 | null |
2024-03-29 | Motion Inversion for Video Customization | Luozhou Wang et.al. | 2403.20193v1 | null |
2024-03-29 | FreeSeg-Diff: Training-Free Open-Vocabulary Segmentation with Diffusion Models | Barbara Toniella Corradini et.al. | 2403.20105v1 | null |
2024-03-29 | SGD: Street View Synthesis with Gaussian Splatting and Diffusion Prior | Zhongrui Yu et.al. | 2403.20079v1 | null |
2024-03-29 | Optimal s-boxes against alternative operations | Marco Calderini et.al. | 2403.20059v1 | null |
2024-03-28 | GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative Modeling | Bowen Zhang et.al. | 2403.19655v1 | null |
2024-03-28 | Detecting Image Attribution for Text-to-Image Diffusion Models in RGB and Beyond | Katherine Xu et.al. | 2403.19653v1 | link |
2024-03-28 | InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction | Sirui Xu et.al. | 2403.19652v1 | null |
2024-03-28 | GANTASTIC: GAN-based Transfer of Interpretable Directions for Disentangled Image Editing in Text-to-Image Diffusion Models | Yusuf Dalva et.al. | 2403.19645v1 | null |
2024-03-28 | Generalisation of the Spectral Difference scheme for the diffused-interface five equation model | Niccolò Tonicello et.al. | 2403.19623v1 | null |
2024-03-28 | Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model | Zhicai Wang et.al. | 2403.19600v1 | link |
2024-03-28 | Frame by Familiar Frame: Understanding Replication in Video Diffusion Models | Aimon Rahman et.al. | 2403.19593v1 | null |
2024-03-28 | Keypoint Action Tokens Enable In-Context Imitation Learning in Robotics | Norman Di Palo et.al. | 2403.19578v1 | null |
2024-03-27 | ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion | Daniel Winter et.al. | 2403.18818v1 | null |
2024-03-27 | Garment3DGen: 3D Garment Stylization and Texture Generation | Nikolaos Sarafianos et.al. | 2403.18816v1 | null |
2024-03-28 | ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation | Suraj Patni et.al. | 2403.18807v2 | link |
2024-03-27 | Object Pose Estimation via the Aggregation of Diffusion Features | Tianfu Wang et.al. | 2403.18791v1 | link |
2024-03-27 | ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic Object | Chenshuang Zhang et.al. | 2403.18775v1 | link |
2024-03-28 | FlexEdit: Flexible and Controllable Diffusion-based Object-centric Image Editing | Trong-Tung Nguyen et.al. | 2403.18605v2 | null |
2024-03-27 | HandBooster: Boosting 3D Hand-Mesh Reconstruction by Conditional Synthesis and Sampling of Hand-Object Interactions | Hao Xu et.al. | 2403.18575v1 | link |
2024-03-26 | ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture Synthesis | Muhammad Hamza Mughal et.al. | 2403.17936v1 | null |
2024-03-26 | SLEDGE: Synthesizing Simulation Environments for Driving Agents with Generative Models | Kashyap Chitta et.al. | 2403.17933v1 | null |
2024-03-26 | AID: Attention Interpolation of Text-to-Image Diffusion | Qiyuan He et.al. | 2403.17924v1 | link |
2024-03-26 | Boosting Diffusion Models with Moving Average Sampling in Frequency Domain | Yurui Qian et.al. | 2403.17870v1 | null |
2024-03-26 | The memory of Rayleigh-Taylor turbulence | S. Thévenin et.al. | 2403.17832v1 | null |
2024-03-26 | DiffH2O: Diffusion-Based Synthesis of Hand-Object Interactions from Textual Descriptions | Sammy Christen et.al. | 2403.17827v1 | null |
2024-03-25 | Exploiting Priors from 3D Diffusion Models for RGB-Based One-Shot View Planning | Sicong Pan et.al. | 2403.16803v1 | null |
2024-03-25 | Iso-Diffusion: Improving Diffusion Probabilistic Models Using the Isotropy of the Additive Gaussian Noise | Dilum Fernando et.al. | 2403.16790v1 | null |
2024-03-25 | Multilevel Modeling as a Methodology for the Simulation of Human Mobility | Luca Serena et.al. | 2403.16745v1 | null |
2024-03-25 | A Robotic Skill Learning System Built Upon Diffusion Policies and Foundation Models | Nils Ingelhag et.al. | 2403.16730v1 | null |
2024-03-25 | Improving Diffusion Models's Data-Corruption Resistance using Scheduled Pseudo-Huber Loss | Artem Khrapov et.al. | 2403.16728v1 | link |
2024-03-25 | The effect of inter-track coupling on H $_2$O$_2$ productions | Ramin Abolfath et.al. | 2403.16722v1 | null |
2024-03-25 | The Directionality of Gravitational and Thermal Diffusive Transport in Geologic Fluid Storage | Anna Herring et.al. | 2403.16659v1 | null |
2024-03-25 | SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions | Yuda Song et.al. | 2403.16627v1 | link |
2024-03-25 | SatSynth: Augmenting Image-Mask Pairs through Diffusion Models for Aerial Semantic Segmentation | Aysim Toker et.al. | 2403.16605v1 | null |
2024-03-22 | DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data | Hanrong Ye et.al. | 2403.15389v1 | null |
2024-03-22 | LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis | Kevin Xie et.al. | 2403.15385v1 | null |
2024-03-22 | Controlled Training Data Generation with Diffusion Models | Teresa Yeo et.al. | 2403.15309v1 | null |
2024-03-22 | Parametric PDE Control with Deep Reinforcement Learning and Differentiable L0-Sparse Polynomial Policies | Nicolò Botteghi et.al. | 2403.15267v1 | null |
2024-03-22 | Spectral Motion Alignment for Video Motion Transfer using Diffusion Models | Geon Yeong Park et.al. | 2403.15249v1 | null |
2024-03-22 | Shadow Generation for Composite Image Using Diffusion model | Qingyang Liu et.al. | 2403.15234v1 | link |
2024-03-22 | Broad Instantaneous Bandwidth Microwave Spectrum Analyzer with a Microfabricated Atomic Vapor Cell | Yongqi Shi et.al. | 2403.15155v1 | null |
2024-03-22 | Oxygenation of CO and NO on Amorphous Solid Water | Meenu Upadhyay et.al. | 2403.15141v1 | null |
2024-03-21 | Simplified Diffusion Schrödinger Bridge | Zhicong Tang et.al. | 2403.14623v1 | link |
2024-03-21 | GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation | Yinghao Xu et.al. | 2403.14621v1 | link |
2024-03-21 | Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion | Xiang Fan et.al. | 2403.14617v1 | null |
2024-03-21 | DreamReward: Text-to-3D Generation with Human Preference | Junliang Ye et.al. | 2403.14613v1 | null |
2024-03-21 | ReNoise: Real Image Inversion Through Iterative Noising | Daniel Garibi et.al. | 2403.14602v1 | null |
2024-03-21 | Click to Grasp: Zero-Shot Precise Manipulation via Visual Diffusion Descriptors | Nikolaos Tsagkas et.al. | 2403.14526v1 | null |
2024-03-21 | Style-Extracting Diffusion Models for Semi-Supervised Histopathology Segmentation | Mathias Öttl et.al. | 2403.14429v1 | null |
2024-03-20 | On Pretraining Data Diversity for Self-Supervised Learning | Hasan Abed Al Kader Hammoud et.al. | 2403.13808v1 | link |
2024-03-20 | Editing Massive Concepts in Text-to-Image Diffusion Models | Tianwei Xiong et.al. | 2403.13807v1 | link |
2024-03-20 | ZigMa: Zigzag Mamba Diffusion Model | Vincent Tao Hu et.al. | 2403.13802v1 | link |
2024-03-20 | TimeRewind: Rewinding Time with Image-and-Events Video Diffusion | Jingxi Chen et.al. | 2403.13800v1 | null |
2024-03-20 | DepthFM: Fast Monocular Depth Estimation with Flow Matching | Ming Gui et.al. | 2403.13788v1 | null |
2024-03-20 | Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation | Fu-Yun Wang et.al. | 2403.13745v1 | link |
2024-03-20 | Probabilistic Forecasting with Stochastic Interpolants and Föllmer Processes | Yifan Chen et.al. | 2403.13724v1 | null |
2024-03-19 | FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis | Linjiang Huang et.al. | 2403.12963v1 | link |
2024-03-19 | FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation | Shuai Yang et.al. | 2403.12962v1 | link |
2024-03-19 | TexTile: A Differentiable Metric for Texture Tileability | Carlos Rodriguez-Pardo et.al. | 2403.12961v1 | null |
2024-03-19 | GVGEN: Text-to-3D Generation with Volumetric Representation | Xianglong He et.al. | 2403.12957v1 | null |
2024-03-19 | Zero-Reference Low-Light Enhancement via Physical Quadruple Priors | Wenjing Wang et.al. | 2403.12933v1 | null |
2024-03-19 | You Only Sample Once: Taming One-Step Text-To-Image Synthesis by Self-Cooperative Diffusion GANs | Yihong Luo et.al. | 2403.12931v1 | link |
2024-03-19 | Ultra-High-Resolution Image Synthesis with Pyramid Diffusion Model | Jiajie Yang et.al. | 2403.12915v1 | link |
2024-03-19 | D-Cubed: Latent Diffusion Trajectory Optimisation for Dexterous Deformable Manipulation | Jun Yamada et.al. | 2403.12861v1 | null |
2024-03-18 | Generalized Multi-Source Inference for Text Conditioned Music Diffusion Models | Emilian Postolache et.al. | 2403.11706v1 | link |
2024-03-19 | Urban Scene Diffusion through Semantic Occupancy Map | Junge Zhang et.al. | 2403.11697v2 | null |
2024-03-18 | Binary Noise for Binary Tasks: Masked Bernoulli Diffusion for Unsupervised Anomaly Detection | Julia Wolleb et.al. | 2403.11667v1 | null |
2024-03-18 | Diffusion-Based Environment-Aware Trajectory Prediction | Theodor Westny et.al. | 2403.11643v1 | null |
2024-03-18 | Arc2Face: A Foundation Model of Human Faces | Foivos Paraperas Papantoniou et.al. | 2403.11641v1 | link |
2024-03-18 | LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models | Yang Yang et.al. | 2403.11627v1 | link |
2024-03-18 | CRS-Diff: Controllable Generative Remote Sensing Foundation Model | Datao Tang et.al. | 2403.11614v1 | link |
2024-03-15 | Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives | Ronghui Li et.al. | 2403.10518v1 | link |
2024-03-15 | MusicHiFi: Fast High-Fidelity Stereo Vocoding | Ge Zhu et.al. | 2403.10493v1 | null |
2024-03-15 | SculptDiff: Learning Robotic Clay Sculpting from Humans with Goal Conditioned Diffusion Policy | Alison Bartsch et.al. | 2403.10401v1 | null |
2024-03-15 | Isotropic3D: Image-to-3D Generation Based on a Single CLIP Embedding | Pengkun Liu et.al. | 2403.10395v1 | link |
2024-03-15 | Denoising Task Difficulty-based Curriculum for Training Diffusion Models | Jin-Young Kim et.al. | 2403.10348v1 | null |
2024-03-15 | Towards Generalizable Deepfake Video Detection with Thumbnail Layout and Graph Reasoning | Yuting Xu et.al. | 2403.10261v1 | link |
2024-03-14 | SCP-Diff: Photo-Realistic Semantic Image Synthesis with Spatial-Categorical Joint Prior | Huan-ang Gao et.al. | 2403.09638v1 | null |
2024-03-14 | 3D-VLA: A 3D Vision-Language-Action Generative World Model | Haoyu Zhen et.al. | 2403.09631v1 | null |
2024-03-14 | Generalized Predictive Model for Autonomous Driving | Jiazhi Yang et.al. | 2403.09630v1 | link |
2024-03-14 | Make-Your-3D: Fast and Consistent Subject-Driven 3D Content Generation | Fangfu Liu et.al. | 2403.09625v1 | null |
2024-03-14 | Score-Guided Diffusion for 3D Human Recovery | Anastasis Stathopoulos et.al. | 2403.09623v1 | link |
2024-03-14 | Explore In-Context Segmentation via Latent Diffusion Models | Chaoyang Wang et.al. | 2403.09616v1 | null |
2024-03-14 | The effect of spatially-varying collision frequency on the development of the Rayleigh-Taylor instability | John Rodman et.al. | 2403.09591v1 | null |
2024-03-14 | MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models | Zunnan Xu et.al. | 2403.09471v1 | null |
2024-03-14 | Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing | Wonjun Kang et.al. | 2403.09468v1 | link |
2024-03-13 | VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis | Enric Corona et.al. | 2403.08764v1 | null |
2024-03-14 | GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing | Jing Wu et.al. | 2403.08733v2 | null |
2024-03-13 | Ambient Diffusion Posterior Sampling: Solving Inverse Problems with Diffusion Models trained on Corrupted Data | Asad Aali et.al. | 2403.08728v1 | link |
2024-03-13 | Historical Astronomical Diagrams Decomposition in Geometric Primitives | Syrine Kalleli et.al. | 2403.08721v1 | null |
2024-03-12 | Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation | Shihao Zhao et.al. | 2403.07860v1 | link |
2024-03-12 | Quantifying and Mitigating Privacy Risks for Tabular Generative Models | Chaoyi Zhu et.al. | 2403.07842v1 | null |
2024-03-12 | MPCPA: Multi-Center Privacy Computing with Predictions Aggregation based on Denoising Diffusion Probabilistic Model | Guibo Luo et.al. | 2403.07838v1 | null |
2024-03-13 | SemCity: Semantic Scene Generation with Triplane Diffusion | Jumin Lee et.al. | 2403.07773v2 | link |
2024-03-12 | Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model | Yuxuan Zhang et.al. | 2403.07764v1 | null |
2024-03-13 | Visual Decoding and Reconstruction via EEG Embeddings with Guided Diffusion | Dongyang Li et.al. | 2403.07721v2 | link |
2024-03-12 | SSM Meets Video Diffusion Models: Efficient Video Generation with Structured State Spaces | Yuta Oshima et.al. | 2403.07711v1 | link |
2024-03-12 | Genuine Knowledge from Practice: Diffusion Test-Time Adaptation for Video Adverse Weather Removal | Yijun Yang et.al. | 2403.07684v1 | null |
2024-03-11 | BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion | Xuan Ju et.al. | 2403.06976v1 | link |
2024-03-11 | Bayesian Diffusion Models for 3D Shape Reconstruction | Haiyang Xu et.al. | 2403.06973v1 | null |
2024-03-11 | SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data | Jialu Li et.al. | 2403.06952v1 | null |
2024-03-12 | DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations | Tianhao Qi et.al. | 2403.06951v2 | link |
2024-03-08 | VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models | Yabo Zhang et.al. | 2403.05438v1 | link |
2024-03-08 | DiffSF: Diffusion Models for Scene Flow Estimation | Yushan Zhang et.al. | 2403.05327v1 | link |
2024-03-07 | ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes | Hashmat Shadab Malik et.al. | 2403.04701v1 | link |
2024-03-07 | Delving into the Trajectory Long-tail Distribution for Muti-object Tracking | Sijia Chen et.al. | 2403.04700v1 | link |
2024-03-07 | PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation | Junsong Chen et.al. | 2403.04692v1 | null |
2024-03-07 | Pix2Gif: Motion-Guided Diffusion for GIF Generation | Hitesh Kandala et.al. | 2403.04634v1 | null |
2024-03-06 | 3D Diffusion Policy | Yanjie Ze et.al. | 2403.03954v1 | link |
2024-03-06 | GUIDE: Guidance-based Incremental Learning with Diffusion Models | Bartosz Cywiński et.al. | 2403.03938v1 | link |
2024-03-06 | Hierarchical Diffusion Policy for Kinematics-Aware Multi-Task Robotic Manipulation | Xiao Ma et.al. | 2403.03890v1 | null |
2024-03-06 | Latent Dataset Distillation with Diffusion Models | Brian B. Moser et.al. | 2403.03881v1 | null |
2024-03-06 | Accelerating Convergence of Score-Based Diffusion Models, Provably | Gen Li et.al. | 2403.03852v1 | null |
2024-03-06 | Diffusion on language model embeddings for protein sequence generation | Viacheslav Meshchaninov et.al. | 2403.03726v1 | null |
2024-03-05 | Scaling Rectified Flow Transformers for High-Resolution Image Synthesis | Patrick Esser et.al. | 2403.03206v1 | null |
2024-03-05 | MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets | Hossein Aboutalebi et.al. | 2403.03194v1 | null |
2024-03-05 | Behavior Generation with Latent Actions | Seungjae Lee et.al. | 2403.03181v1 | link |
2024-03-05 | Enhanced beam-beam modeling to include longitudinal variation during weak-strong simulation | Derong Xu et.al. | 2403.03137v1 | null |
2024-03-02 | Bespoke Non-Stationary Solvers for Fast Sampling of Diffusion and Flow Models | Neta Shaul et.al. | 2403.01329v1 | null |
2024-03-02 | Anomalous mass dependency in Hydra endoderm cell cluster diffusion | Aline Lütz et.al. | 2403.01294v1 | null |
2024-03-02 | DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction | Junwen Xiong et.al. | 2403.01226v1 | null |
2024-03-02 | TCIG: Two-Stage Controlled Image Generation with Quality Enhancement through Diffusion | Salaheldin Mohamed et.al. | 2403.01212v1 | null |
2024-02-29 | DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models | Muyang Li et.al. | 2402.19481v1 | link |
2024-02-29 | Structure Preserving Diffusion Models | Haoye Lu et.al. | 2402.19369v1 | null |
2024-02-29 | A Novel Approach to Industrial Defect Generation through Blended Latent Diffusion Model with Online Adaptation | Hanxi Li et.al. | 2402.19330v1 | link |
2024-02-29 | DiffAssemble: A Unified Graph-Diffusion Model for 2D and 3D Reassembly | Gianluca Scarpellini et.al. | 2402.19302v1 | link |
2024-02-29 | Generative models struggle with kirigami metamaterials | Gerrit Felsch et.al. | 2402.19196v1 | null |
2024-02-28 | Diffusion Language Models Are Versatile Protein Learners | Xinyou Wang et.al. | 2402.18567v1 | null |
2024-02-28 | Photon statistics of resonantly driven spectrally diffusive quantum emitters | Aymeric Delteil et.al. | 2402.18542v1 | null |
2024-02-28 | Dynamical Regimes of Diffusion Models | Giulio Biroli et.al. | 2402.18491v1 | null |
2024-02-28 | Objective and Interpretable Breast Cosmesis Evaluation with Attention Guided Denoising Diffusion Anomaly Detection Model | Sangjoon Park et.al. | 2402.18362v1 | null |
2024-02-27 | Diffusion Meets DAgger: Supercharging Eye-in-hand Imitation Learning | Xiaoyu Zhang et.al. | 2402.17768v1 | null |
2024-02-27 | Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners | Yazhou Xing et.al. | 2402.17723v1 | null |
2024-02-27 | Structure-Guided Adversarial Training of Diffusion Models | Ling Yang et.al. | 2402.17563v1 | null |
2024-02-27 | Scribble Hides Class: Promoting Scribble-Based Weakly-Supervised Semantic Segmentation with Its Class Label | Xinliang Zhang et.al. | 2402.17555v1 | link |
2024-02-27 | Diffusion Model-Based Image Editing: A Survey | Yi Huang et.al. | 2402.17525v1 | link |
2024-02-27 | Label-Noise Robust Diffusion Models | Byeonghu Na et.al. | 2402.17517v1 | link |
2024-02-27 | The Unwanted Dissemination of Science: The Usage of Academic Articles as Ammunition in Contested Discursive Arenas on Twitter | Richard Zhang et.al. | 2402.17495v1 | null |
2024-02-27 | EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions | Linrui Tian et.al. | 2402.17485v1 | null |
2024-02-26 | Stochastic Conditional Diffusion Models for Semantic Image Synthesis | Juyeon Ko et.al. | 2402.16506v1 | null |
2024-02-26 | Outline-Guided Object Inpainting with Diffusion Models | Markus Pobitzer et.al. | 2402.16421v1 | null |
2024-02-26 | Placing Objects in Context via Inpainting for Out-of-distribution Segmentation | Pau de Jorge et.al. | 2402.16392v1 | link |
2024-02-26 | Generative AI in Vision: A Survey on Models, Metrics and Applications | Gaurav Raut et.al. | 2402.16369v1 | null |
2024-02-26 | Feedback Efficient Online Fine-Tuning of Diffusion Models | Masatoshi Uehara et.al. | 2402.16359v1 | null |
2024-02-26 | Referee Can Play: An Alternative Approach to Conditional Generation via Model Inversion | Xuantong Liu et.al. | 2402.16305v1 | null |
2024-02-26 | Graph Diffusion Policy Optimization | Yijing Liu et.al. | 2402.16302v1 | link |
2024-02-23 | Seamless Human Motion Composition with Blended Positional Encodings | German Barquero et.al. | 2402.15509v1 | link |
2024-02-23 | Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition | Chun-Hsiao Yeh et.al. | 2402.15504v1 | link |
2024-02-23 | Solute transport due to periodic loading in a soft porous material | Matilde Fiori et.al. | 2402.15451v1 | null |
2024-02-23 | ProTIP: Probabilistic Robustness Verification on Text-to-Image Diffusion Models against Stochastic Perturbation | Yi Zhang et.al. | 2402.15429v1 | link |
2024-02-23 | Understanding Oversmoothing in Diffusion-Based GNNs From the Perspective of Operator Semigroup Theory | Weichen Zhao et.al. | 2402.15326v1 | null |
2024-02-23 | Let's Rectify Step by Step: Improving Aspect-based Sentiment Analysis with Diffusion Models | Shunyu Liu et.al. | 2402.15289v1 | link |
2024-02-22 | Cameras as Rays: Pose Estimation via Ray Diffusion | Jason Y. Zhang et.al. | 2402.14817v1 | null |
2024-02-22 | GeneOH Diffusion: Towards Generalizable Hand-Object Interaction Denoising via Denoising Diffusion | Xueyi Liu et.al. | 2402.14810v1 | link |
2024-02-22 | Consolidating Attention Features for Multi-view Image Editing | Or Patashnik et.al. | 2402.14792v1 | null |
2024-02-22 | Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models | Yixuan Ren et.al. | 2402.14780v1 | null |
2024-02-22 | Two-stage Cytopathological Image Synthesis for Augmenting Cervical Abnormality Screening | Zhenrong Shen et.al. | 2402.14707v1 | null |
2024-02-22 | Debiasing Text-to-Image Diffusion Models | Ruifei He et.al. | 2402.14577v1 | null |
2024-02-22 | DynGMA: a robust approach for learning stochastic differential equations from data | Aiqing Zhu et.al. | 2402.14475v1 | link |
2024-02-21 | D-Flow: Differentiating through Flows for Controlled Generation | Heli Ben-Hamu et.al. | 2402.14017v1 | null |
2024-02-21 | SDXL-Lightning: Progressive Adversarial Diffusion Distillation | Shanchuan Lin et.al. | 2402.13929v1 | null |
2024-02-21 | Non-asymptotic Convergence of Discrete-time Diffusion Models: New Approach and Improved Rate | Yuchen Liang et.al. | 2402.13901v1 | null |
2024-02-21 | NeuralDiffuser: Controllable fMRI Reconstruction with Primary Visual Feature Guided Diffusion | Haoyu Li et.al. | 2402.13809v1 | null |
2024-02-21 | The Geography of Information Diffusion in Online Discourse on Europe and Migration | Elisa Leonardelli et.al. | 2402.13800v1 | null |
2024-02-21 | Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions | Jiayu Chen et.al. | 2402.13777v1 | link |
2024-02-21 | Music Style Transfer with Time-Varying Inversion of Diffusion Models | Sifei Li et.al. | 2402.13763v1 | null |
2024-02-20 | Neural Network Diffusion | Kai Wang et.al. | 2402.13144v1 | link |
2024-02-20 | Excited state-specific CASSCF theory for the torsion of ethylene | Sandra Saade et.al. | 2402.13046v1 | null |
2024-02-20 | Text-Guided Molecule Generation with Diffusion Language Model | Haisong Gong et.al. | 2402.13040v1 | link |
2024-02-20 | Visual Style Prompting with Swapping Self-Attention | Jaeseok Jeong et.al. | 2402.12974v1 | link |
2024-02-20 | CLIPping the Deception: Adapting Vision-Language Models for Universal Deepfake Detection | Sohail Ahmed Khan et.al. | 2402.12927v1 | null |
2024-02-20 | RealCompo: Dynamic Equilibrium between Realism and Compositionality Improves Text-to-Image Diffusion Models | Xinchen Zhang et.al. | 2402.12908v1 | link |
2024-02-19 | FiT: Flexible Vision Transformer for Diffusion Model | Zeyu Lu et.al. | 2402.12376v1 | link |
2024-02-19 | Analysis of Persian News Agencies on Instagram, A Words Co-occurrence Graph-based Approach | Mohammad Heydari et.al. | 2402.12272v1 | null |
2024-02-19 | Synthetic location trajectory generation using categorical diffusion models | Simon Dirmeier et.al. | 2402.12242v1 | link |
2024-02-19 | Diffusion Tempering Improves Parameter Estimation with Probabilistic Integrators for Ordinary Differential Equations | Jonas Beck et.al. | 2402.12231v1 | link |
2024-02-19 | Adversarial Feature Alignment: Balancing Robustness and Accuracy in Deep Learning via Adversarial Training | Leo Hyun Park et.al. | 2402.12187v1 | null |
2024-02-19 | Human Video Translation via Query Warping | Haiming Zhu et.al. | 2402.12099v1 | null |
2024-02-16 | Fusion of Diffusion Weighted MRI and Clinical Data for Predicting Functional Outcome after Acute Ischemic Stroke with Deep Contrastive Learning | Chia-Ling Tsai et.al. | 2402.10894v1 | null |
2024-02-16 | 3D Diffuser Actor: Policy Diffusion with 3D Scene Representations | Tsung-Wei Ke et.al. | 2402.10885v1 | null |
2024-02-16 | Control Color: Multimodal Diffusion-based Interactive Image Colorization | Zhexin Liang et.al. | 2402.10855v1 | null |
2024-02-16 | Training Class-Imbalanced Diffusion Model Via Overlap Optimization | Divin Yan et.al. | 2402.10821v1 | link |
2024-02-16 | VATr++: Choose Your Words Wisely for Handwritten Text Generation | Bram Vanherle et.al. | 2402.10798v1 | null |
2024-02-16 | Rethinking Human-like Translation Strategy: Integrating Drift-Diffusion Model with Large Language Models for Machine Translation | Hongbin Na et.al. | 2402.10699v1 | null |
2024-02-16 | Decomposition for Enhancing Attention: Improving LLM-based Text-to-SQL through Workflow Paradigm | Yuanzhen Xie et.al. | 2402.10671v1 | link |
2024-02-15 | Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation | Huizhuo Yuan et.al. | 2402.10210v1 | null |
2024-02-15 | Recovering the Pre-Fine-Tuning Weights of Generative Models | Eliahu Horwitz et.al. | 2402.10208v1 | link |
2024-02-15 | Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment | Rui Yang et.al. | 2402.10207v1 | link |
2024-02-15 | Energy Flux Decomposition in Magnetohydrodynamic Turbulence | D. Capocci et.al. | 2402.10125v1 | null |
2024-02-15 | Collision efficiency of droplets across diffusive, electrostatic and inertial regimes | Florian Poydenot et.al. | 2402.10117v1 | null |
2024-02-15 | Quantized Embedding Vectors for Controllable Diffusion Language Models | Cheng Kang et.al. | 2402.10107v1 | null |
2024-02-15 | Classification Diffusion Models | Shahar Yadin et.al. | 2402.10095v1 | null |
2024-02-14 | Magic-Me: Identity-Specific Video Customized Diffusion | Ze Ma et.al. | 2402.09368v1 | link |
2024-02-14 | Leveraging Pre-Trained Autoencoders for Interpretable Prototype Learning of Music Audio | Pablo Alonso-Jiménez et.al. | 2402.09318v1 | null |
2024-02-14 | Synthesizing Knowledge-enhanced Features for Real-world Zero-shot Food Detection | Pengfei Zhou et.al. | 2402.09242v1 | link |
2024-02-13 | IM-3D: Iterative Multiview Diffusion and Reconstruction for High-Quality 3D Generation | Luke Melas-Kyriazi et.al. | 2402.08682v1 | null |
2024-02-13 | Target Score Matching | Valentin De Bortoli et.al. | 2402.08667v1 | null |
2024-02-13 | Learning Continuous 3D Words for Text-to-Image Generation | Ta-Ying Cheng et.al. | 2402.08654v1 | null |
2024-02-13 | Latent Inversion with Timestep-aware Sampling for Training-free Non-rigid Editing | Yunji Jung et.al. | 2402.08601v1 | null |
2024-02-13 | Denoising Diffusion Restoration Tackles Forward and Inverse Problems for the Laplace Operator | Amartya Mukherjee et.al. | 2402.08563v1 | null |
2024-02-13 | Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases | Ziyi Zhang et.al. | 2402.08552v1 | null |
2024-02-13 | Hyperballistic transport in dense ionized matter under external AC electric fields | Daniele Gamba et.al. | 2402.08519v1 | null |
2024-02-12 | Label-Efficient Model Selection for Text Generation | Shir Ashury-Tahan et.al. | 2402.07891v1 | null |
2024-02-12 | High-order harmonic generation in 2D Transition Metal Disulphides | Jose Manuel Iglesias et.al. | 2402.07850v1 | null |
2024-02-12 | Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models | Jiacheng Ye et.al. | 2402.07754v1 | link |
2024-02-12 | Topological Edge States in Reconfigurable Multi-stable Mechanical Metamaterials | Zhen Wang et.al. | 2402.07707v1 | null |
2024-02-12 | Higher-order Connection Laplacians for Directed Simplicial Complexes | Xue Gong et.al. | 2402.07631v1 | null |
2024-02-09 | Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous Driving and Zero-Shot Instruction Following | Brian Yang et.al. | 2402.06559v1 | null |
2024-02-09 | Sequential Flow Matching for Generative Modeling | Jongmin Yoon et.al. | 2402.06461v1 | null |
2024-02-09 | ControlUDA: Controllable Diffusion-assisted Unsupervised Domain Adaptation for Cross-Weather Semantic Segmentation | Fengyi Shen et.al. | 2402.06446v1 | null |
2024-02-09 | Improving 2D-3D Dense Correspondences with Diffusion Models for 6D Object Pose Estimation | Peter Hönig et.al. | 2402.06436v1 | null |
2024-02-09 | Enhanced bubble growth near an advancing solidification front | Jochem G. Meijer et.al. | 2402.06409v1 | null |
2024-02-08 | InstaGen: Enhancing Object Detection by Training on Synthetic Dataset | Chengjian Feng et.al. | 2402.05937v1 | null |
2024-02-08 | Time Series Diffusion in the Frequency Domain | Jonathan Crabbé et.al. | 2402.05933v1 | link |
2024-02-08 | AvatarMMC: 3D Head Avatar Generation and Editing with Multi-Modal Conditioning | Wamiq Reyaz Para et.al. | 2402.05803v1 | null |
2024-02-08 | Determining the significance and relative importance of parameters of a simulated quenching algorithm using statistical tools | Pedro A. Castillo et.al. | 2402.05791v1 | null |
2024-02-08 | DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer | Zhiyuan Ma et.al. | 2402.05712v1 | link |
2024-02-08 | Scalable Diffusion Models with State Space Backbone | Zhengcong Fei et.al. | 2402.05608v1 | link |
2024-02-07 | On diffusion models for amortized inference: Benchmarking and improving stochastic control and sampling | Marcin Sendera et.al. | 2402.05098v1 | link |
2024-02-07 | NITO: Neural Implicit Fields for Resolution-free Topology Optimization | Amin Heyrani Nobari et.al. | 2402.05073v1 | null |
2024-02-07 | LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation | Jiaxiang Tang et.al. | 2402.05054v1 | null |
2024-02-06 | SHIELD : An Evaluation Benchmark for Face Spoofing and Forgery Detection with Multimodal Large Language Models | Yichen Shi et.al. | 2402.04178v1 | link |
2024-02-06 | Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning | Ruoqi Zhang et.al. | 2402.04080v1 | link |
2024-02-06 | Generative Modeling of Graphs via Joint Diffusion of Node and Edge Attributes | Nimrod Berman et.al. | 2402.04046v1 | null |
2024-02-06 | Polyp-DDPM: Diffusion-Based Semantic Polyp Synthesis for Enhanced Segmentation | Zolnamar Dorjsembe et.al. | 2402.04031v1 | link |
2024-02-06 | Space Group Constrained Crystal Generation | Rui Jiao et.al. | 2402.03992v1 | null |
2024-02-06 | Controllable Diverse Sampling for Diffusion Based Motion Behavior Forecasting | Yiming Xu et.al. | 2402.03981v1 | null |
2024-02-06 | Weibel- and non-resonant Whistler wave growth in an expanding plasma in a 1D simulation geometry | M E Dieckmann et.al. | 2402.03925v1 | null |
2024-02-05 | Do Diffusion Models Learn Semantically Meaningful and Efficient Representations? | Qiyao Liang et.al. | 2402.03305v1 | null |
2024-02-05 | Zero-shot Object-Level OOD Detection with Context-Aware Inpainting | Quang-Huy Nguyen et.al. | 2402.03292v1 | null |
2024-02-05 | InstanceDiffusion: Instance-level Control for Image Generation | Xudong Wang et.al. | 2402.03290v1 | link |
2024-02-05 | Organic or Diffused: Can We Distinguish Human Art from AI-generated Images? | Anna Yoo Jeong Ha et.al. | 2402.03214v1 | null |
2024-02-05 | Light and Optimal Schrödinger Bridge Matching | Nikita Gushchin et.al. | 2402.03207v1 | link |
2024-02-05 | Guidance with Spherical Gaussian Constraint for Conditional Diffusion | Lingxiao Yang et.al. | 2402.03201v1 | null |
2024-02-05 | Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion | Shiyuan Yang et.al. | 2402.03162v1 | null |
2024-02-05 | DARTS: Diffusion Approximated Residual Time Sampling for Low Variance Time-of-flight Rendering in Homogeneous Scattering Medium | Qianyue He et.al. | 2402.03106v1 | null |
2024-02-02 | NeuroCine: Decoding Vivid Video Sequences from Human Brain Activties | Jingyuan Sun et.al. | 2402.01590v1 | null |
2024-02-02 | Boximator: Generating Rich and Controllable Motions for Video Synthesis | Jiawei Wang et.al. | 2402.01566v1 | null |
2024-02-02 | Low-Resource Cross-Domain Singing Voice Synthesis via Reduced Self-Supervised Speech Representations | Panos Kakoulidis et.al. | 2402.01520v1 | null |
2024-02-02 | Cross-view Masked Diffusion Transformers for Person Image Synthesis | Trung X. Pham et.al. | 2402.01516v1 | null |
2024-02-01 | AToM: Amortized Text-to-Mesh using 2D Diffusion | Guocheng Qian et.al. | 2402.00867v1 | null |
2024-02-01 | ViCA-NeRF: View-Consistency-Aware 3D Editing of Neural Radiance Fields | Jiahua Dong et.al. | 2402.00864v1 | link |
2024-02-01 | Distilling Conditional Diffusion Models for Offline Reinforcement Learning through Trajectory Stitching | Shangzhe Li et.al. | 2402.00807v1 | null |
2024-02-01 | AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning | Fu-Yun Wang et.al. | 2402.00769v1 | link |
2024-02-01 | CapHuman: Capture Your Moments in Parallel Universes | Chao Liang et.al. | 2402.00627v1 | link |
2024-02-01 | Diffusion-based Light Field Synthesis | Ruisheng Gao et.al. | 2402.00575v1 | null |
2024-01-31 | Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators | Daniel Geng et.al. | 2401.18085v1 | null |
2024-01-31 | An electrodynamic wave model for the action potential | Vitaly L. Galinsky et.al. | 2401.18051v1 | null |
2024-01-31 | Investigation of Microstructure and Corrosion Resistance of Ti-Al-V Titanium Alloys Obtained by Spark Plasma Sintering | Aleksey Nokhrin et.al. | 2401.17941v1 | null |
2024-01-31 | AEROBLADE: Training-Free Detection of Latent Diffusion Images Using Autoencoder Reconstruction Error | Jonas Ricker et.al. | 2401.17879v1 | link |
2024-01-30 | You Only Need One Step: Fast Super-Resolution with Stable Diffusion via Scale Distillation | Mehdi Noroozi et.al. | 2401.17258v1 | null |
2024-01-30 | ContactGen: Contact-Guided Interactive 3D Human Generation for Partners | Dongjun Gu et.al. | 2401.17212v1 | null |
2024-01-30 | Transfer Learning for Text Diffusion Models | Kehang Han et.al. | 2401.17181v1 | null |
2024-01-29 | Diffutoon: High-Resolution Editable Toon Shading via Diffusion Models | Zhongjie Duan et.al. | 2401.16224v1 | null |
2024-01-29 | Rapidly rotating radiatively driven convection: experimental and numerical validation of the `geostrophic turbulence' scaling predictions | Gabriel Hadjerci et.al. | 2401.16200v1 | null |
2024-01-29 | Spatial-Aware Latent Initialization for Controllable Image Generation | Wenqiang Sun et.al. | 2401.16157v1 | null |
2024-01-29 | Acoustic Screens based on Sonic Crystals with high Diffusion properties | M. P. Peiró-Torres et.al. | 2401.16074v1 | null |
2024-01-26 | Annotated Hands for Generative Models | Yue Yang et.al. | 2401.15075v1 | link |
2024-01-26 | Emulating Complex Synapses Using Interlinked Proton Conductors | Lifu Zhang et.al. | 2401.15045v1 | null |
2024-01-26 | DAM: Diffusion Activation Maximization for 3D Global Explanations | Hanxiao Tan et.al. | 2401.14938v1 | link |
2024-01-26 | Social norms and cooperation in higher-order networks | Yin-Jie Ma et.al. | 2401.14905v1 | null |
2024-01-25 | Deconstructing Denoising Diffusion Models for Self-Supervised Learning | Xinlei Chen et.al. | 2401.14404v1 | null |
2024-01-25 | pix2gestalt: Amodal Segmentation by Synthesizing Wholes | Ege Ozguroglu et.al. | 2401.14398v1 | link |
2024-01-25 | Manifold GCN: Diffusion-based Convolutional Neural Network for Manifold-valued Graphs | Martin Hanik et.al. | 2401.14381v1 | null |
2024-01-25 | UrbanGenAI: Reconstructing Urban Landscapes using Panoptic Segmentation and Diffusion Models | Timo Kapsalis et.al. | 2401.14379v1 | null |
2024-01-25 | Modeling Global Surface Dust Deposition Using Physics-Informed Neural Networks | Constanza A. Molina Catricheo et.al. | 2401.14372v1 | link |
2024-01-25 | Sketch2NeRF: Multi-view Sketch-guided Text-to-3D Generation | Minglin Chen et.al. | 2401.14257v1 | null |
2024-01-24 | Bi-Hamiltonian in Semiflexible Polymer as Strongly Coupled System | Heeyuen Koh et.al. | 2401.13655v1 | null |
2024-01-24 | On the self-similarity of unbounded viscous Marangoni flows | Fernando Temprano-Coleto et.al. | 2401.13647v1 | null |
2024-01-24 | Winding Clearness for Differentiable Point Cloud Optimization | Dong Xiao et.al. | 2401.13639v1 | null |
2024-01-24 | Guided Diffusion for Fast Inverse Design of Density-based Mechanical Metamaterials | Yanyan Yang et.al. | 2401.13570v1 | null |
2024-01-24 | Expressive Acoustic Guitar Sound Synthesis with an Instrument-Specific Input Representation and Diffusion Outpainting | Hounsu Kim et.al. | 2401.13498v1 | null |
2024-01-23 | GALA: Generating Animatable Layered Assets from a Single Scan | Taeksoo Kim et.al. | 2401.12979v1 | null |
2024-01-23 | Zero-Shot Learning for the Primitives of 3D Affordance in General Objects | Hyeonwoo Kim et.al. | 2401.12978v1 | null |
2024-01-23 | Lumiere: A Space-Time Diffusion Model for Video Generation | Omer Bar-Tal et.al. | 2401.12945v1 | null |
2024-01-23 | Long-range three-dimensional tracking of nanoparticles using interferometric scattering (iSCAT) microscopy | Kiarash Kasaian et.al. | 2401.12939v1 | null |
2024-01-22 | DITTO: Diffusion Inference-Time T-Optimization for Music Generation | Zachary Novack et.al. | 2401.12179v1 | null |
2024-01-22 | Single-View 3D Human Digitalization with Large Reconstruction Models | Zhenzhen Weng et.al. | 2401.12175v1 | null |
2024-01-22 | Improved accuracy of continuum surface flux models for metal additive manufacturing melt pool simulations | Nils Much et.al. | 2401.12114v1 | null |
2024-01-22 | Experimental investigation and scale analysis on melting of salty ice in a 3D-printed cavity filled with porous media | Xiaotian Liand Yuming Wang et.al. | 2401.12009v1 | null |
2024-01-22 | Claim Detection for Automated Fact-checking: A Survey on Monolingual, Multilingual and Cross-Lingual Research | Rrubaa Panchendrarajan et.al. | 2401.11969v1 | null |
2024-01-22 | Feature Denoising Diffusion Model for Blind Image Quality Assessment | Xudong Li et.al. | 2401.11949v1 | null |
2024-01-19 | Synthesizing Moving People with 3D Control | Boyi Li et.al. | 2401.10889v1 | null |
2024-01-19 | ActAnywhere: Subject-Aware Video Background Generation | Boxiao Pan et.al. | 2401.10822v1 | null |
2024-01-19 | Sat2Scene: 3D Urban Scene Generation from Satellite Images with Diffusion | Zuoyue Li et.al. | 2401.10786v1 | null |
2024-01-19 | Signatures of s-wave scattering in bound electronic states | Robin E. Moorby et.al. | 2401.10714v1 | null |
2024-01-19 | Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model | Yinan Zheng et.al. | 2401.10700v1 | link |
2024-01-19 | Refractive index measurement of pharmaceutical powders in the short-wave infrared range using index matching assisted with phase imaging | Cory Juntunen et.al. | 2401.10667v1 | null |
2024-01-19 | Analysis of the Patent of a Protective Cover for Vertical-Axis Wind Turbines (VAWTs): Simulations of Wind Flow | JA Moleón Baca et.al. | 2401.10656v1 | null |
2024-01-18 | A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting | Wouter Van Gansbeke et.al. | 2401.10227v1 | link |
2024-01-18 | Towards Language-Driven Video Inpainting via Multimodal Large Language Models | Jianzong Wu et.al. | 2401.10226v1 | null |
2024-01-18 | Motion-Zero: Zero-Shot Moving Object Control Framework for Diffusion-Based Video Generation | Changgu Chen et.al. | 2401.10150v1 | null |
2024-01-18 | DiffusionGPT: LLM-Driven Text-to-Image Generation System | Jie Qin et.al. | 2401.10061v1 | null |
2024-01-18 | CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects | Zhao Wang et.al. | 2401.09962v1 | null |
2024-01-17 | TextureDreamer: Image-guided Texture Synthesis through Geometry-aware Diffusion | Yu-Ying Yeh et.al. | 2401.09416v1 | null |
2024-01-17 | Vlogger: Make Your Dream A Vlog | Shaobin Zhuang et.al. | 2401.09414v1 | link |
2024-01-17 | Siamese Meets Diffusion Network: SMDNet for Enhanced Change Detection in High-Resolution RS Imagery | Jia Jia et.al. | 2401.09325v1 | null |
2024-01-17 | Tailoring chaotic motion of microcavity photons in ray and wave dynamics by tuning the curvature of space | Wei Lin et.al. | 2401.09303v1 | null |
2024-01-17 | T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis | Yoonjin Chung et.al. | 2401.09294v1 | null |
2024-01-16 | Robotic Imitation of Human Actions | Josua Spisak et.al. | 2401.08381v1 | null |
2024-01-16 | Optimization of the plasmonic properties of titanium nitride films sputtered at room temperature through microstructure and thickness control | Mateusz Nieborek et.al. | 2401.08353v1 | null |
2024-01-16 | Modeling Spoof Noise by De-spoofing Diffusion and its Application in Face Anti-spoofing | Bin Zhang et.al. | 2401.08275v1 | null |
2024-01-16 | Multi-scale 2D Temporal Map Diffusion Models for Natural Language Video Localization | Chongzhi Zhang et.al. | 2401.08232v1 | null |
2024-01-12 | Decoupling Pixel Flipping and Occlusion Strategy for Consistent XAI Benchmarks | Stefan Blücher et.al. | 2401.06654v1 | link |
2024-01-12 | Adversarial Examples are Misaligned in Diffusion Model Manifolds | Peter Lorenz et.al. | 2401.06637v1 | null |
2024-01-12 | Motion2VecSets: 4D Latent Vector Set Diffusion for Non-rigid Shape Reconstruction and Tracking | Wei Cao et.al. | 2401.06614v1 | null |
2024-01-12 | 360DVD: Controllable Panorama Video Generation with 360-Degree Video Diffusion Model | Qian Wang et.al. | 2401.06578v1 | null |
2024-01-11 | E |
Yifan Gong et.al. | 2401.06127v1 | null |
2024-01-11 | Numerical thermalization in 2D PIC simulations: Practical estimates for low temperature plasma simulations | Sierra Jubin et.al. | 2401.06057v1 | null |
2024-01-11 | DiffDA: a diffusion model for weather-scale data assimilation | Langwen Huang et.al. | 2401.05932v1 | null |
2024-01-11 | Efficient Image Deblurring Networks based on Diffusion Models | Kang Chen et.al. | 2401.05907v1 | link |
2024-01-10 | InseRF: Text-Driven Generative Object Insertion in Neural 3D Scenes | Mohamad Shahbazi et.al. | 2401.05335v1 | null |
2024-01-10 | Score Distillation Sampling with Learned Manifold Corrective | Thiemo Alldieck et.al. | 2401.05293v1 | null |
2024-01-10 | PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models | Junsong Chen et.al. | 2401.05252v1 | link |
2024-01-10 | Derm-T2IM: Harnessing Synthetic Skin Lesion Data via Stable Diffusion Models for Enhanced Skin Disease Classification using ViT and CNN | Muhammad Ali Farooq et.al. | 2401.05159v1 | null |
2024-01-10 | CrossDiff: Exploring Self-Supervised Representation of Pansharpening via Cross-Predictive Diffusion Model | Yinghui Xing et.al. | 2401.05153v1 | null |
2024-01-09 | Morphable Diffusion: 3D-Consistent Diffusion for Single-image Avatar Creation | Xiyi Chen et.al. | 2401.04728v1 | null |
2024-01-09 | EmoGen: Emotional Image Content Generation with Text-to-Image Diffusion Models | Jingyuan Yang et.al. | 2401.04608v1 | null |
2024-01-09 | Enhanced Distribution Alignment for Post-Training Quantization of Diffusion Models | Xuewen Liu et.al. | 2401.04585v1 | link |
2024-01-09 | MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation | Weimin Wang et.al. | 2401.04468v1 | null |
2024-01-09 | D3AD: Dynamic Denoising Diffusion Probabilistic Model for Anomaly Detection | Justin Tebbe et.al. | 2401.04463v1 | link |
2024-01-08 | D3PRefiner: A Diffusion-based Denoise Method for 3D Human Pose Refinement | Danqi Yan et.al. | 2401.03914v1 | null |
2024-01-05 | Uncovering the human motion pattern: Pattern Memory-based Diffusion Model for Trajectory Prediction | Yuxin Yang et.al. | 2401.02916v1 | null |
2024-01-05 | Plug-in Diffusion Model for Sequential Recommendation | Haokai Ma et.al. | 2401.02913v1 | link |
2024-01-05 | Generating Non-Stationary Textures using Self-Rectification | Yang Zhou et.al. | 2401.02847v1 | link |
2024-01-05 | Diffbody: Diffusion-based Pose and Shape Editing of Human Images | Yuta Okuyama et.al. | 2401.02804v1 | link |
2024-01-05 | Diffusion Variational Inference: Diffusion Models as Expressive Variational Posteriors | Top Piriyakulkij et.al. | 2401.02739v1 | null |
2024-01-05 | Geometric-Facilitated Denoising Diffusion Model for 3D Molecule Generation | Can Xu et.al. | 2401.02683v1 | link |
2024-01-04 | Bring Metric Functions into Diffusion Models | Jie An et.al. | 2401.02414v1 | null |
2024-01-04 | Image denoising and model-independent parameterization for improving IVIM MRI | Caleb Sample et.al. | 2401.02394v1 | null |
2024-01-04 | Integration of physics-informed operator learning and finite element method for parametric learning of partial differential equations | Shahed Rezaei et.al. | 2401.02363v1 | null |
2024-01-04 | Robust Physics Informed Neural Networks | Marcin Łoś et.al. | 2401.02300v1 | null |
2024-01-03 | From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations | Evonne Ng et.al. | 2401.01885v1 | link |
2024-01-03 | DGDNN: Decoupled Graph Diffusion Neural Network for Stock Movement Prediction | Zinuo You et.al. | 2401.01846v1 | link |
2024-01-03 | Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions | David Junhao Zhang et.al. | 2401.01827v1 | link |
2024-01-03 | aMUSEd: An Open MUSE Reproduction | Suraj Patil et.al. | 2401.01808v1 | link |
2024-01-03 | Short-time expansion of one-dimensional Fokker-Planck equations with heterogeneous diffusion | Tom Dupont et.al. | 2401.01765v1 | null |
2024-01-02 | Influence of scanning plane on Human Spinal Cord functional Magnetic Resonance echo planar imaging | Marta Moraschi et.al. | 2401.01281v1 | null |
2024-01-02 | Fairness Certification for Natural Language Processing and Large Language Models | Vincent Freiberger et.al. | 2401.01262v1 | null |
2024-01-02 | VideoDrafter: Content-Consistent Multi-Scene Video Generation with LLM | Fuchen Long et.al. | 2401.01256v1 | null |
2024-01-02 | Towards a Simultaneous and Granular Identity-Expression Control in Personalized Face Generation | Renshuai Liu et.al. | 2401.01207v1 | null |
2024-01-02 | Learning Surface Scattering Parameters From SAR Images Using Differentiable Ray Tracing | Jiangtao Wei et.al. | 2401.01175v1 | null |
2024-01-02 | Joint Generative Modeling of Scene Graphs and Images via Diffusion Models | Bicheng Xu et.al. | 2401.01130v1 | null |
2023-12-29 | FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis | Feng Liang et.al. | 2312.17681v1 | null |
2023-12-29 | Data Augmentation for Supervised Graph Outlier Detection with Latent Diffusion Models | Kay Liu et.al. | 2312.17679v1 | link |
2023-12-29 | Leveraging Open-Vocabulary Diffusion to Camouflaged Instance Segmentation | Tuan-Anh Vu et.al. | 2312.17505v1 | null |
2023-12-28 | iFusion: Inverting Diffusion for Pose-Free Reconstruction from Sparse Views | Chin-Hsuan Wu et.al. | 2312.17250v1 | link |
2023-12-28 | Amodal Ground Truth and Completion in the Wild | Guanqi Zhan et.al. | 2312.17247v1 | link |
2023-12-28 | Personalized Restoration via Dual-Pivot Tuning | Pradyumna Chari et.al. | 2312.17234v1 | null |
2023-12-28 | 4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency | Yuyang Yin et.al. | 2312.17225v1 | null |
2023-12-28 | EFHQ: Multi-purpose ExtremePose-Face-HQ dataset | Trung Tuan Dao et.al. | 2312.17205v1 | null |
2023-12-28 | Restoration by Generation with Constrained Priors | Zheng Ding et.al. | 2312.17161v1 | null |
2023-12-28 | InsActor: Instruction-driven Physics-based Characters | Jiawei Ren et.al. | 2312.17135v1 | null |
2023-12-28 | 100-fold improvement in relaxed eddy accumulation flux estimates through error diffusion | Anas Emad et.al. | 2312.17027v1 | link |
2023-12-26 | One-dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications | Mengyao Lyu et.al. | 2312.16145v1 | null |
2023-12-26 | HarmonyView: Harmonizing Consistency and Diversity in One-Image-to-3D | Sangmin Woo et.al. | 2312.15980v1 | link |
2023-12-26 | Semantic Guidance Tuning for Text-To-Image Diffusion Models | Hyun Kang et.al. | 2312.15964v1 | null |
2023-12-26 | EnchantDance: Unveiling the Potential of Music-Driven Dance Movement | Bo Han et.al. | 2312.15946v1 | link |
2023-12-22 | MACS: Mass Conditioned 3D Hand and Object Motion Synthesis | Soshi Shimada et.al. | 2312.14929v1 | null |
2023-12-22 | BrainVis: Exploring the Bridge between Brain and Visual Signals via Image Reconstruction | Honghao Fu et.al. | 2312.14871v1 | null |
2023-12-22 | Dreaming of Electrical Waves: Generative Modeling of Cardiac Excitation Waves using Diffusion Models | Tanish Baranwal et.al. | 2312.14830v1 | null |
2023-12-22 | Neural network models for preferential concentration of particles in two-dimensional turbulence | Thibault Maurel-Oujia et.al. | 2312.14829v1 | null |
2023-12-22 | Plan, Posture and Go: Towards Open-World Text-to-Motion Generation | Jinpeng Liu et.al. | 2312.14828v1 | null |
2023-12-22 | Disorder-induced non-linear growth of viscously-unstable immiscible two-phase flow fingers in porous media | Santanu Sinha et.al. | 2312.14799v1 | null |
2023-12-22 | Diffusion Maps for Signal Filtering in Graph Learning | Todd Hildebrant et.al. | 2312.14758v1 | null |
2023-12-21 | Diffusion Reward: Learning Rewards via Conditional Video Diffusion | Tao Huang et.al. | 2312.14134v1 | null |
2023-12-21 | Neural Point Cloud Diffusion for Disentangled 3D Shape and Appearance Generation | Philipp Schröppel et.al. | 2312.14124v1 | link |
2023-12-21 | HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models | Hayk Manukyan et.al. | 2312.14091v1 | link |
2023-12-21 | Designing Artificial Intelligence Equipped Social Decentralized Autonomous Organizations for Tackling Sextortion Cases Version 0.7 | Norta Alex et.al. | 2312.14090v1 | null |
2023-12-21 | The influence of controlled vibration effects on fluid flow | Alexey Fedyushkin et.al. | 2312.14079v1 | null |
2023-12-21 | Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning | Desai Xie et.al. | 2312.13980v1 | null |
2023-12-21 | Controllable 3D Face Generation with Conditional Style Code Diffusion | Xiaolong Shen et.al. | 2312.13941v1 | link |
2023-12-20 | Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting | Junwu Zhang et.al. | 2312.13271v1 | link |
2023-12-20 | Conditional Image Generation with Pretrained Generative Model | Rajesh Shrestha et.al. | 2312.13253v1 | null |
2023-12-20 | Zero-Shot Metric Depth with a Field-of-View Conditioned Diffusion Model | Saurabh Saxena et.al. | 2312.13252v1 | null |
2023-12-20 | Diffusion Models With Learned Adaptive Noise | Subham Sekhar Sahoo et.al. | 2312.13236v1 | link |
2023-12-20 | MoSAR: Monocular Semi-Supervised Model for Avatar Reconstruction using Differentiable Shading | Abdallah Dib et.al. | 2312.13091v1 | null |
2023-12-20 | DiffPortrait3D: Controllable Diffusion for Zero-Shot Portrait View Synthesis | Yuming Gu et.al. | 2312.13016v1 | link |
2023-12-20 | A comparative study of analytical models of diffuse reflectance in homogeneous biological tissues: Gelatin based phantoms and Monte Carlo experiments | Anisha Bahl et.al. | 2312.12935v1 | null |
2023-12-19 | On Inference Stability for Diffusion Models | Viet Nguyen et.al. | 2312.12431v1 | link |
2023-12-19 | SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process | Mengyu Wang et.al. | 2312.12425v1 | link |
2023-12-19 | Scene-Conditional 3D Object Stylization and Composition | Jinghao Zhou et.al. | 2312.12419v1 | null |
2023-12-19 | LASA: Instance Reconstruction from Real Scans using A Large-scale Aligned Shape Annotation Dataset | Haolin Liu et.al. | 2312.12418v1 | null |
2023-12-19 | Prompting Hard or Hardly Prompting: Prompt Inversion for Text-to-Image Diffusion Models | Shweta Mahajan et.al. | 2312.12416v1 | null |
2023-12-19 | Intrinsic Image Diffusion for Single-view Material Estimation | Peter Kocsis et.al. | 2312.12274v1 | link |
2023-12-19 | Brush Your Text: Synthesize Any Scene Text on Images via Diffusion Model | Lingjun Zhang et.al. | 2312.12232v1 | link |
2023-12-18 | A novel diffusion recommendation algorithm based on multi-scale cnn and residual lstm | Yong Niu et.al. | 2312.10885v1 | null |
2023-12-17 | Your Student is Better Than Expected: Adaptive Teacher-Student Collaboration for Text-Conditional Diffusion Models | Nikita Starodubcev et.al. | 2312.10835v1 | link |
2023-12-17 | From mixing to displacement of miscible phases in porous media: The role of heterogeneity and inlet pressure | Yahel Eliyahu-Yakir et.al. | 2312.10722v1 | null |
2023-12-17 | CogCartoon: Towards Practical Story Visualization | Zhongyang Zhu et.al. | 2312.10718v1 | null |
2023-12-17 | A Framework of Full-Process Generation Design for Park Green Spaces Based on Remote Sensing Segmentation-GAN-Diffusion | Ran Chen et.al. | 2312.10674v1 | null |
2023-12-15 | Movement Primitive Diffusion: Learning Gentle Robotic Manipulation of Deformable Objects | Paul Maria Scheikl et.al. | 2312.10008v1 | null |
2023-12-15 | Contributions to the geomagnetic secular variation from a reanalysis of core surface dynamics | Olivier Barrois et.al. | 2312.09942v1 | null |
2023-12-15 | Assimilation of ground and satellite magnetic measurements: inference of core surface magnetic and velocity field changes | Olivier Barrois et.al. | 2312.09878v1 | null |
2023-12-15 | Integrating New Technologies into Science: The case of AI | Stefano Bianchini et.al. | 2312.09843v1 | null |
2023-12-15 | Socio-Economic Deprivation Analysis: Diffusion Maps | June Moh Goo et.al. | 2312.09830v1 | null |
2023-12-15 | Comparison of Quasi-Geostrophic, Hybrid and 3D models of planetary core convection | Olivier Barrois et.al. | 2312.09826v1 | null |
2023-12-15 | Neural networks for turbulent transport prediction in a simplified model of tokamak plasmas | L. M. Pomârjanschi et.al. | 2312.09807v1 | null |
2023-12-14 | LIME: Localized Image Editing via Attention Regularization in Diffusion Models | Enis Simsar et.al. | 2312.09256v1 | null |
2023-12-14 | FineControlNet: Fine-level Text Control for Image Generation with Spatially Aligned Text Control Injection | Hongsuk Choi et.al. | 2312.09252v1 | null |
2023-12-14 | Single Mesh Diffusion Models with Field Latents for Texture Generation | Thomas W. Mitchel et.al. | 2312.09250v1 | null |
2023-12-14 | Text2Immersion: Generative Immersive Scene with 3D Gaussians | Hao Ouyang et.al. | 2312.09242v1 | null |
2023-12-14 | A framework for conditional diffusion modelling with applications in motif scaffolding for protein design | Kieran Didi et.al. | 2312.09236v1 | null |
2023-12-14 | Reliability in Semantic Segmentation: Can We Use Synthetic Data? | Thibaut Loiseau et.al. | 2312.09231v1 | null |
2023-12-14 | Mosaic-SDF for 3D Generative Models | Lior Yariv et.al. | 2312.09222v1 | null |
2023-12-14 | Measurement in the Age of LLMs: An Application to Ideological Scaling | Sean O'Hagan et.al. | 2312.09203v1 | null |
2023-12-14 | Fast Sampling via De-randomization for Discrete Diffusion Models | Zixiang Chen et.al. | 2312.09193v1 | null |
2023-12-13 | PnPNet: Pull-and-Push Networks for Volumetric Segmentation with Boundary Confusion | Xin You et.al. | 2312.08323v1 | link |
2023-12-13 | Black-box Membership Inference Attacks against Fine-tuned Diffusion Models | Yan Pang et.al. | 2312.08207v1 | link |
2023-12-13 | SPD-DDPM: Denoising Diffusion Probabilistic Models in the Symmetric Positive Definite Space | Yunchen Li et.al. | 2312.08200v1 | link |
2023-12-13 | Concept-centric Personalization with Large-scale Diffusion Priors | Pu Cao et.al. | 2312.08195v1 | link |
2023-12-13 | Maxwell X. Cai et.al. | 2312.08153v1 | link | |
2023-12-13 | Clockwork Diffusion: Efficient Generation With Model-Step Distillation | Amirhossein Habibian et.al. | 2312.08128v1 | link |
2023-12-12 | FreeInit: Bridging Initialization Gap in Video Diffusion Models | Tianxing Wu et.al. | 2312.07537v1 | link |
2023-12-12 | FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition | Sicheng Mo et.al. | 2312.07536v1 | null |
2023-12-12 | PEEKABOO: Interactive Video Generation via Masked-Diffusion | Yash Jain et.al. | 2312.07509v1 | null |
2023-12-12 | MinD-3D: Reconstruct High-quality 3D objects in Human Brain | Jianxiong Gao et.al. | 2312.07485v1 | null |
2023-12-12 | DiffMorpher: Unleashing the Capability of Diffusion Models for Image Morphing | Kaiwen Zhang et.al. | 2312.07409v1 | null |
2023-12-12 | Boosting Latent Diffusion with Flow Matching | Johannes S. Fischer et.al. | 2312.07360v1 | link |
2023-12-12 | Momentum Particle Maximum Likelihood | Jen Ning Lim et.al. | 2312.07335v1 | null |
2023-12-11 | CAD: Photorealistic 3D Generation via Adversarial Distillation | Ziyu Wan et.al. | 2312.06663v1 | null |
2023-12-11 | Photorealistic Video Generation with Diffusion Models | Agrim Gupta et.al. | 2312.06662v1 | null |
2023-12-11 | UpFusion: Novel View Diffusion from Unposed Sparse View Observations | Bharath Raj Nagoor Kani et.al. | 2312.06661v1 | null |
2023-12-11 | Sherpa3D: Boosting High-Fidelity Text-to-3D Generation via Coarse 3D Prior | Fangfu Liu et.al. | 2312.06655v1 | link |
2023-12-11 | Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution | Shangchen Zhou et.al. | 2312.06640v1 | null |
2023-12-11 | DiAD: A Diffusion-based Framework for Multi-class Anomaly Detection | Haoyang He et.al. | 2312.06607v1 | link |
2023-12-11 | ControlNet-XS: Designing an Efficient and Effective Architecture for Controlling Text-to-Image Diffusion Models | Denis Zavadski et.al. | 2312.06573v1 | link |
2023-12-11 | HOI-Diff: Text-Driven Synthesis of 3D Human-Object Interactions using Diffusion Models | Xiaogang Peng et.al. | 2312.06553v1 | null |
2023-12-11 | In-situ Synchrotron X-Ray Photoelectron Spectroscopy Study of Medium-Temperature Baking of Niobium for SRF Application | Alena Prudnikava et.al. | 2312.06529v1 | null |
2023-12-08 | KBFormer: A Diffusion Model for Structured Entity Completion | Ouail Kitouni et.al. | 2312.05253v1 | null |
2023-12-08 | SwiftBrush: One-Step Text-to-Image Diffusion Model with Variational Score Distillation | Thuan Hoang Nguyen et.al. | 2312.05239v1 | null |
2023-12-08 | Stoichiometry preservation and generalization of Bilger mixture fraction for non-premixed combustion with differential molecular diffusion | Haifeng Wang et.al. | 2312.05204v1 | null |
2023-12-08 | Membership Inference Attacks on Diffusion Models via Quantile Regression | Shuai Tang et.al. | 2312.05140v1 | null |
2023-12-08 | DreaMoving: A Human Dance Video Generation Framework based on Diffusion Models | Mengyang Feng et.al. | 2312.05107v1 | null |
2023-12-08 | Application of deep learning to the estimation of normalization coefficients in diffusion-based covariance models | Folke K Skrunes et.al. | 2312.05068v1 | link |
2023-12-08 | SmartMask: Context Aware High-Fidelity Mask Generation for Fine-grained Object Insertion and Layout Control | Jaskirat Singh et.al. | 2312.05039v1 | null |
2023-12-08 | Numerical determination of iron dust laminar flame speeds with the counterflow twin-flame technique | C. E. A. G. van Gool et.al. | 2312.04994v1 | null |
2023-12-07 | Gen2Det: Generate to Detect | Saksham Suri et.al. | 2312.04566v1 | null |
2023-12-07 | NeRFiller: Completing Scenes via Generative 3D Inpainting | Ethan Weber et.al. | 2312.04560v1 | null |
2023-12-07 | PrimDiffusion: Volumetric Primitives Diffusion for 3D Human Generation | Zhaoxi Chen et.al. | 2312.04559v1 | link |
2023-12-07 | GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation | Shoufa Chen et.al. | 2312.04557v1 | null |
2023-12-07 | SPIDeRS: Structured Polarization for Invisible Depth and Reflectance Sensing | Tomoki Ichikawa et.al. | 2312.04553v1 | null |
2023-12-07 | Generating Illustrated Instructions | Sachit Menon et.al. | 2312.04552v1 | null |
2023-12-07 | PlayFusion: Skill Acquisition via Diffusion from Language-Annotated Play | Lili Chen et.al. | 2312.04549v1 | null |
2023-12-07 | HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image | Tong Wu et.al. | 2312.04543v1 | null |
2023-12-07 | Diffusion Reflectance Map: Single-Image Stochastic Inverse Rendering of Illumination and Reflectance | Yuto Enyo et.al. | 2312.04529v1 | null |
2023-12-07 | RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models | Ozgur Kara et.al. | 2312.04524v1 | link |
2023-12-06 | Relightable Gaussian Codec Avatars | Shunsuke Saito et.al. | 2312.03704v1 | null |
2023-12-06 | Self-conditioned Image Generation via Generating Representations | Tianhong Li et.al. | 2312.03701v1 | link |
2023-12-06 | Memory Triggers: Unveiling Memorization in Text-To-Image Generative Models through Word-Level Duplication | Ali Naseh et.al. | 2312.03692v1 | null |
2023-12-06 | WarpDiffusion: Efficient Diffusion Model for High-Fidelity Virtual Try-on | xujie zhang et.al. | 2312.03667v1 | null |
2023-12-06 | TokenCompose: Grounding Diffusion with Token-level Supervision | Zirui Wang et.al. | 2312.03626v1 | link |
2023-12-06 | DreamComposer: Controllable 3D Object Generation via Multi-View Conditions | Yunhan Yang et.al. | 2312.03611v1 | null |
2023-12-06 | DiffusionSat: A Generative Foundation Model for Satellite Imagery | Samar Khanna et.al. | 2312.03606v1 | null |
2023-12-06 | MMM: Generative Masked Motion Model | Ekkasit Pinyoanuntapong et.al. | 2312.03596v1 | link |
2023-12-05 | ReconFusion: 3D Reconstruction with Diffusion Priors | Rundi Wu et.al. | 2312.02981v1 | null |
2023-12-05 | Alchemist: Parametric Control of Material Properties with Diffusion Models | Prafull Sharma et.al. | 2312.02970v1 | null |
2023-12-05 | AmbiGen: Generating Ambigrams from Pre-trained Diffusion Model | Boheng Zhao et.al. | 2312.02967v1 | null |
2023-12-05 | Diffusion-SS3D: Diffusion Model for Semi-supervised 3D Object Detection | Cheng-Ju Ho et.al. | 2312.02966v1 | link |
2023-12-05 | Drag-A-Video: Non-rigid Video Editing with Point-based Interaction | Yao Teng et.al. | 2312.02936v1 | null |
2023-12-05 | WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation | Jiachen Lu et.al. | 2312.02934v1 | link |
2023-12-05 | LivePhoto: Real Image Animation with Text-guided Motion Control | Xi Chen et.al. | 2312.02928v1 | null |
2023-12-05 | Multimodal Prompt Perceiver: Empower Adaptiveness, Generalizability and Fidelity for All-in-One Image Restoration | Yuang Ai et.al. | 2312.02918v1 | null |
2023-12-04 | Latent Feature-Guided Diffusion Models for Shadow Removal | Kangfu Mei et.al. | 2312.02156v1 | null |
2023-12-04 | Readout Guidance: Learning Control from Diffusion Features | Grace Luo et.al. | 2312.02150v1 | null |
2023-12-04 | Generative Powers of Ten | Xiaojuan Wang et.al. | 2312.02149v1 | null |
2023-12-04 | Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation | Bingxin Ke et.al. | 2312.02145v1 | link |
2023-12-04 | DiffiT: Diffusion Vision Transformers for Image Generation | Ali Hatamizadeh et.al. | 2312.02139v1 | link |
2023-12-04 | Style Aligned Image Generation via Shared Attention | Amir Hertz et.al. | 2312.02133v1 | link |
2023-12-04 | VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence | Yuchao Gu et.al. | 2312.02087v1 | null |
2023-12-04 | Computational Investigation on Collective Dynamical Behaviors of Flickering Laminar Buoyant Diffusion Flames in Circular Arrays | Tao Yang et.al. | 2312.02018v1 | null |
2023-12-01 | MorpheuS: Neural Dynamic 360° Surface Reconstruction from Monocular RGB-D Video | Hengyi Wang et.al. | 2312.00778v1 | null |
2023-12-01 | VideoBooth: Diffusion-based Video Generation with Image Prompts | Yuming Jiang et.al. | 2312.00777v1 | null |
2023-12-01 | CompuCell3D Model of Cell Migration Reproduces Chemotaxis | Pedro C. Dal-Castel et.al. | 2312.00776v1 | link |
2023-12-01 | Effects of three-dimensional slit geometry on flashback of premixed hydrogen flames in perforated burners | Filippo Fruzza et.al. | 2312.00744v1 | null |
2023-12-01 | Resource-constrained knowledge diffusion processes inspired by human peer learning | Ehsan Beikihassan et.al. | 2312.00660v1 | null |
2023-12-01 | TrackDiffusion: Multi-object Tracking Data Generation via Diffusion Models | Pengxiang Li et.al. | 2312.00651v1 | null |
2023-12-01 | How the zebra got its stripes: Curvature-dependent diffusion orients Turing patterns on 3D surfaces | Michael F. Staddon et.al. | 2312.00637v1 | null |
2023-11-30 | VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models | Zhen Xing et.al. | 2311.18837v1 | null |
2023-11-30 | ART |
Wenming Weng et.al. | 2311.18834v1 | null |
2023-11-30 | Exploiting Diffusion Prior for Generalizable Pixel-Level Semantic Prediction | Hsin-Ying Lee et.al. | 2311.18832v1 | link |
2023-11-30 | MotionEditor: Editing Video Motion via Content-Aware Diffusion | Shuyuan Tu et.al. | 2311.18830v1 | link |
2023-11-30 | MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation | Yanhui Wang et.al. | 2311.18829v1 | null |
2023-11-30 | One-step Diffusion with Distribution Matching Distillation | Tianwei Yin et.al. | 2311.18828v1 | null |
2023-11-30 | ElasticDiffusion: Training-free Arbitrary Size Image Generation | Moayed Haji-Ali et.al. | 2311.18822v1 | link |
2023-11-30 | Continual Diffusion with STAMINA: STack-And-Mask INcremental Adapters | James Seale Smith et.al. | 2311.18763v1 | null |
2023-11-29 | Do text-free diffusion models learn discriminative visual representations? | Soumik Mukhopadhyay et.al. | 2311.17921v1 | link |
2023-11-29 | Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models | Daniel Geng et.al. | 2311.17919v1 | null |
2023-11-29 | AvatarStudio: High-fidelity and Animatable 3D Avatar Creation from Text | Jianfeng Zhang et.al. | 2311.17917v1 | null |
2023-11-29 | CG3D: Compositional Generation for Text-to-3D via Gaussian Splatting | Alexander Vilesov et.al. | 2311.17907v1 | null |
2023-11-29 | SODA: Bottleneck Diffusion Models for Representation Learning | Drew A. Hudson et.al. | 2311.17901v1 | null |
2023-11-29 | Leveraging Graph Diffusion Models for Network Refinement Tasks | Puja Trivedi et.al. | 2311.17856v1 | null |
2023-11-29 | SPiC-E : Structural Priors in 3D Diffusion Models using Cross Entity Attention | Etai Sella et.al. | 2311.17834v1 | null |
2023-11-29 | Analyzing and Explaining Image Classifiers via Diffusion Guidance | Maximilian Augustin et.al. | 2311.17833v1 | null |
2023-11-28 | Material Palette: Extraction of Materials from a Single Image | Ivan Lopes et.al. | 2311.17060v1 | null |
2023-11-28 | ReMoS: Reactive 3D Motion Synthesis for Two-Person Interactions | Anindita Ghosh et.al. | 2311.17057v1 | null |
2023-11-28 | DiffuseBot: Breeding Soft Robots With Physics-Augmented Generative Diffusion Models | Tsun-Hsuan Wang et.al. | 2311.17053v1 | null |
2023-11-28 | Surf-D: High-Quality Surface Generation for Arbitrary Topologies using Diffusion Models | Zhengming Yu et.al. | 2311.17050v1 | null |
2023-11-28 | Adversarial Diffusion Distillation | Axel Sauer et.al. | 2311.17042v1 | link |
2023-11-28 | Rumors with Changing Credibility | Charlotte Out et.al. | 2311.17040v1 | null |
2023-11-28 | Diffusion 3D Features (Diff3F): Decorating Untextured Shapes with Distilled Semantic Features | Niladri Shekhar Dutt et.al. | 2311.17024v1 | link |
2023-11-28 | Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer | Danah Yatim et.al. | 2311.17009v1 | null |
2023-11-28 | Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following | Yutong Feng et.al. | 2311.17002v1 | null |
2023-11-27 | Test-time Adaptation of Discriminative Models via Diffusion Generative Feedback | Mihir Prabhudesai et.al. | 2311.16102v1 | null |
2023-11-27 | CG-HOI: Contact-Guided 3D Human-Object Interaction Generation | Christian Diller et.al. | 2311.16097v1 | null |
2023-11-27 | Street TryOn: Learning In-the-Wild Virtual Try-On from Unpaired Person Images | Aiyu Cui et.al. | 2311.16094v1 | null |
2023-11-27 | Self-correcting LLM-controlled Diffusion Models | Tsung-Han Wu et.al. | 2311.16090v1 | null |
2023-11-27 | DiffSLVA: Harnessing Diffusion Models for Sign Language Video Anonymization | Zhaoyang Xia et.al. | 2311.16060v1 | link |
2023-11-27 | Exploring Attribute Variations in Style-based GANs using Diffusion Models | Rishubh Parihar et.al. | 2311.16052v1 | null |
2023-11-27 | GaussianEditor: Editing 3D Gaussians Delicately with Text Instructions | Jiemin Fang et.al. | 2311.16037v1 | null |
2023-11-27 | Closing the ODE-SDE gap in score-based diffusion models through the Fokker-Planck equation | Teo Deveney et.al. | 2311.15996v1 | null |
2023-11-27 | DiffAnt: Diffusion Models for Action Anticipation | Zeyun Zhong et.al. | 2311.15991v1 | null |
2023-11-24 | CatVersion: Concatenating Embeddings for Diffusion-Based Text-to-Image Personalization | Ruoyu Zhao et.al. | 2311.14631v1 | null |
2023-11-24 | Received Signal and Channel Parameter Estimation in Molecular Communications | O. Tansel Baydas et.al. | 2311.14621v1 | null |
2023-11-24 | Animate124: Animating One Image to 4D Dynamic Scene | Yuyang Zhao et.al. | 2311.14603v1 | null |
2023-11-24 | On the thermodynamic invariance of fine-grain and coarse-grain fluid models | Thomas Dubos et.al. | 2311.14564v1 | null |
2023-11-24 | ToddlerDiffusion: Flash Interpretable Controllable Diffusion Model | Eslam Mohamed Bakr et.al. | 2311.14542v1 | null |
2023-11-24 | GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting | Yiwen Chen et.al. | 2311.14521v1 | link |
2023-11-24 | MVControl: Adding Conditional Control to Multi-view Diffusion for Controllable Text-to-3D Generation | Zhiqi Li et.al. | 2311.14494v1 | link |
2023-11-22 | On diffusion-based generative models and their error bounds: The log-concave case with full convergence estimates | Stefano Bruno et.al. | 2311.13584v1 | null |
2023-11-22 | WildFusion: Learning 3D-Aware Latent Diffusion Models in View Space | Katja Schwarz et.al. | 2311.13570v1 | null |
2023-11-22 | ADriver-I: A General World Model for Autonomous Driving | Fan Jia et.al. | 2311.13549v1 | null |
2023-11-22 | DiffusionMat: Alpha Matting as Sequential Refinement Learning | Yangyang Xu et.al. | 2311.13535v1 | null |
2023-11-22 | Guided Flows for Generative Modeling and Decision Making | Qinqing Zheng et.al. | 2311.13443v1 | null |
2023-11-22 | LucidDreamer: Domain-free Generation of 3D Gaussian Splatting Scenes | Jaeyoung Chung et.al. | 2311.13384v1 | null |
2023-11-21 | Bubble departure and sliding in high-pressure flow boiling of water | Artyom Kossolapov et.al. | 2311.12749v1 | null |
2023-11-21 | GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning | Jiaxi Lv et.al. | 2311.12631v1 | null |
2023-11-21 | HierSpeech++: Bridging the Gap between Semantic and Acoustic Representation of Speech by Hierarchical Variational Inference for Zero-shot Speech Synthesis | Sang-Hoon Lee et.al. | 2311.12454v1 | link |
2023-11-21 | Stable Diffusion For Aerial Object Detection | Yanan Jian et.al. | 2311.12345v1 | null |
2023-11-21 | LoCo: Locally Constrained Training-Free Layout-to-Image Synthesis | Peiang Zhao et.al. | 2311.12342v1 | null |
2023-11-21 | Overcoming Pathology Image Data Deficiency: Generating Images from Pathological Transformation Process | Zeyu Liu et.al. | 2311.12316v1 | link |
2023-11-20 | Macroscopic description of a heavy particle immersed within a flow of light particles | Radek Erban et.al. | 2311.12021v1 | null |
2023-11-20 | An Image is Worth Multiple Words: Multi-attribute Inversion for Constrained Text-to-Image Synthesis | Aishwarya Agarwal et.al. | 2311.11919v1 | null |
2023-11-20 | Evolution of internal gravity waves in meso-scale eddies | Pablo Sebastia Saez et.al. | 2311.11916v1 | null |
2023-11-20 | Log-periodic oscillations as real-time signatures of hierarchical dynamics in proteins | Emanuel Dorbath et.al. | 2311.11839v1 | null |
2023-11-20 | Holistic Inverse Rendering of Complex Facade via Aerial 3D Scanning | Zixuan Xie et.al. | 2311.11825v1 | null |
2023-11-17 | Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning | Rohit Girdhar et.al. | 2311.10709v1 | null |
2023-11-17 | SelfEval: Leveraging the discriminative nature of generative models for evaluation | Sai Saketh Rambhatla et.al. | 2311.10708v1 | null |
2023-11-17 | Enhancing Object Coherence in Layout-to-Image Synthesis | Yibin Wang et.al. | 2311.10522v1 | link |
2023-11-16 | The Chosen One: Consistent Characters in Text-to-Image Diffusion Models | Omri Avrahami et.al. | 2311.10093v1 | null |
2023-11-16 | Spontaneous Opinion Swings in the Voter Model with Latency | Giovanni Palermo et.al. | 2311.10045v1 | null |
2023-11-16 | TransFusion -- A Transparency-Based Diffusion Model for Anomaly Detection | Matic Fučka et.al. | 2311.09999v1 | null |
2023-11-16 | The divergence-free velocity formulation of the consistent Navier-Stokes Cahn-Hilliard model with non-matching densities, divergence-conforming discretization, and benchmarks | M. ten Eikelder et.al. | 2311.09966v1 | null |
2023-11-16 | DSR-Diff: Depth Map Super-Resolution with Diffusion Model | Yuan Shi et.al. | 2311.09919v1 | null |
2023-11-15 | Single-Image 3D Human Digitization with Shape-Guided Diffusion | Badour AlBahar et.al. | 2311.09221v1 | null |
2023-11-15 | DMV3D: Denoising Multi-View Diffusion using 3D Large Reconstruction Model | Yinghao Xu et.al. | 2311.09217v1 | null |
2023-11-15 | Finding polarised communities and tracking information diffusion on Twitter: The Irish Abortion Referendum | Caroline Pena et.al. | 2311.09196v1 | null |
2023-11-15 | Fast Detection of Phase Transitions with Multi-Task Learning-by-Confusion | Julian Arnold et.al. | 2311.09128v1 | null |
2023-11-15 | Contrastive Transformer Learning with Proximity Data Generation for Text-Based Person Search | Hefeng Wu et.al. | 2311.09084v1 | link |
2023-11-15 | A Spectral Diffusion Prior for Hyperspectral Image Super-Resolution | Jianjun Liu et.al. | 2311.08955v1 | null |
2023-11-13 | Fast and Space-Efficient Parallel Algorithms for Influence Maximization | Letong Wang et.al. | 2311.07554v1 | link |
2023-11-13 | Harnessing elastic instabilities for enhanced mixing and reaction kinetics in porous media | Christopher A. Browne et.al. | 2311.07431v1 | link |
2023-11-13 | Robust semi-supervised segmentation with timestep ensembling diffusion models | Margherita Rosnati et.al. | 2311.07421v1 | null |
2023-11-10 | Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization | Weiyang Liu et.al. | 2311.06243v1 | null |
2023-11-10 | Diffusion Models for Earth Observation Use-cases: from cloud removal to urban change detection | Fulvio Sanguigni et.al. | 2311.06222v1 | null |
2023-11-10 | Instant3D: Fast Text-to-3D with Sparse-View Generation and Large Reconstruction Model | Jiahao Li et.al. | 2311.06214v1 | null |
2023-11-10 | Turbulence Scaling from Deep Learning Diffusion Generative Models | Tim Whittaker et.al. | 2311.06112v1 | null |
2023-11-09 | Diffusion-Generative Multi-Fidelity Learning for Physical Simulation | Zheng Wang et.al. | 2311.05606v1 | null |
2023-11-09 | Bayesian Methods for Media Mix Modelling with shape and funnel effects | Javier Marin et.al. | 2311.05587v1 | null |
2023-11-09 | LCM-LoRA: A Universal Stable-Diffusion Acceleration Module | Simian Luo et.al. | 2311.05556v1 | link |
2023-11-09 | From Stability to Change: The Potential Application of Bifurcation Theory to Opinion Dynamics Considerations | Yasuko Kawahata et.al. | 2311.05488v1 | null |
2023-11-09 | Lithium-ion battery performance model including solvent segregation effects | Ruihe Li et.al. | 2311.05467v1 | null |
2023-11-09 | 3DStyle-Diffusion: Pursuing Fine-grained Text-driven 3D Stylization with 2D Diffusion Models | Haibo Yang et.al. | 2311.05464v1 | link |
2023-11-09 | ControlStyle: Text-Driven Stylized Image Generation Using Diffusion Priors | Jingwen Chen et.al. | 2311.05463v1 | null |
2023-11-08 | Transferability of atomic energies from alchemical decomposition | Michael J. Sahre et.al. | 2311.04784v1 | link |
2023-11-08 | Weakly-supervised deepfake localization in diffusion-generated images | Dragos Tantaru et.al. | 2311.04584v1 | link |
2023-11-07 | I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models | Shiwei Zhang et.al. | 2311.04145v1 | link |
2023-11-07 | Simple Bundles of Complex Networks | Alexandre Benatti et.al. | 2311.04133v1 | null |
2023-11-07 | Generative Structural Design Integrating BIM and Diffusion Model | Zhili He et.al. | 2311.04052v1 | link |
2023-11-07 | A Method to Improve the Performance of Reinforcement Learning Based on the Y Operator for a Class of Stochastic Differential Equation-Based Child-Mother Systems | Cheng Yin et.al. | 2311.04014v1 | null |
2023-11-06 | TS-Diffusion: Generating Highly Complex Time Series with Diffusion Models | Yangming Li et.al. | 2311.03303v1 | null |
2023-11-06 | LDM3D-VR: Latent Diffusion Model for 3D VR | Gabriela Ben Melech Stan et.al. | 2311.03226v1 | null |
2023-11-06 | Persistent homology for high-dimensional data based on spectral methods | Sebastian Damrich et.al. | 2311.03087v1 | link |
2023-11-06 | AnyText: Multilingual Visual Text Generation And Editing | Yuxiang Tuo et.al. | 2311.03054v1 | link |
2023-11-03 | Latent Diffusion Model for Conditional Reservoir Facies Generation | Daesoo Lee et.al. | 2311.01968v1 | null |
2023-11-03 | DiffDub: Person-generic Visual Dubbing Using Inpainting Renderer with Diffusion Auto-encoder | Tao Liu et.al. | 2311.01811v1 | null |
2023-11-03 | On the Generalization Properties of Diffusion Models | Puheng Li et.al. | 2311.01797v1 | link |
2023-11-03 | PDF: Point Diffusion Implicit Function for Large-scale Scene Neural Representation | Yuhan Ding et.al. | 2311.01773v1 | null |
2023-11-03 | CDGraph: Dual Conditional Social Graph Synthesizing via Diffusion Model | Jui-Yi Tsai et.al. | 2311.01729v1 | null |
2023-11-02 | Time Series Anomaly Detection using Diffusion-based Models | Ioana Pintilie et.al. | 2311.01452v1 | link |
2023-11-02 | Constrained-Context Conditional Diffusion Models for Imitation Learning | Vaibhav Saxena et.al. | 2311.01419v1 | null |
2023-11-02 | The Blessing of Randomness: SDE Beats ODE in General Diffusion-based Image Editing | Shen Nie et.al. | 2311.01410v1 | null |
2023-11-02 | Sim2Real Bilevel Adaptation for Object Surface Classification using Vision-Based Tactile Sensors | Gabriele M. Caddeo et.al. | 2311.01380v1 | link |
2023-11-02 | DP-Mix: Mixup-based Data Augmentation for Differentially Private Learning | Wenxuan Bao et.al. | 2311.01295v1 | link |
2023-11-02 | Unraveling Diffusion in Fusion Plasma: A Case Study of In Situ Processing and Particle Sorting | Junmin Gu et.al. | 2311.01288v1 | null |
2023-11-01 | De-Diffusion Makes Text a Strong Cross-Modal Interface | Chen Wei et.al. | 2311.00618v1 | null |
2023-11-01 | Controllable Music Production with Diffusion Models and Guidance Gradients | Mark Levy et.al. | 2311.00613v1 | null |
2023-11-01 | Intriguing Properties of Data Attribution on Diffusion Models | Xiaosen Zheng et.al. | 2311.00500v1 | link |
2023-11-01 | Diffusion models for probabilistic programming | Simon Dirmeier et.al. | 2311.00474v1 | link |
2023-11-01 | Dual Conditioned Diffusion Models for Out-Of-Distribution Detection: Application to Fetal Ultrasound Videos | Divyanshu Mishra et.al. | 2311.00469v1 | null |
2023-10-31 | SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction | Xinyuan Chen et.al. | 2310.20700v1 | null |
2023-10-31 | Diffusion Reconstruction of Ultrasound Images with Informative Uncertainty | Yuxin Zhang et.al. | 2310.20618v1 | null |
2023-10-29 | JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation | Yao Yao et.al. | 2310.19180v1 | null |
2023-10-29 | Learning to Follow Object-Centric Image Editing Instructions Faithfully | Tuhin Chakrabarty et.al. | 2310.19145v1 | link |
2023-10-29 | Backward and Forward Inference in Interacting Independent-Cascade Processes: A Scalable and Convergent Message-Passing Approach | Nouman Khan et.al. | 2310.19138v1 | null |
2023-10-29 | Bespoke Solvers for Generative Flow Models | Neta Shaul et.al. | 2310.19075v1 | null |
2023-10-29 | Controllable Group Choreography using Contrastive Diffusion | Nhat Le et.al. | 2310.18986v1 | null |
2023-10-29 | Adversarial Examples Are Not Real Features | Ang Li et.al. | 2310.18936v1 | link |
2023-10-27 | Gen2Sim: Scaling up Robot Learning in Simulation with Generative Models | Pushkal Katara et.al. | 2310.18308v1 | null |
2023-10-27 | Unsteady evolution of slip and drag in surfactant-contaminated superhydrophobic channels | Samuel D. Tomlinson et.al. | 2310.18184v1 | null |
2023-10-27 | Style Description based Text-to-Speech with Conditional Prosodic Layer Normalization based Diffusion GAN | Neeraj Kumar et.al. | 2310.18169v1 | null |
2023-10-27 | Lost in Translation -- Multilingual Misinformation and its Evolution | Dorian Quelle et.al. | 2310.18089v1 | null |
2023-10-27 | ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Real Image | Kyle Sargent et.al. | 2310.17994v1 | null |
2023-10-26 | 6-DoF Stability Field via Diffusion Models | Takuma Yoneda et.al. | 2310.17649v1 | null |
2023-10-26 | Generative Fractional Diffusion Models | Gabriel Nobis et.al. | 2310.17638v1 | null |
2023-10-26 | Orbital-optimized Density Functional Calculations of Molecular Rydberg Excited States with Real Space Grid Representation and Self-Interaction Correction | Alec E. Sigurðarson et.al. | 2310.17605v1 | null |
2023-10-26 | Noise-Free Score Distillation | Oren Katzir et.al. | 2310.17590v1 | null |
2023-10-27 | Global Structure-Aware Diffusion Process for Low-Light Image Enhancement | Jinhui Hou et.al. | 2310.17577v2 | link |
2023-10-26 | DiffS2UT: A Semantic Preserving Diffusion Model for Textless Direct Speech-to-Speech Translation | Yongxin Zhu et.al. | 2310.17570v1 | null |
2023-10-26 | SD4Match: Learning to Prompt Stable Diffusion Model for Semantic Matching | Xinghui Li et.al. | 2310.17569v1 | null |
2023-10-27 | The Expressive Power of Low-Rank Adaptation | Yuchen Zeng et.al. | 2310.17513v2 | link |
2023-10-25 | PERF: Panoramic Neural Radiance Field from a Single Panorama | Guangcong Wang et.al. | 2310.16831v1 | link |
2023-10-25 | CommonCanvas: An Open Diffusion Model Trained with Creative-Commons Images | Aaron Gokaslan et.al. | 2310.16825v1 | link |
2023-10-26 | DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior | Jingxiang Sun et.al. | 2310.16818v2 | link |
2023-10-25 | Optical Kinetic Theory of Nonlinear Multi-mode Photonic Networks | Arkady Kurnosov et.al. | 2310.16784v1 | null |
2023-10-25 | Kiki or Bouba? Sound Symbolism in Vision-and-Language Models | Morris Alper et.al. | 2310.16781v1 | null |
2023-10-25 | Multi-scale Diffusion Denoised Smoothing | Jongheon Jeong et.al. | 2310.16779v1 | link |
2023-10-25 | Discrete variance decay analysis of spurious mixing | Tridib Banerjee et.al. | 2310.16768v1 | null |
2023-10-25 | Scalar mass conservation in turbulent mixture fraction based combustion models through consistent local flow parameters | Marco Davidovic et.al. | 2310.16743v1 | null |
2023-10-24 | From Posterior Sampling to Meaningful Diversity in Image Restoration | Noa Cohen et.al. | 2310.16047v1 | null |
2023-10-24 | CVPR 2023 Text Guided Video Editing Competition | Jay Zhangjie Wu et.al. | 2310.16003v1 | link |
2023-10-24 | Classical wave-particle localization in disordered landscapes | Abel J. Abraham et.al. | 2310.16000v1 | null |
2023-10-25 | Improving Robustness and Reliability in Medical Image Classification with Latent-Guided Diffusion and Nested-Ensembles | Xing Shen et.al. | 2310.15952v2 | null |
2023-10-24 | Language-driven Scene Synthesis using Multi-conditional Diffusion Model | An Vuong et.al. | 2310.15948v1 | link |
2023-10-23 | FreeNoise: Tuning-Free Longer Video Diffusion Via Noise Rescheduling | Haonan Qiu et.al. | 2310.15169v1 | link |
2023-10-23 | Matryoshka Diffusion Models | Jiatao Gu et.al. | 2310.15111v1 | null |
2023-10-23 | Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model | Ruoxi Shi et.al. | 2310.15110v1 | link |
2023-10-24 | Wonder3D: Single Image to 3D using Cross-Domain Diffusion | Xiaoxiao Long et.al. | 2310.15008v2 | null |
2023-10-23 | Orientation-Aware Leg Movement Learning for Action-Driven Human Motion Prediction | Chunzhi Gu et.al. | 2310.14907v1 | null |
2023-10-20 | Achieving Single-Electron Sensitivity at Enhanced Speed in Fully-Depleted CCDs with Double-Gate MOSFETs | Miguel Sofo-Haro et.al. | 2310.13644v1 | null |
2023-10-20 | ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection | Zhongzhan Huang et.al. | 2310.13545v1 | link |
2023-10-20 | A Critical Insight into Pretransitional Behavior and Dielectric Tunability of Relaxor Ceramics | Sylwester J. Rzoska et.al. | 2310.13326v1 | null |
2023-10-19 | Variational Inference for SDEs Driven by Fractional Noise | Rembert Daems et.al. | 2310.12975v1 | null |
2023-10-19 | A Markovian dynamics for |
Antonio C. Costa et.al. | 2310.12883v1 | link |
2023-10-19 | EMIT-Diff: Enhancing Medical Image Segmentation via Text-Guided Diffusion Model | Zheyuan Zhang et.al. | 2310.12868v1 | null |
2023-10-19 | An effective theory of collective deep learning | Lluís Arola-Fernández et.al. | 2310.12802v1 | link |
2023-10-19 | Energy-Based Models For Speech Synthesis | Wanli Sun et.al. | 2310.12765v1 | null |
2023-10-18 | Object-aware Inversion and Reassembly for Image Editing | Zhen Yang et.al. | 2310.12149v1 | null |
2023-10-18 | Quality Diversity through Human Feedback | Li Ding et.al. | 2310.12103v1 | link |
2023-10-18 | Image Super-resolution Via Latent Diffusion: A Sampling-space Mixture Of Experts And Frequency-augmented Decoder Approach | Feng Luo et.al. | 2310.12004v1 | link |
2023-10-18 | Bayesian Flow Networks in Continual Learning | Mateusz Pyla et.al. | 2310.12001v1 | null |
2023-10-18 | InfoDiffusion: Information Entropy Aware Diffusion Process for Non-Autoregressive Text Generation | Renzhi Wang et.al. | 2310.11976v1 | link |
2023-10-17 | Elucidating The Design Space of Classifier-Guided Diffusion Generation | Jiajun Ma et.al. | 2310.11311v1 | link |
2023-10-17 | Favorable and unfavorable many-body interactions for near-field radiative heat transfer in nanoparticle networks | Minggang Luo et.al. | 2310.11273v1 | null |
2023-10-17 | A diffusive wetting model for water entry/exit based on the weakly-compressible SPH method | Shuoguo Zhang et.al. | 2310.11179v1 | null |
2023-10-17 | Leveraging Content-based Features from Multiple Acoustic Models for Singing Voice Conversion | Xueyao Zhang et.al. | 2310.11160v1 | link |
2023-10-17 | BayesDiff: Estimating Pixel-wise Uncertainty in Diffusion via Bayesian Inference | Siqi Kou et.al. | 2310.11142v1 | link |
2023-10-17 | 3D Structure-guided Network for Tooth Alignment in 2D Photograph | Yulong Dou et.al. | 2310.11106v1 | link |
2023-10-16 | A Survey on Video Diffusion Models | Zhen Xing et.al. | 2310.10647v1 | link |
2023-10-16 | TOSS:High-quality Text-guided Novel View Synthesis from a Single Image | Yukai Shi et.al. | 2310.10644v1 | null |
2023-10-16 | LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts | Hanan Gani et.al. | 2310.10640v1 | link |
2023-10-16 | Zero-Shot Robotic Manipulation with Pretrained Image-Editing Diffusion Models | Kevin Black et.al. | 2310.10639v1 | link |
2023-10-16 | DynVideo-E: Harnessing Dynamic NeRF for Large-Scale Motion- and View-Change Human-Centric Video Editing | Jia-Wei Liu et.al. | 2310.10624v1 | null |
2023-10-16 | ViPE: Visualise Pretty-much Everything | Hassan Shahmohammadi et.al. | 2310.10543v1 | link |
2023-10-13 | Hypernymy Understanding Evaluation of Text-to-Image Models via WordNet Hierarchy | Anton Baryshnikov et.al. | 2310.09247v1 | link |
2023-10-13 | Unseen Image Synthesis with Diffusion Models | Ye Zhu et.al. | 2310.09213v1 | null |
2023-10-13 | The effect of solar wind on the charged particles' diffusion coefficients | J. F. Wang et.al. | 2310.09211v1 | null |
2023-10-12 | OmniControl: Control Any Joint at Any Time for Human Motion Generation | Yiming Xie et.al. | 2310.08580v1 | link |
2023-10-12 | HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion | Xian Liu et.al. | 2310.08579v1 | null |
2023-10-12 | NetDiffusion: Network Data Augmentation Through Protocol-Constrained Traffic Generation | Xi Jiang et.al. | 2310.08543v1 | null |
2023-10-12 | GaussianDreamer: Fast Generation from Text to 3D Gaussian Splatting with Point Cloud Priors | Taoran Yi et.al. | 2310.08529v1 | link |
2023-10-12 | MotionDirector: Motion Customization of Text-to-Video Diffusion Models | Rui Zhao et.al. | 2310.08465v1 | link |
2023-10-12 | Debias the Training of Diffusion Models | Hu Yu et.al. | 2310.08442v1 | null |
2023-10-12 | Neural Diffusion Models | Grigory Bartosh et.al. | 2310.08337v1 | null |
2023-10-11 | ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models | Yingqing He et.al. | 2310.07702v1 | link |
2023-10-11 | ConditionVideo: Training-Free Condition-Guided Text-to-Video Generation | Bo Peng et.al. | 2310.07697v1 | link |
2023-10-11 | Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models | Lai Zeqiang et.al. | 2310.07653v1 | link |
2023-10-11 | Flux gradient relations and their dependence on turbulence anisotropy | Samuele Mosso et.al. | 2310.07503v1 | null |
2023-10-11 | Boosting Black-box Attack to Deep Neural Networks with Conditional Diffusion Models | Renyang Liu et.al. | 2310.07492v1 | null |
2023-10-11 | Multi-Concept T2I-Zero: Tweaking Only The Text Embeddings and Nothing Else | Hazarapet Tunanyan et.al. | 2310.07419v1 | null |
2023-10-10 | What Does Stable Diffusion Know about the 3D Scene? | Guanqi Zhan et.al. | 2310.06836v1 | link |
2023-10-10 | Impact of grain boundary and surface diffusion on predicted fission gas bubble behavior and release in UO |
Md Ali Muntaha et.al. | 2310.06795v1 | null |
2023-10-10 | HiFi-123: Towards High-fidelity One Image to 3D Content Generation | Wangbo Yu et.al. | 2310.06744v1 | null |
2023-10-10 | Latent Diffusion Counterfactual Explanations | Karim Farid et.al. | 2310.06668v1 | null |
2023-10-10 | Tertiary Lymphoid Structures Generation through Graph-based Diffusion | Manuel Madeira et.al. | 2310.06661v1 | null |
2023-10-09 | FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing | Yuren Cong et.al. | 2310.05922v1 | null |
2023-10-10 | Geom-Erasing: Geometry-Driven Removal of Implicit Concept in Diffusion Models | Zhili Liu et.al. | 2310.05873v2 | null |
2023-10-09 | A Bias-Variance-Covariance Decomposition of Kernel Scores for Generative Models | Sebastian G. Gruber et.al. | 2310.05833v1 | null |
2023-10-09 | DiffuSeq-v2: Bridging Discrete and Continuous Text Spaces for Accelerated Seq2Seq Diffusion Models | Shansan Gong et.al. | 2310.05793v1 | link |
2023-10-09 | Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation | Lijun Yu et.al. | 2310.05737v1 | link |
2023-10-09 | CIFAR-10-Warehouse: Broad and More Realistic Testbeds in Model Generalization Analysis | Xiaoxiao Sun et.al. | 2310.04414v2 | null |
2023-10-06 | Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference | Simian Luo et.al. | 2310.04378v1 | link |
2023-10-05 | Aligning Text-to-Image Diffusion Models with Reward Backpropagation | Mihir Prabhudesai et.al. | 2310.03739v1 | link |
2023-10-05 | Stochastic interpolants with data-dependent couplings | Michael S. Albergo et.al. | 2310.03725v1 | null |
2023-10-05 | Ctrl-Room: Controllable Text-to-3D Room Meshes Generation with Layout Constraints | Chuan Fang et.al. | 2310.03602v1 | null |
2023-10-05 | Kandinsky: an Improved Text-to-Image Synthesis with Image Prior and Latent Diffusion | Anton Razzhigaev et.al. | 2310.03502v1 | link |
2023-10-05 | Deep Generative Models of Music Expectation | Ninon Lizé Masclef et.al. | 2310.03500v1 | null |
2023-10-05 | An Extended Phase Graph-based framework for DANTE-SPACE simulations including physiological, temporal, and spatial variations | Matthijs H. S. de Buck et.al. | 2310.03429v1 | null |
2023-10-04 | Consistent-1-to-3: Consistent Image to 3D View Synthesis via Geometry-aware Diffusion Models | Jianglong Ye et.al. | 2310.03020v1 | null |
2023-10-04 | Efficient-3DiM: Learning a Generalizable Single-image Novel-view Synthesizer in One Day | Yifan Jiang et.al. | 2310.03015v1 | null |
2023-10-04 | Probing Intersectional Biases in Vision-Language Models with Counterfactual Examples | Phillip Howard et.al. | 2310.02988v1 | null |
2023-10-04 | T |
Yuze He et.al. | 2310.02977v1 | link |
2023-10-04 | Fast, Expressive SE |
Erik J Bekkers et.al. | 2310.02970v1 | link |
2023-10-04 | Optimal Transport with Adaptive Regularisation | Hugues Van Assel et.al. | 2310.02925v1 | null |
2023-10-04 | Boosting Dermatoscopic Lesion Segmentation via Diffusion Models with Visual and Textual Prompts | Shiyi Du et.al. | 2310.02906v1 | null |
2023-10-03 | Hierarchical Generation of Human-Object Interactions with Diffusion Probabilistic Models | Huaijin Pi et.al. | 2310.02242v1 | null |
2023-10-03 | Leveraging Diffusion Disentangled Representations to Mitigate Shortcuts in Underspecified Visual Tasks | Luca Scimeca et.al. | 2310.02230v1 | null |
2023-10-03 | Global Attractor for a Reaction-Diffusion Model Arising in Biological Dynamic in 3D Soil Structure | Mohamed Elghandouri et.al. | 2310.02060v1 | null |
2023-10-03 | AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model | Zibin Dong et.al. | 2310.02054v1 | null |
2023-10-03 | Spectral operator learning for parametric PDEs without data reliance | Junho Choi et.al. | 2310.02013v1 | null |
2023-10-03 | Optimizing microlens arrays for incoherent HiLo microscopy | Ziao Jiao et.al. | 2310.01939v1 | null |
2023-10-02 | LLM-grounded Video Diffusion Models | Long Lian et.al. | 2309.17444v2 | null |
2023-09-29 | Directly Fine-Tuning Diffusion Models on Differentiable Rewards | Kevin Clark et.al. | 2309.17400v1 | null |
2023-09-29 | Physics-Informed Neural Network for the Transient Diffusivity Equation in Reservoir Engineering | Daniel Badawi et.al. | 2309.17345v1 | null |
2023-09-28 | KV Inversion: KV Embeddings Learning for Text-Conditioned Real Image Action Editing | Jiancheng Huang et.al. | 2309.16608v1 | null |
2023-09-28 | CCEdit: Creative and Controllable Video Editing via Diffusion Models | Ruoyu Feng et.al. | 2309.16496v1 | null |
2023-09-28 | Distilling ODE Solvers of Diffusion Models into Smaller Steps | Sanghwan Kim et.al. | 2309.16421v1 | null |
2023-09-27 | Exploiting the Signal-Leak Bias in Diffusion Models | Martin Nicolas Everaert et.al. | 2309.15842v1 | null |
2023-09-27 | Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation | David Junhao Zhang et.al. | 2309.15818v1 | link |
2023-09-27 | Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack | Xiaoliang Dai et.al. | 2309.15807v1 | null |
2023-09-27 | Factorized Diffusion Architectures for Unsupervised Image Generation and Segmentation | Xin Yuan et.al. | 2309.15726v1 | null |
2023-09-27 | Dynamic Prompt Learning: Addressing Cross-Attention Leakage for Text-Based Image Editing | Kai Wang et.al. | 2309.15664v1 | link |
2023-09-27 | Direct Sensing of Remote Nuclei: Expanding the Reach of Cross-Effect Dynamic Nuclear Polarization | Amaria Javed et.al. | 2309.15653v1 | null |
2023-09-26 | Generating Visual Scenes from Touch | Fengyu Yang et.al. | 2309.15117v1 | null |
2023-09-27 | LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models | Yaohui Wang et.al. | 2309.15103v2 | link |
2023-09-26 | FEC: Three Finetuning-free Methods to Enhance Consistency for Real Image Editing | Songyan Chen et.al. | 2309.14934v1 | null |
2023-09-27 | ITEM3D: Illumination-Aware Directional Texture Editing for 3D Models | Shengqi Liu et.al. | 2309.14872v2 | null |
2023-09-26 | Navigating Text-To-Image Customization:From LyCORIS Fine-Tuning to Model Evaluation | Shin-Ying Yeh et.al. | 2309.14859v1 | link |
2023-09-25 | Dataset Diffusion: Diffusion-based Synthetic Dataset Generation for Pixel-Level Semantic Segmentation | Quang Nguyen et.al. | 2309.14303v1 | link |
2023-09-25 | Soft Mixture Denoising: Beyond the Expressive Bottleneck of Diffusion Models | Yangming Li et.al. | 2309.14068v1 | null |
2023-09-25 | Mixing as a correlated aggregation process | Joris Heyman et.al. | 2309.14040v1 | link |
2023-09-22 | MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation | Jiahao Xie et.al. | 2309.13042v1 | link |
2023-09-22 | Diffusion Augmentation for Sequential Recommendation | Qidong Liu et.al. | 2309.12858v1 | link |
2023-09-22 | Accuracy and stability analysis of horizontal discretizations used in unstructured grid ocean models | Fabricio Rodrigues Lapolli et.al. | 2309.12832v1 | null |
2023-09-22 | Synthetic Boost: Leveraging Synthetic Data for Enhanced Vision-Language Segmentation in Echocardiography | Rabin Adhikari et.al. | 2309.12829v1 | link |
2023-09-22 | Semantic Change Driven Generative Semantic Communication Framework | Wanting Yang et.al. | 2309.12775v1 | link |
2023-09-21 | A Diffusion-Model of Joint Interactive Navigation | Matthew Niedoba et.al. | 2309.12508v1 | null |
2023-09-21 | License Plate Super-Resolution Using Diffusion Models | Sawsan AlHalawani et.al. | 2309.12506v1 | null |
2023-09-21 | Performance Conditioning for Diffusion-Based Multi-Instrument Music Synthesis | Ben Maman et.al. | 2309.12283v1 | null |
2023-09-20 | FreeU: Free Lunch in Diffusion U-Net | Chenyang Si et.al. | 2309.11497v1 | link |
2023-09-20 | Generative Agent-Based Modeling: Unveiling Social System Dynamics through Coupling Mechanistic Models with Generative Artificial Intelligence | Navid Ghaffarzadegan et.al. | 2309.11456v1 | null |
2023-09-20 | Deep Networks as Denoising Algorithms: Sample-Efficient Learning of Diffusion Models in High-Dimensional Graphical Models | Song Mei et.al. | 2309.11420v1 | null |
2023-09-20 | EDMP: Ensemble-of-costs-guided Diffusion for Motion Planning | Kallol Saha et.al. | 2309.11414v1 | link |
2023-09-20 | Face Aging via Diffusion-based Editing | Xiangyi Chen et.al. | 2309.11321v1 | link |
2023-09-20 | FaceDiffuser: Speech-Driven 3D Facial Animation Synthesis Using Diffusion | Stefan Stan et.al. | 2309.11306v1 | link |
2023-09-19 | PGDiff: Guiding Diffusion Models for Versatile Face Restoration via Partial Guidance | Peiqing Yang et.al. | 2309.10810v1 | link |
2023-09-19 | Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation | Yatong Bai et.al. | 2309.10740v1 | link |
2023-09-19 | Reconstruct-and-Generate Diffusion Model for Detail-Preserving Image Denoising | Yujin Wang et.al. | 2309.10714v1 | null |
2023-09-18 | Generating and Imputing Tabular Data via Diffusion and Flow-based Gradient-Boosted Trees | Alexia Jolicoeur-Martineau et.al. | 2309.09968v1 | link |
2023-09-18 | What is a Fair Diffusion Model? Designing Generative Text-To-Image Models to Incorporate Various Worldviews | Zoe De Simone et.al. | 2309.09944v1 | link |
2023-09-18 | DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving | Xiaofeng Wang et.al. | 2309.09777v1 | null |
2023-09-18 | Application-driven Validation of Posteriors in Inverse Problems | Tim J. Adler et.al. | 2309.09764v1 | null |
2023-09-19 | Non-Hermitian physics and topological phenomena in convective thermal metamaterials | Zhoufei Liu et.al. | 2309.09681v2 | null |
2023-09-18 | Anomalous Diffusion of Lithium-Anion Clusters in Ionic Liquids | YeongKyu Lee et.al. | 2309.09674v1 | null |
2023-09-15 | Compositional Foundation Models for Hierarchical Planning | Anurag Ajay et.al. | 2309.08587v1 | null |
2023-09-15 | Denoising Diffusion Probabilistic Models for Hardware-Impaired Communications | Mehdi Letafati et.al. | 2309.08568v1 | null |
2023-09-15 | Breathing New Life into 3D Assets with Generative Repainting | Tianfu Wang et.al. | 2309.08523v1 | link |
2023-09-15 | Diffuse-illumination holographic optical coherence tomography | Léo Puyo et.al. | 2309.08486v1 | null |
2023-09-15 | Large-Vocabulary 3D Diffusion Model with Transformer | Ziang Cao et.al. | 2309.07920v2 | null |
2023-09-14 | Generative Image Dynamics | Zhengqi Li et.al. | 2309.07906v1 | null |
2023-09-14 | Beta Diffusion | Mingyuan Zhou et.al. | 2309.07867v1 | link |
2023-09-14 | Study and evaluation of the Ronen Method accuracy at material interfaces | Johan Cufe et.al. | 2309.07756v1 | null |
2023-09-14 | Dual-angle interferometric scattering microscopy for optical multiparametric particle characterization | Erik Olsén et.al. | 2309.07572v1 | null |
2023-09-13 | UnifiedGesture: A Unified Gesture Synthesis Model for Multiple Skeletons | Sicheng Yang et.al. | 2309.07051v1 | link |
2023-09-13 | Experimental Study on the Detection of Frozen Diffused Ammonia Blockage in the Inactive Section of a Variable Conductance Heat Pipe | F. K. Miranda et.al. | 2309.06936v1 | null |
2023-09-13 | DreamStyler: Paint by Style Inversion with Text-to-Image Diffusion Models | Namhyuk Ahn et.al. | 2309.06933v1 | null |
2023-09-13 | MagiCapture: High-Resolution Multi-Concept Portrait Customization | Junha Hyung et.al. | 2309.06895v1 | null |
2023-09-13 | DCTTS: Discrete Diffusion Model with Contrastive Learning for Text-to-speech Generation | Zhichao Wu et.al. | 2309.06787v1 | null |
2023-09-13 | High throughput sampling of phase space with deep learning potentials: |
Chenxing Luo et.al. | 2309.06712v1 | null |
2023-09-13 | Generalizable improvement of the Spalart-Allmaras model through assimilation of experimental data | Deepinder Jot Singh Aulakh et.al. | 2309.06679v1 | null |
2023-09-12 | InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation | Xingchao Liu et.al. | 2309.06380v1 | link |
2023-09-12 | Dispersion versus diffusion in mixing fronts | Gauthier Rousseau et.al. | 2309.06347v1 | null |
2023-09-12 | Unraveling biochemical spatial patterns: machine learning approaches to the inverse problem of Turing patterns | Antonio Matas-Gil et.al. | 2309.06339v1 | link |
2023-09-12 | Fg-T2M: Fine-Grained Text-Driven Human Motion Generation via Diffusion Model | Yin Wang et.al. | 2309.06284v1 | null |
2023-09-11 | Diffusion-Guided Reconstruction of Everyday Hand-Object Interaction Clips | Yufei Ye et.al. | 2309.05663v1 | null |
2023-09-11 | PAI-Diffusion: Constructing and Serving a Family of Open Chinese Diffusion Models for Text-to-image Synthesis on the Cloud | Chengyu Wang et.al. | 2309.05534v1 | null |
2023-09-11 | NExT-GPT: Any-to-Any Multimodal LLM | Shengqiong Wu et.al. | 2309.05519v1 | link |
2023-09-08 | Variations and Relaxations of Normalizing Flows | Keegan Kelly et.al. | 2309.04433v1 | null |
2023-09-08 | Create Your World: Lifelong Text-to-Image Diffusion | Gan Sun et.al. | 2309.04430v1 | null |
2023-09-08 | MaskDiffusion: Boosting Text-to-Image Consistency with Conditional Mask | Yupeng Zhou et.al. | 2309.04399v1 | null |
2023-09-08 | MoEController: Instruction-based Arbitrary Image Manipulation with Mixture-of-Expert Controllers | Sijia Li et.al. | 2309.04372v1 | null |
2023-09-08 | The role of tumbling in bacterial scattering at convex obstacles | Theresa Jakuszeit et.al. | 2309.04326v1 | null |
2023-09-07 | InstructDiffusion: A Generalist Modeling Interface for Vision Tasks | Zigang Geng et.al. | 2309.03895v1 | null |
2023-09-07 | DiffusionEngine: Diffusion Model is Scalable Data Engine for Object Detection | Manlin Zhang et.al. | 2309.03893v1 | null |
2023-09-07 | Text-to-feature diffusion for audio-visual few-shot learning | Otniel-Bogdan Mercea et.al. | 2309.03869v1 | link |
2023-09-07 | Phasic Content Fusing Diffusion Model with Directional Distribution Consistency for Few-Shot Model Adaption | Teng Hu et.al. | 2309.03729v1 | link |
2023-09-07 | DiffDefense: Defending against Adversarial Attacks via Diffusion Models | Hondamunige Prasanna Silva et.al. | 2309.03702v1 | link |
2023-09-07 | Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance Fields using Geometry-Guided Text-to-Image Diffusion Model | Sungwon Hwang et.al. | 2309.03550v1 | null |
2023-09-06 | My Art My Choice: Adversarial Protection Against Unruly AI | Anthony Rhodes et.al. | 2309.03198v1 | null |
2023-09-06 | SLiMe: Segment Like Me | Aliasghar Khani et.al. | 2309.03179v1 | link |
2023-09-06 | MCM: Multi-condition Motion Synthesis Framework for Multi-scenario | Zeyu Ling et.al. | 2309.03031v1 | null |
2023-09-05 | Generating Realistic Images from In-the-wild Sounds | Taegyeong Lee et.al. | 2309.02405v1 | null |
2023-09-05 | A Diffusion Quantum Monte Carlo Approach to the Polaritonic Ground State | Braden M. Weight et.al. | 2309.02349v1 | link |
2023-09-05 | Robust frequency-dependent diffusion kurtosis computation using an efficient direction scheme, axisymmetric modelling, and spatial regularization | J. Hamilton et.al. | 2309.02319v1 | null |
2023-09-05 | Neuromorphic nanocluster networks: Critical role of the substrate in nano-link formation | Wenkai Wu et.al. | 2309.02299v1 | null |
2023-09-05 | Robustness and Generalizability of Deepfake Detection: A Study with Diffusion Models | Haixu Song et.al. | 2309.02218v1 | link |
2023-09-01 | Iterative Multi-granular Image Editing using Diffusion Models | K J Joseph et.al. | 2309.00613v1 | null |
2023-09-01 | VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation | Xin Li et.al. | 2309.00398v1 | null |
2023-09-01 | Fast Diffusion EM: a diffusion model for blind inverse problems with application to deconvolution | Charles Laroche et.al. | 2309.00287v1 | link |
2023-09-01 | Data-driven Topology Optimization of Channel Flow Problems | Ce Guan et.al. | 2309.00278v1 | null |
2023-08-31 | InterDiff: Generating 3D Human-Object Interactions with Physics-Informed Diffusion | Sirui Xu et.al. | 2308.16905v1 | link |
2023-09-01 | GNFactor: Multi-Task Real Robot Learning with Generalizable Neural Feature Fields | Yanjie Ze et.al. | 2308.16891v2 | link |
2023-08-31 | Prediction of Diblock Copolymer Morphology via Machine Learning | Hyun Park et.al. | 2308.16886v1 | null |
2023-08-31 | Diffusion Models for Interferometric Satellite Aperture Radar | Alexandre Tuel et.al. | 2308.16847v1 | link |
2023-09-01 | Irregular Traffic Time Series Forecasting Based on Asynchronous Spatio-Temporal Graph Convolutional Network | Weijia Zhang et.al. | 2308.16818v2 | null |
2023-09-01 | Ref-Diff: Zero-shot Referring Image Segmentation with Generative Models | Minheng Ni et.al. | 2308.16777v2 | null |
2023-08-31 | Terrain Diffusion Network: Climatic-Aware Terrain Generation with Geological Sketch Guidance | Zexin Hu et.al. | 2308.16725v1 | null |
2023-08-30 | SignDiff: Learning Diffusion Models for American Sign Language Production | Sen Fang et.al. | 2308.16082v1 | null |
2023-08-30 | Click Metamaterials: Fast Acquisition of Thermal Conductivity and Functionality Diversities | Chengmeng Wang et.al. | 2308.16057v1 | null |
2023-08-30 | DiffuVolume: Diffusion Model for Volume based Stereo Matching | Dian Zheng et.al. | 2308.15989v1 | null |
2023-08-30 | Physics-Informed DeepMRI: Bridging the Gap from Heat Diffusion to k-Space Interpolation | Zhuo-Xu Cui et.al. | 2308.15918v1 | null |
2023-08-29 | ParaGuide: Guided Diffusion Paraphrasers for Plug-and-Play Textual Style Transfer | Zachary Horvitz et.al. | 2308.15459v1 | link |
2023-08-29 | Vortex core radius in baroclinic turbulence: Implications for scaling predictions | Gabriel Hadjerci et.al. | 2308.15398v1 | null |
2023-08-29 | Rayleigh-Bénard instability in a horizontal porous layer with anomalous diffusion | Antonio Barletta et.al. | 2308.15359v1 | null |
2023-08-30 | Elucidating the Exposure Bias in Diffusion Models | Mang Ning et.al. | 2308.15321v2 | link |
2023-08-28 | Total Selfie: Generating Full-Body Selfies | Bowei Chen et.al. | 2308.14740v1 | null |
2023-08-28 | Oscillating reaction in porous media under saddle flow | Satoshi Izumoto et.al. | 2308.14723v1 | null |
2023-08-28 | 360-Degree Panorama Generation from Few Unregistered NFoV Images | Jionghao Wang et.al. | 2308.14686v1 | link |
2023-08-28 | Effect of gas diffusion layer fiber shape on cathode two-phase dynamics in proton exchange membrane fuel cells | Danan Yang et.al. | 2308.14539v1 | null |
2023-08-28 | Priority-Centric Human Motion Generation in Discrete Latent Space | Hanyang Kong et.al. | 2308.14480v1 | null |
2023-08-25 | Distribution-Aligned Diffusion for Human Mesh Recovery | Lin Geng Foo et.al. | 2308.13369v1 | null |
2023-08-25 | Age of Information Diffusion on Social Networks: Optimizing Multi-Stage Seeding Strategies | Songhua Li et.al. | 2308.13303v1 | null |
2023-08-25 | EfficientDreamer: High-Fidelity and Robust 3D Creation via Orthogonal-view Diffusion Prior | Minda Zhao et.al. | 2308.13223v1 | link |
2023-08-25 | Diff-Retinex: Rethinking Low-light Image Enhancement with A Generative Diffusion Model | Xunpeng Yi et.al. | 2308.13164v1 | null |
2023-08-25 | A Survey of Diffusion Based Image Generation Models: Issues and Their Solutions | Tianyi Zhang et.al. | 2308.13142v1 | null |
2023-08-24 | Dense Text-to-Image Generation with Attention Modulation | Yunji Kim et.al. | 2308.12964v1 | link |
2023-08-24 | Language as Reality: A Co-Creative Storytelling Game Experience in 1001 Nights using Generative AI | Yuqian Sun et.al. | 2308.12915v1 | null |
2023-08-24 | Hydrogen jet diffusion modeling by using physics-informed graph neural network and sparsely-distributed sensor data | Xinqi Zhang et.al. | 2308.12621v1 | null |
2023-08-24 | APLA: Additional Perturbation for Latent Noise with Adversarial Training Enables Consistency | Yupu Yao et.al. | 2308.12605v1 | null |
2023-08-23 | On-Manifold Projected Gradient Descent | Aaron Mahler et.al. | 2308.12279v1 | null |
2023-08-23 | Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning | Jiasheng Ye et.al. | 2308.12219v1 | link |
2023-08-23 | Pulse shape discrimination for the CONUS experiment in the keV and sub-keV regime | H. Bonet et.al. | 2308.12105v1 | null |
2023-08-22 | Theory of Transverse Mode Instability in Fiber Amplifiers with Multimode Excitations | Kabish Wisal et.al. | 2308.11599v1 | null |
2023-08-22 | NIPG-DG schemes for transformed master equations modeling open quantum systems | Jose A. Morales Escalante et.al. | 2308.11580v1 | null |
2023-08-22 | IT3D: Improved Text-to-3D Generation with Explicit View Synthesis | Yiwen Chen et.al. | 2308.11473v1 | link |
2023-08-22 | SDeMorph: Towards Better Facial De-morphing from Single Morph | Nitish Shukla et.al. | 2308.11442v1 | null |
2023-08-21 | TADA! Text to Animatable Digital Avatars | Tingting Liao et.al. | 2308.10899v1 | null |
2023-08-21 | Election Manipulation in Social Networks with Single-Peaked Agents | Vincenzo Auletta et.al. | 2308.10845v1 | null |
2023-08-21 | Backdooring Textual Inversion for Concept Censorship | Yutong wu et.al. | 2308.10718v1 | null |
2023-08-21 | EVE: Efficient zero-shot text-based Video Editing with Depth Map Guidance and Temporal Consistency Constraints | Yutao Chen et.al. | 2308.10648v1 | null |
2023-08-21 | Frequency Compensated Diffusion Model for Real-scene Dehazing | Jing Wang et.al. | 2308.10510v1 | link |
2023-08-21 | Texture Generation on 3D Meshes with Point-UV Diffusion | Xin Yu et.al. | 2308.10490v1 | null |
2023-08-18 | Diff2Lip: Audio Conditioned Diffusion Models for Lip-Synchronization | Soumik Mukhopadhyay et.al. | 2308.09716v1 | link |
2023-08-18 | HumanLiff: Layer-wise 3D Human Generation with Diffusion Model | Shoukang Hu et.al. | 2308.09712v1 | null |
2023-08-18 | SimDA: Simple Diffusion Adapter for Efficient Video Generation | Zhen Xing et.al. | 2308.09710v1 | null |
2023-08-18 | Guide3D: Create 3D Avatars from Text and Image Guidance | Yukang Cao et.al. | 2308.09705v1 | null |
2023-08-18 | PoSynDA: Multi-Hypothesis Pose Synthesis Domain Adaptation for Robust 3D Human Pose Estimation | Hanbing Liu et.al. | 2308.09678v1 | link |
2023-08-18 | Constrained Bayesian Optimization Using a Lagrange Multiplier Applied to Power Transistor Design | Ping-Ju Chuang et.al. | 2308.09612v1 | null |
2023-08-18 | Language-Guided Diffusion Model for Visual Grounding | Sijia Chen et.al. | 2308.09599v1 | null |
2023-08-18 | StableVideo: Text-driven Consistency-aware Diffusion Video Editing | Wenhao Chai et.al. | 2308.09592v1 | link |
2023-08-18 | O^2-Recon: Completing 3D Reconstruction of Occluded Objects in the Scene with a Pre-trained 2D Diffusion Model | Yubin Hu et.al. | 2308.09591v1 | link |
2023-08-16 | TeCH: Text-guided Reconstruction of Lifelike Clothed Humans | Yangyi Huang et.al. | 2308.08545v1 | link |
2023-08-16 | Voxlines: Streamline Transparency through Voxelization and View-Dependent Line Orders | Besm Osman et.al. | 2308.08436v1 | null |
2023-08-16 | Diff-CAPTCHA: An Image-based CAPTCHA with Security Enhanced by Denoising Diffusion Model | Ran Jiang et.al. | 2308.08367v1 | null |
2023-08-18 | Dual-Stream Diffusion Net for Text-to-Video Generation | Binhui Liu et.al. | 2308.08316v2 | null |
2023-08-16 | Electron transfer efficiency in liquid xenon across THGEM holes | G. Martínez-Lema et.al. | 2308.08314v1 | null |
2023-08-15 | StyleDiffusion: Controllable Disentangled Style Transfer via Diffusion Models | Zhizhong Wang et.al. | 2308.07863v1 | null |
2023-08-15 | CCD-3DR: Consistent Conditioning in Diffusion for Single-Image 3D Reconstruction | Yan Di et.al. | 2308.07837v1 | null |
2023-08-15 | DiffV2S: Diffusion-based Video-to-Speech Synthesis with Vision-guided Speaker Embedding | Jeongsoo Choi et.al. | 2308.07787v1 | link |
2023-08-15 | Dancing Avatar: Pose and Text-Guided Human Motion Videos Synthesis with Image Diffusion Model | Bosheng Qin et.al. | 2308.07749v1 | null |
2023-08-14 | Jurassic World Remake: Bringing Ancient Fossils Back to Life via Zero-Shot Long Image-to-Image Translation | Alexander Martin et.al. | 2308.07316v1 | link |
2023-08-14 | DiffSED: Sound Event Detection with Denoising Diffusion | Swapnil Bhosale et.al. | 2308.07293v1 | null |
2023-08-14 | Diffusion Based Augmentation for Captioning and Retrieval in Cultural Heritage | Dario Cioni et.al. | 2308.07151v1 | link |
2023-08-14 | Temporal clustering of social interactions trades-off disease spreading and knowledge diffusion | Giulia Cencetti et.al. | 2308.07058v1 | link |
2023-08-14 | Bayesian Flow Networks | Alex Graves et.al. | 2308.07037v1 | link |
2023-08-14 | An efficient topology optimization method for steady gas flows in all flow regimes | Ruifeng Yuan et.al. | 2308.07018v1 | null |
2023-08-14 | Discrete Conditional Diffusion for Reranking in Recommendation | Xiao Lin et.al. | 2308.06982v1 | null |
2023-08-11 | Acoustofluidic Engineering Functional Vessel-on-a-Chip | Yue Wu et.al. | 2308.06219v1 | null |
2023-08-11 | DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models | Weijia Wu et.al. | 2308.06160v1 | link |
2023-08-11 | Taming the Power of Diffusion Models for High-Quality Virtual Try-On with Appearance Flow | Junhong Gou et.al. | 2308.06101v1 | link |
2023-08-11 | Diffusion-based Visual Counterfactual Explanations -- Towards Systematic Quantitative Evaluation | Philipp Vaeth et.al. | 2308.06100v1 | link |
2023-08-11 | Head Rotation in Denoising Diffusion Models | Andrea Asperti et.al. | 2308.06057v1 | link |
2023-08-11 | Diverse Data Augmentation with Diffusions for Effective Test-time Prompt Tuning | Chun-Mei Feng et.al. | 2308.06038v1 | link |
2023-08-11 | Masked-Attention Diffusion Guidance for Spatially Controlling Text-to-Image Generation | Yuki Endo et.al. | 2308.06027v1 | link |
2023-08-14 | Audio is all in one: speech-driven gesture synthetics using WavLM pre-trained model | Fan Zhang et.al. | 2308.05995v2 | null |
2023-08-10 | AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining | Haohe Liu et.al. | 2308.05734v1 | link |
2023-08-10 | PDE-Refiner: Achieving Accurate Long Rollouts with Neural PDE Solvers | Phillip Lippe et.al. | 2308.05732v1 | null |
2023-08-10 | Masked Diffusion as Self-supervised Representation Learner | Zixuan Pan et.al. | 2308.05695v1 | null |
2023-08-10 | Generative Diffusion Models for Radio Wireless Channel Modelling and Sampling | Ushnish Sengupta et.al. | 2308.05583v1 | null |
2023-08-10 | Fokker-Planck-Poisson kinetics: Multi-phase flow beyond equilibrium | Mohsen Sadr et.al. | 2308.05580v1 | null |
2023-08-09 | LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation | Leigang Qu et.al. | 2308.05095v1 | null |
2023-08-09 | Do Diffusion Models Suffer Error Propagation? Theoretical Analysis and Consistency Regularization | Yangming Li et.al. | 2308.05021v1 | null |
2023-08-10 | IDiff-Face: Synthetic-based Face Recognition through Fizzy Identity-Conditioned Diffusion Models | Fadi Boutros et.al. | 2308.04995v2 | link |
2023-08-09 | CasCIFF: A Cross-Domain Information Fusion Framework Tailored for Cascade Prediction in Social Networks | Hongjun Zhu et.al. | 2308.04961v1 | link |
2023-08-09 | Interaction-induced directional transport on periodically driven chains | Helena Drüeke et.al. | 2308.04845v1 | null |
2023-08-08 | DiffCR: A Fast Conditional Diffusion Framework for Cloud Removal from Optical Satellite Images | Xuechao Zou et.al. | 2308.04417v1 | link |
2023-08-08 | Cloth2Tex: A Customized Cloth Texture Generation Pipeline for 3D Virtual Try-On | Daiheng Gao et.al. | 2308.04288v1 | null |
2023-08-08 | MCDAN: a Multi-scale Context-enhanced Dynamic Attention Network for Diffusion Prediction | Xiaowen Wang et.al. | 2308.04266v1 | null |
2023-08-08 | FLIRT: Feedback Loop In-context Red Teaming | Ninareh Mehrabi et.al. | 2308.04265v1 | null |
2023-08-08 | MindDiffuser: Controlled Image Reconstruction from Human Brain Activity with Semantic and Structural Diffusion | Yizhuo Lu et.al. | 2308.04249v1 | link |
2023-08-08 | Synthetic Augmentation with Large-scale Unconditional Pre-training | Jiarong Ye et.al. | 2308.04020v1 | link |
2023-08-07 | Diffusion Model in Causal Inference with Unmeasured Confounders | Tatsuhiro Shimizu et.al. | 2308.03669v1 | link |
2023-08-07 | AvatarVerse: High-quality & Stable 3D Avatar Creation from Text and Pose | Huichao Zhang et.al. | 2308.03610v1 | link |
2023-08-08 | DiffSynth: Latent In-Iteration Deflickering for Realistic Video Synthesis | Zhongjie Duan et.al. | 2308.03463v2 | link |
2023-08-04 | Quantum Dynamical Approach to Predicting the Optical Pumping Threshold for Lasing in Organic Materials | Bin Zhang et.al. | 2308.02447v1 | null |
2023-08-04 | Diffusion-Augmented Depth Prediction with Sparse Annotations | Jiaqi Li et.al. | 2308.02283v1 | null |
2023-08-04 | Painterly Image Harmonization using Diffusion Model | Lingxiao Lu et.al. | 2308.02228v1 | link |
2023-08-03 | Synthesizing Long-Term Human Motions with Diffusion Models via Coherent Sampling | Zhao Yang et.al. | 2308.01850v1 | link |
2023-08-03 | DiffColor: Toward High Fidelity Text-Guided Image Colorization with Diffusion Models | Jianxin Lin et.al. | 2308.01655v1 | null |
2023-08-03 | Reference-Free Isotropic 3D EM Reconstruction using Diffusion Models | Kyungryun Lee et.al. | 2308.01594v1 | null |
2023-08-03 | Adversarial Training of Denoising Diffusion Model Using Dual Discriminators for High-Fidelity Multi-Speaker TTS | Myeongjin Ko et.al. | 2308.01573v1 | link |
2023-08-02 | Patched Denoising Diffusion Models For High-Resolution Image Synthesis | Zheng Ding et.al. | 2308.01316v1 | link |
2023-08-02 | Contrast-augmented Diffusion Model with Fine-grained Sequence Alignment for Markup-to-Image Generation | Guojin Zhong et.al. | 2308.01147v1 | link |
2023-08-02 | DiffusePast: Diffusion-based Generative Replay for Class Incremental Semantic Segmentation | Jingfan Chen et.al. | 2308.01127v1 | null |
2023-08-01 | Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models | Cheng-Yu Hsieh et.al. | 2308.00675v1 | null |
2023-08-01 | Diffusion Model for Camouflaged Object Detection | Zhennan Chen et.al. | 2308.00303v1 | null |
2023-07-31 | Universal Adversarial Defense in Remote Sensing Based on Pre-trained Denoising Diffusion Models | Weikang Yu et.al. | 2307.16865v1 | null |
2023-07-31 | From Generation to Suppression: Towards Effective Irregular Glow Removal for Nighttime Visibility Enhancement | Wanyu Wu et.al. | 2307.16783v1 | null |
2023-07-31 | Understanding Dynamics in Coarse-Grained Models: III. Roles of Rotational Motion and Translation-Rotation Coupling in Coarse-Grained Dynamics | Jaehyeok Jin et.al. | 2307.16747v1 | null |
2023-07-31 | DiffPose: SpatioTemporal Diffusion Model for Video-Based Human Pose Estimation | Runyang Feng et.al. | 2307.16687v1 | null |
2023-07-31 | On the Trustworthiness Landscape of State-of-the-art Generative Models: A Comprehensive Survey | Mingyuan Fan et.al. | 2307.16680v1 | null |
2023-07-28 | Understanding the Anomalous Diffusion of Water in Aqueous Electrolytes Using Machine Learned Potentials | Nikhil V. S. Avula et.al. | 2307.15576v1 | null |
2023-07-28 | Minimally-Supervised Speech Synthesis with Conditional Diffusion Model and Language Model: A Comparative Study of Semantic Coding | Chunyu Qiang et.al. | 2307.15484v1 | null |
2023-07-27 | The RoboDepth Challenge: Methods and Advancements Towards Robust Depth Estimation | Lingdong Kong et.al. | 2307.15061v1 | link |
2023-07-27 | TEDi: Temporally-Entangled Diffusion for Long-Term Motion Synthesis | Zihan Zhang et.al. | 2307.15042v1 | null |
2023-07-27 | Generative convective parametrization of dry atmospheric boundary layer | Florian Heyder et.al. | 2307.14857v1 | null |
2023-07-27 | Empirical analysis of congestion spreading in Seoul traffic network | Jung-Hoon Jung et.al. | 2307.14800v1 | null |
2023-07-26 | Virtual Mirrors: Non-Line-of-Sight Imaging Beyond the Third Bounce | Diego Royo et.al. | 2307.14341v1 | null |
2023-07-26 | Visual Instruction Inversion: Image Editing via Visual Prompting | Thao Nguyen et.al. | 2307.14331v1 | link |
2023-07-26 | Founding a mathematical diffusion model in linguistics. The case study of German syntactic features in the North-Eastern Italian dialects | I. Lazzizzera et.al. | 2307.14291v1 | null |
2023-07-26 | VideoControlNet: A Motion-Guided Video-to-Video Translation Framework by Using Diffusion Model with ControlNet | Zhihao Hu et.al. | 2307.14073v1 | null |
2023-07-25 | Comparing phase-space and phenomenological modeling approaches for Lagrangian particles settling in a turbulent boundary layer | Andrew P. Grace et.al. | 2307.13659v1 | null |
2023-07-25 | Fake It Without Making It: Conditioned Face Generation for Accurate 3D Face Shape Estimation | Will Rowan et.al. | 2307.13639v1 | null |
2023-07-25 | XDLM: Cross-lingual Diffusion Language Model for Machine Translation | Linyao Chen et.al. | 2307.13560v1 | null |
2023-07-25 | Not with my name! Inferring artists' names of input strings employed by Diffusion Models | Roberto Leotta et.al. | 2307.13527v1 | link |
2023-07-24 | A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models | Jindong Gu et.al. | 2307.12980v1 | link |
2023-07-24 | Data-free Black-box Attack based on Diffusion Model | Mingwen Shao et.al. | 2307.12872v1 | null |
2023-07-24 | Understanding the Latent Space of Diffusion Models through the Lens of Riemannian Geometry | Yong-Hyun Park et.al. | 2307.12868v1 | link |
2023-07-24 | The ro-vibrational |
Francesco Mazza et.al. | 2307.12740v1 | null |
2023-07-21 | FEDD -- Fair, Efficient, and Diverse Diffusion-based Lesion Segmentation and Malignancy Classification | Héctor Carrión et.al. | 2307.11654v1 | link |
2023-07-21 | Mixbiotic society measures: Assessment of community well-going as living system | Takeshi Kato et.al. | 2307.11594v1 | null |
2023-07-21 | Predict, Refine, Synthesize: Self-Guiding Diffusion Models for Probabilistic Time Series Forecasting | Marcel Kollovieh et.al. | 2307.11494v1 | link |
2023-07-20 | Hypergraph Diffusions and Resolvents for Norm-Based Hypergraph Laplacians | Konstantinos Ameranis et.al. | 2307.11042v1 | null |
2023-07-20 | Progressive distillation diffusion for raw music generation | Svetlana Pavlova et.al. | 2307.10994v1 | null |
2023-07-20 | Energy-consistent discretization of viscous dissipation with application to natural convection flow | Benjamin Sanderse et.al. | 2307.10874v1 | null |
2023-07-19 | FABRIC: Personalizing Diffusion Models with Iterative Feedback | Dimitri von Rütte et.al. | 2307.10159v1 | link |
2023-07-19 | XSkill: Cross Embodiment Skill Discovery | Mengda Xu et.al. | 2307.09955v1 | link |
2023-07-19 | Visual Representation for Patterned Proliferation of Social Media Addiction: Quantitative Model and Network Analysis | Dibyajyoti Mallick et.al. | 2307.09902v1 | null |
2023-07-19 | BSDM: Background Suppression Diffusion Model for Hyperspectral Anomaly Detection | Jitao Ma et.al. | 2307.09861v1 | link |
2023-07-19 | A Siamese-based Verification System for Open-set Architecture Attribution of Synthetic Images | Lydia Abady et.al. | 2307.09822v1 | link |
2023-07-18 | AnyDoor: Zero-shot Object-level Image Customization | Xi Chen et.al. | 2307.09481v1 | link |
2023-07-18 | Augmenting CLIP with Improved Visio-Linguistic Reasoning | Samyadeep Basu et.al. | 2307.09233v1 | null |
2023-07-17 | Diffusion Models Beat GANs on Image Classification | Soumik Mukhopadhyay et.al. | 2307.08702v1 | null |
2023-07-17 | Flow Matching in Latent Space | Quan Dao et.al. | 2307.08698v1 | link |
2023-07-17 | SEMI-DiffusionInst: A Diffusion Model Based Approach for Semiconductor Defect Classification and Segmentation | Vic De Ridder et.al. | 2307.08693v1 | null |
2023-07-17 | Multimodal Diffusion Segmentation Model for Object Segmentation from Manipulation Instructions | Yui Iioka et.al. | 2307.08597v1 | null |
2023-07-17 | Identity-Preserving Aging of Face Images via Latent Diffusion Models | Sudipta Banerjee et.al. | 2307.08585v1 | link |
2023-07-17 | Synthetic Lagrangian Turbulence by Generative Diffusion Models | Tianyi Li et.al. | 2307.08529v1 | link |
2023-07-17 | How far does turbulence spread? | Alexandros Alexakis et.al. | 2307.08469v1 | null |
2023-07-17 | Not All Steps are Created Equal: Selective Diffusion Distillation for Image Manipulation | Luozhou Wang et.al. | 2307.08448v1 | link |
2023-07-18 | Unstoppable Attack: Label-Only Model Inversion via Conditional Diffusion Model | Rongke Liu et.al. | 2307.08424v2 | null |
2023-07-14 | NIFTY: Neural Object Interaction Fields for Guided Human Motion Synthesis | Nilesh Kulkarni et.al. | 2307.07511v1 | null |
2023-07-14 | DreamTeacher: Pretraining Image Backbones with Deep Generative Models | Daiqing Li et.al. | 2307.07487v1 | null |
2023-07-14 | Inverse Evolution Layers: Physics-informed Regularizers for Deep Neural Networks | Chaoyu Liu et.al. | 2307.07344v1 | null |
2023-07-14 | High-density single-molecule maps reveal transient membrane receptor interactions within a dynamically varying environment | Nicolas Mateos et.al. | 2307.07334v1 | null |
2023-07-14 | Multimodal Motion Conditioned Diffusion Model for Skeleton-based Video Anomaly Detection | Alessandro Flaborea et.al. | 2307.07205v1 | link |
2023-07-14 | Federated Learning-Empowered AI-Generated Content in Wireless Networks | Xumin Huang et.al. | 2307.07146v1 | null |
2023-07-13 | HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models | Nataniel Ruiz et.al. | 2307.06949v1 | null |
2023-07-12 | Exposing the Fake: Effective Diffusion-Generated Images Detection | Ruipeng Ma et.al. | 2307.06272v1 | null |
2023-07-12 | Diffusion Based Multi-Agent Adversarial Tracking | Sean Ye et.al. | 2307.06244v1 | null |
2023-07-12 | Functional light diffusers based on hybrid CsPbBr |
Lena M. Saure et.al. | 2307.06197v1 | null |
2023-07-12 | Biofilm.jl: a fast solver for one-dimensional biofilm chemistry and ecology | Mark Owkes et.al. | 2307.06153v1 | link |
2023-07-11 | Metropolis Sampling for Constrained Diffusion Models | Nic Fishman et.al. | 2307.05439v1 | null |
2023-07-11 | On the Vulnerability of DeepFake Detectors to Attacks Generated by Denoising Diffusion Models | Marija Ivanovska et.al. | 2307.05397v1 | null |
2023-07-10 | Shelving, Stacking, Hanging: Relational Pose Diffusion for Multi-modal Rearrangement | Anthony Simeonov et.al. | 2307.04751v1 | null |
2023-07-10 | Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Feedback | Jaskirat Singh et.al. | 2307.04749v1 | null |
2023-07-10 | Diffusion Policies for Out-of-Distribution Generalization in Offline Reinforcement Learning | Suzan Ece Ada et.al. | 2307.04726v1 | null |
2023-07-10 | AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning | Yuwei Guo et.al. | 2307.04725v1 | link |
2023-07-10 | Machine learning potentials with Iterative Boltzmann Inversion: training to experiment | Sakib Matin et.al. | 2307.04712v1 | null |
2023-07-10 | Encapsulation Structure and Dynamics in Hypergraphs | Timothy LaRock et.al. | 2307.04613v1 | link |
2023-07-07 | Three-dimensional Vorticity Effects on Extinction Behavior of Laminar Flamelets | Wes Hellwig et.al. | 2307.03695v1 | null |
2023-07-07 | Simulation-free Schrödinger bridges via score and flow matching | Alexander Tong et.al. | 2307.03672v1 | link |
2023-07-07 | IPO-LDM: Depth-aided 360-degree Indoor RGB Panorama Outpainting via Latent Diffusion Model | Tianhao Wu et.al. | 2307.03177v2 | null |
2023-07-06 | How to Detect Unauthorized Data Usages in Text-to-image Diffusion Models | Zhenting Wang et.al. | 2307.03108v1 | link |
2023-07-06 | Origin-Destination Travel Time Oracle for Map-based Services | Yan Lin et.al. | 2307.03048v1 | null |
2023-07-06 | Multi-modal multi-class Parkinson disease classification using CNN and decision level fusion | Sushanta Kumar Sahu et.al. | 2307.02978v1 | null |
2023-07-06 | On the Cultural Gap in Text-to-Image Generation | Bingshuai Liu et.al. | 2307.02971v1 | null |
2023-07-05 | DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models | Chong Mou et.al. | 2307.02421v1 | link |
2023-07-05 | RADiff: Controllable Diffusion Models for Radio Astronomical Maps Generation | Renato Sortino et.al. | 2307.02392v1 | null |
2023-07-06 | Error Approximation and Bias Correction in Dynamic Problems using a Recurrent Neural Network/Finite Element Hybrid Model | Moritz von Tresckow et.al. | 2307.02349v2 | null |
2023-07-05 | Detecting Images Generated by Deep Diffusion Models using their Local Intrinsic Dimensionality | Peter Lorenz et.al. | 2307.02347v1 | link |
2023-07-05 | SVDM: Single-View Diffusion Model for Pseudo-Stereo 3D Object Detection | Yuguang Shi et.al. | 2307.02270v1 | null |
2023-07-03 | Improved sampling via learned diffusions | Lorenz Richter et.al. | 2307.01198v1 | null |
2023-07-03 | Squeezing Large-Scale Diffusion Models for Mobile | Jiwoong Choi et.al. | 2307.01193v1 | null |
2023-07-03 | Learning Mixtures of Gaussians Using the DDPM Objective | Kulin Shah et.al. | 2307.01178v1 | null |
2023-07-03 | Investigating Data Memorization in 3D Latent Diffusion Models for Medical Image Synthesis | Salman Ul Hassan Dar et.al. | 2307.01148v1 | null |
2023-07-03 | A phase field-based framework for electro-chemo-mechanical fracture: crack-contained electrolytes, chemical reactions and stabilisation | T. Hageman et.al. | 2307.01105v1 | null |
2023-07-03 | MVDiffusion: Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion | Shitao Tang et.al. | 2307.01097v1 | link |
2023-07-03 | TomatoDIFF: On-plant Tomato Segmentation with Denoising Diffusion Models | Marija Ivanovska et.al. | 2307.01064v1 | link |
2023-06-30 | Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors | Guocheng Qian et.al. | 2306.17843v1 | link |
2023-06-30 | Content-Preserving Diffusion Model for Unsupervised AS-OCT image Despeckling | Li Sanqian et.al. | 2306.17717v1 | null |
2023-06-29 | Generate Anything Anywhere in Any Scene | Yuheng Li et.al. | 2306.17154v1 | null |
2023-06-29 | Filtered-Guided Diffusion: Fast Filter Guidance for Black-Box Diffusion Models | Zeqi Gu et.al. | 2306.17141v1 | link |
2023-06-29 | ID-Pose: Sparse-view Camera Pose Estimation by Inverting Diffusion Models | Weihao Cheng et.al. | 2306.17140v1 | null |
2023-06-29 | Learning Nuclei Representations with Masked Image Modelling | Piotr Wójcik et.al. | 2306.17116v1 | null |
2023-06-29 | Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation | Zibo Zhao et.al. | 2306.17115v1 | link |
2023-06-29 | Towards rapid extracellular vesicles colorimetric detection using optofluidics-enhanced color-changing optical metasurface | Chuchuan Hong et.al. | 2306.17102v1 | null |
2023-06-28 | DiffComplete: Diffusion-based Generative 3D Shape Completion | Ruihang Chu et.al. | 2306.16329v1 | null |
2023-06-28 | UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data | Heeseung Kim et.al. | 2306.16083v1 | link |
2023-06-28 | PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment | Jianyuan Wang et.al. | 2306.15667v2 | null |
2023-06-27 | Stabilizing ultrathin Silver (Ag) films on different substrates | Allamula Ashok et.al. | 2306.15575v1 | null |
2023-06-27 | Trajectory Generation, Control, and Safety with Denoising Diffusion Probabilistic Models | Nicolò Botteghi et.al. | 2306.15512v1 | link |
2023-06-27 | Miniaturized gas-solid fluidized beds | Fernando David Cúñez Benalcázar et.al. | 2306.15463v1 | null |
2023-06-27 | Adversarial Training for Graph Neural Networks | Lukas Gosch et.al. | 2306.15427v1 | null |
2023-06-26 | Fuzzy-Conditioned Diffusion and Diffusion Projection Attention Applied to Facial Image Correction | Majed El Helou et.al. | 2306.14891v1 | link |
2023-06-26 | Restart Sampling for Improving Generative Processes | Yilun Xu et.al. | 2306.14878v1 | link |
2023-06-26 | ViNT: A Foundation Model for Visual Navigation | Dhruv Shah et.al. | 2306.14846v1 | null |
2023-06-26 | ProtoDiff: Learning to Learn Prototypical Networks by Task-Guided Diffusion | Yingjun Du et.al. | 2306.14770v1 | link |
2023-06-23 | Fast Macroscopic Forcing Method | Spencer H. Bryngelson et.al. | 2306.13625v1 | link |
2023-06-23 | DreamEditor: Text-Driven 3D Scene Editing with Neural Fields | Jingyu Zhuang et.al. | 2306.13455v1 | link |
2023-06-22 | Continuous Layout Editing of Single Images with Diffusion Models | Zhiyuan Zhang et.al. | 2306.13078v1 | null |
2023-06-22 | Towards More Realistic Membership Inference Attacks on Large Diffusion Models | Jan Dubiński et.al. | 2306.12983v1 | null |
2023-06-22 | On the nature of the two-positron bond: Evidence for a novel bond type | Mohammad Goli et.al. | 2306.12899v1 | null |
2023-06-22 | Stress-induced Artificial neuron spiking in Diffusive memristors | Debi Pattnaik et.al. | 2306.12853v1 | null |
2023-06-21 | DreamTime: An Improved Optimization Strategy for Text-to-3D Content Creation | Yukun Huang et.al. | 2306.12422v1 | null |
2023-06-21 | HumanDiffusion: diffusion model using perceptual gradients | Yota Ueda et.al. | 2306.12169v1 | null |
2023-06-20 | Learning Profitable NFT Image Diffusions via Multiple Visual-Policy Guided Reinforcement Learning | Huiguo He et.al. | 2306.11731v1 | null |
2023-06-20 | Diffusion with Forward Models: Solving Stochastic Inverse Problems Without Direct Supervision | Ayush Tewari et.al. | 2306.11719v1 | null |
2023-06-20 | Align, Adapt and Inject: Sound-guided Unified Image Generation | Yue Yang et.al. | 2306.11504v1 | null |
2023-06-20 | EMoG: Synthesizing Emotive Co-speech 3D Gesture with Diffusion Model | Lianying Yin et.al. | 2306.11496v1 | null |
2023-06-16 | Group Orthogonalization Regularization For Vision Models Adaptation and Robustness | Yoav Kurtz et.al. | 2306.10001v1 | link |
2023-06-16 | Towards Better Certified Segmentation via Diffusion Models | Othmane Laousy et.al. | 2306.09949v1 | null |
2023-06-16 | Unique information from common diffusion MRI models about white-matter differences across the human adult lifespan | Rafael Neto Henriques1 et.al. | 2306.09942v1 | link |
2023-06-16 | Drag-guided diffusion models for vehicle image generation | Nikos Arechiga et.al. | 2306.09935v1 | null |
2023-06-16 | Energy-Based Cross Attention for Bayesian Context Update in Text-to-Image Diffusion Models | Geon Yeong Park et.al. | 2306.09869v1 | link |
2023-06-16 | AvatarBooth: High-Quality and Customizable 3D Human Avatar Generation | Yifei Zeng et.al. | 2306.09864v1 | null |
2023-06-15 | Generative Proxemics: A Prior for 3D Social Interaction from Images | Lea Müller et.al. | 2306.09337v1 | link |
2023-06-15 | ArtFusion: Arbitrary Style Transfer using Dual Conditional Latent Diffusion Models | Dar-Yen Chen et.al. | 2306.09330v1 | link |
2023-06-15 | Diffusion Models for Zero-Shot Open-Vocabulary Segmentation | Laurynas Karazija et.al. | 2306.09316v1 | null |
2023-06-15 | Fast Training of Diffusion Models with Masked Transformers | Hongkai Zheng et.al. | 2306.09305v1 | link |
2023-06-15 | Conditional Human Sketch Synthesis with Explicit Abstraction Control | Dar-Yen Chen et.al. | 2306.09274v1 | null |
2023-06-15 | Training Diffusion Classifiers with Denoising Assistance | Chandramouli Sastry et.al. | 2306.09192v1 | null |
2023-06-13 | Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation | Shuai Yang et.al. | 2306.07954v1 | null |
2023-06-13 | Viewset Diffusion: (0-)Image-Conditioned 3D Generative Models from 2D Data | Stanislaw Szymanowicz et.al. | 2306.07881v1 | null |
2023-06-13 | Diffusive and convective dissolution of carbon dioxide in a vertical cylindrical cell | Daniël P. Faasen et.al. | 2306.07721v1 | null |
2023-06-12 | Controlling Text-to-Image Diffusion by Orthogonal Finetuning | Zeju Qiu et.al. | 2306.07280v1 | null |
2023-06-12 | MovieFactory: Automatic Movie Creation from Text using Large Generative Models for Language and Images | Junchen Zhu et.al. | 2306.07257v1 | null |
2023-06-12 | Diffusion Models for Black-Box Optimization | Siddarth Krishnamoorthy et.al. | 2306.07180v1 | link |
2023-06-12 | InstructP2P: Learning to Edit 3D Point Clouds with Text Instructions | Jiale Xu et.al. | 2306.07154v1 | null |
2023-06-12 | Latent Dynamical Implicit Diffusion Processes | Mohammad R. Rezaei et.al. | 2306.07077v1 | null |
2023-06-09 | Bridging Scales: a Hybrid Model to Simulate Vascular Tumor Growth and Treatment Response | Tobias Duswald et.al. | 2306.05994v1 | link |
2023-06-09 | DDLP: Unsupervised Object-Centric Video Prediction with Deep Dynamic Latent Particles | Tal Daniel et.al. | 2306.05957v1 | link |
2023-06-09 | Beyond Surface Statistics: Scene Representations in a Latent Diffusion Model | Yida Chen et.al. | 2306.05720v1 | link |
2023-06-12 | Boosting Fast and High-Quality Speech Synthesis with Linear Diffusion | Haogeng Liu et.al. | 2306.05708v2 | null |
2023-06-09 | RePaint-NeRF: NeRF Editting via Semantic Masks and Diffusion Models | Xingchen Zhou et.al. | 2306.05668v1 | null |
2023-06-08 | Grounded Text-to-Image Synthesis with Attention Refocusing | Quynh Phung et.al. | 2306.05427v1 | null |
2023-06-08 | ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion Process | Changyao Tian et.al. | 2306.05423v1 | null |
2023-06-08 | Stochastic Multi-Person 3D Motion Forecasting | Sirui Xu et.al. | 2306.05421v1 | link |
2023-06-08 | Improving Negative-Prompt Inversion via Proximal Guidance | Ligong Han et.al. | 2306.05414v1 | link |
2023-06-08 | PriSampler: Mitigating Property Inference of Diffusion Models | Hailong Hu et.al. | 2306.05208v1 | null |
2023-06-08 | SyncDiffusion: Coherent Montage via Synchronized Joint Diffusions | Yuseung Lee et.al. | 2306.05178v1 | null |
2023-06-07 | Designing a Better Asymmetric VQGAN for StableDiffusion | Zixin Zhu et.al. | 2306.04632v1 | link |
2023-06-07 | ARTIC3D: Learning Robust Articulated 3D Shapes from Noisy Web Image Collections | Chun-Han Yao et.al. | 2306.04619v1 | null |
2023-06-08 | Integrating Geometric Control into Text-to-Image Diffusion Models for High-Quality Detection Data Generation via Text Prompt | Kai Chen et.al. | 2306.04607v2 | null |
2023-06-07 | On the Design Fundamentals of Diffusion Models: A Survey | Ziyi Chang et.al. | 2306.04542v1 | null |
2023-06-07 | Multi-modal Latent Diffusion | Mustapha Bounoua et.al. | 2306.04445v1 | null |
2023-06-07 | Synthesizing realistic sand assemblies with denoising diffusion in latent space | Nikolaos N. Vlassis et.al. | 2306.04411v1 | null |
2023-06-07 | Improving Diffusion-based Image Translation using Asymmetric Gradient Guidance | Gihyun Kwon et.al. | 2306.04396v1 | link |
2023-06-06 | Emergent Correspondence from Image Diffusion | Luming Tang et.al. | 2306.03881v1 | link |
2023-06-06 | Conditional Diffusion Models for Weakly Supervised Medical Image Segmentation | Xinrong Hu et.al. | 2306.03878v1 | link |
2023-06-06 | Newly Formed Cities: an AI Curation | Dario Negueruela del Castillo et.al. | 2306.03753v1 | null |
2023-06-06 | Towards Visual Foundational Models of Physical Scenes | Chethan Parameshwara et.al. | 2306.03727v1 | null |
2023-06-06 | Diffusional exchange versus microscopic kurtosis from CTI: two conflicting interpretations of the same data | Arthur Chakwizira et.al. | 2306.03661v1 | null |
2023-06-05 | Brain Diffusion for Visual Exploration: Cortical Discovery using Large Scale Generative Models | Andrew F. Luo et.al. | 2306.03089v1 | null |
2023-06-05 | MotionDiffuser: Controllable Multi-Agent Motion Prediction using Diffusion | Chiyu Max Jiang et.al. | 2306.03083v1 | null |
2023-06-05 | Influence of the finite transverse size of the accelerating region on the relativistic feedback | Alexander Sedelnikov et.al. | 2306.03059v1 | null |
2023-06-05 | HeadSculpt: Crafting 3D Head Avatars with Text | Xiao Han et.al. | 2306.03038v1 | null |
2023-06-05 | Interpretable Alzheimer's Disease Classification Via a Contrastive Diffusion Autoencoder | Ayodeji Ijishakin et.al. | 2306.03022v1 | link |
2023-06-05 | Complex Preferences for Different Convergent Priors in Discrete Graph Diffusion | Alex M. Tseng et.al. | 2306.02957v1 | null |
2023-06-05 | INDigo: An INN-Guided Probabilistic Diffusion Algorithm for Inverse Problems | Di You et.al. | 2306.02949v1 | null |
2023-06-05 | Instruct-Video2Avatar: Video-to-Avatar Generation with Instructions | Shaoxu Li et.al. | 2306.02903v1 | link |
2023-06-02 | Video Colorization with Pre-trained Text-to-Image Diffusion Models | Hanyuan Liu et.al. | 2306.01732v1 | null |
2023-06-02 | Denoising Diffusion Semantic Segmentation with Mask Prior Modeling | Zeqiang Lai et.al. | 2306.01721v1 | link |
2023-06-02 | DiffusEmp: A Diffusion Model-Based Framework with Multi-Grained Control for Empathetic Response Generation | Guanqun Bi et.al. | 2306.01657v1 | null |
2023-06-02 | Influence Maximization with Fairness at Scale (Extended Version) | Yuting Feng et.al. | 2306.01587v1 | null |
2023-06-02 | PolyDiffuse: Polygonal Shape Reconstruction via Guided Set Diffusion Models | Jiacheng Chen et.al. | 2306.01461v1 | link |
2023-06-02 | Diffusion Self-Guidance for Controllable Image Generation | Dave Epstein et.al. | 2306.00986v2 | null |
2023-06-01 | StableRep: Synthetic Images from Text-to-Image Models Make Strong Visual Representation Learners | Yonglong Tian et.al. | 2306.00984v1 | link |
2023-06-01 | StyleDrop: Text-to-Image Generation in Any Style | Kihyuk Sohn et.al. | 2306.00983v1 | null |
2023-06-01 | SnapFusion: Text-to-Image Diffusion Model on Mobile Devices within Two Seconds | Yanyu Li et.al. | 2306.00980v1 | link |
2023-06-01 | Intriguing Properties of Text-guided Diffusion Models | Qihao Liu et.al. | 2306.00974v1 | link |
2023-06-01 | Intelligent Grimm -- Open-ended Visual Storytelling via Latent Diffusion Models | Chang Liu et.al. | 2306.00973v1 | link |
2023-06-01 | ViCo: Detail-Preserving Visual Condition for Personalized Text-to-Image Generation | Shaozhe Hao et.al. | 2306.00971v1 | link |
2023-06-01 | The Hidden Language of Diffusion Models | Hila Chefer et.al. | 2306.00966v1 | link |
2023-06-01 | Cocktail: Mixing Multi-Modality Controls for Text-Conditional Image Generation | Minghui Hu et.al. | 2306.00964v1 | null |
2023-06-01 | Differential Diffusion: Giving Each Pixel Its Strength | Eran Levin et.al. | 2306.00950v1 | link |
2023-05-31 | Learning Explicit Contact for Implicit Reconstruction of Hand-held Objects from Monocular Images | Junxing Hu et.al. | 2305.20089v1 | null |
2023-05-31 | Understanding and Mitigating Copying in Diffusion Models | Gowthami Somepalli et.al. | 2305.20086v1 | link |
2023-05-31 | Control4D: Dynamic Portrait Editing by Learning 4D GAN from 2D Diffusion-based Editor | Ruizhi Shao et.al. | 2305.20082v1 | null |
2023-05-31 | Efficient Diffusion Policies for Offline Reinforcement Learning | Bingyi Kang et.al. | 2305.20081v1 | link |
2023-05-31 | A Unified Conditional Framework for Diffusion-based Image Restoration | Yi Zhang et.al. | 2305.20049v1 | link |
2023-06-01 | Tree-Ring Watermarks: Fingerprints for Diffusion Images that are Invisible and Robust | Yuxin Wen et.al. | 2305.20030v2 | link |
2023-05-31 | Protein Design with Guided Discrete Diffusion | Nate Gruver et.al. | 2305.20009v1 | link |
2023-05-31 | GANDiffFace: Controllable Generation of Synthetic Datasets for Face Recognition with Realistic Variations | Pietro Melzi et.al. | 2305.19962v1 | null |
2023-05-31 | A Geometric Perspective on Diffusion Models | Defang Chen et.al. | 2305.19947v1 | null |
2023-05-30 | Ambient Diffusion: Learning Clean Distributions from Corrupted Data | Giannis Daras et.al. | 2305.19256v1 | link |
2023-05-30 | PanoGen: Text-Conditioned Panoramic Environment Generation for Vision-and-Language Navigation | Jialu Li et.al. | 2305.19195v1 | null |
2023-05-30 | Video ControlNet: Towards Temporally Consistent Synthetic-to-Real Video Translation Using Conditional Image Diffusion Models | Ernie Chu et.al. | 2305.19193v1 | null |
2023-05-30 | Calliffusion: Chinese Calligraphy Generation and Style Transfer with Diffusion Modeling | Qisheng Liao et.al. | 2305.19124v1 | null |
2023-05-30 | DiffMatch: Diffusion Model for Dense Matching | Jisu Nam et.al. | 2305.19094v1 | link |
2023-05-30 | Likelihood-Based Diffusion Language Models | Ishaan Gulrajani et.al. | 2305.18619v1 | link |
2023-05-29 | RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths | Zeyue Xue et.al. | 2305.18295v1 | null |
2023-05-29 | Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models | Yuchao Gu et.al. | 2305.18292v1 | link |
2023-05-29 | Photoswap: Personalized Subject Swapping in Images | Jing Gu et.al. | 2305.18286v1 | null |
2023-05-29 | Reconstructing the Mind's Eye: fMRI-to-Image with Contrastive Learning and Diffusion Priors | Paul S. Scotti et.al. | 2305.18274v1 | link |
2023-05-29 | Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising | Fu-Yun Wang et.al. | 2305.18264v1 | link |
2023-05-29 | GlyphControl: Glyph Conditional Control for Visual Text Generation | Yukang Yang et.al. | 2305.18259v1 | link |
2023-05-26 | Improving accuracy of GPT-3/4 results on biomedical data using a retrieval-augmented language model | David Soong et.al. | 2305.17116v1 | null |
2023-05-26 | ControlVideo: Adding Conditional Control for One Shot Text-to-Video Editing | Min Zhao et.al. | 2305.17098v1 | link |
2023-05-26 | The reaction-diffusion basis of animated patterns in eukaryotic flagella | James F. Cass et.al. | 2305.17032v1 | link |
2023-05-26 | Accelerating Diffusion Models for Inverse Problems through Shortcut Sampling | Gongye Liu et.al. | 2305.16965v1 | link |
2023-05-26 | Learning to Imagine: Visually-Augmented Natural Language Generation | Tianyi Tang et.al. | 2305.16944v1 | link |
2023-05-26 | DiffusionNAG: Task-guided Neural Architecture Generation with Diffusion Models | Sohyun An et.al. | 2305.16943v1 | link |
2023-05-26 | CRoSS: Diffusion Model Makes Controllable, Robust and Secure Image Steganography | Jiwen Yu et.al. | 2305.16936v1 | link |
2023-05-26 | Turbulence calculation based on the extended Naiver-Stokes equations | Shanwen Tan et.al. | 2305.16923v1 | null |
2023-05-25 | Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models | Shihao Zhao et.al. | 2305.16322v1 | link |
2023-05-25 | Eclipse: Disambiguating Illumination and Materials using Unintended Shadows | Dor Verbin et.al. | 2305.16321v1 | null |
2023-05-25 | Parallel Sampling of Diffusion Models | Andy Shih et.al. | 2305.16317v1 | link |
2023-05-25 | NAP: Neural 3D Articulation Prior | Jiahui Lei et.al. | 2305.16315v1 | null |
2023-05-25 | UMat: Uncertainty-Aware Single Image High Resolution Material Capture | Carlos Rodriguez-Pardo et.al. | 2305.16312v1 | null |
2023-05-25 | Break-A-Scene: Extracting Multiple Concepts from a Single Image | Omri Avrahami et.al. | 2305.16311v1 | link |
2023-05-25 | Look Ma, No Hands! Agent-Environment Factorization of Egocentric Videos | Matthew Chang et.al. | 2305.16301v1 | null |
2023-05-25 | Diversify Your Vision Datasets with Automatic Diffusion-Based Augmentation | Lisa Dunlap et.al. | 2305.16289v1 | link |
2023-05-25 | CommonScenes: Generating Commonsense 3D Indoor Scenes with Scene Graphs | Guangyao Zhai et.al. | 2305.16283v1 | link |
2023-05-25 | UDPM: Upsampling Diffusion Probabilistic Models | Shady Abu-Hussein et.al. | 2305.16269v1 | link |
2023-05-24 | Sin3DM: Learning a Diffusion Model from a Single 3D Textured Shape | Rundi Wu et.al. | 2305.15399v1 | link |
2023-05-24 | A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence | Junyi Zhang et.al. | 2305.15347v1 | link |
2023-05-24 | Training on Thin Air: Improve Image Classification with Generated Data | Yongchao Zhou et.al. | 2305.15316v1 | link |
2023-05-24 | MultiFusion: Fusing Pre-Trained Models for Multi-Lingual, Multi-Modal Image Generation | Marco Bellagente et.al. | 2305.15296v1 | null |
2023-05-23 | Diffusion Hyperfeatures: Searching Through Time and Space for Semantic Correspondence | Grace Luo et.al. | 2305.14334v1 | null |
2023-05-23 | SEEDS: Exponential SDE Solvers for Fast High-Quality Sampling from Diffusion Models | Martin Gonzalez et.al. | 2305.14267v1 | link |
2023-05-23 | Improved Convergence of Score-Based Diffusion Models via Prediction-Correction | Francesco Pedrotti et.al. | 2305.14164v1 | null |
2023-05-23 | Realistic Noise Synthesis with Diffusion Models | Qi Wu et.al. | 2305.14022v1 | null |
2023-05-23 | Lightweight Channel Codes for ISI Mitigation in Molecular Communication between Bionanosensors | Dongliang Jing et.al. | 2305.14001v1 | null |
2023-05-23 | Node-wise Diffusion for Scalable Graph Learning | Keke Huang et.al. | 2305.14000v1 | link |
2023-05-22 | VDT: An Empirical Study on Video Diffusion with Transformers | Haoyu Lu et.al. | 2305.13311v1 | link |
2023-05-22 | If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection | Shyamgopal Karthik et.al. | 2305.13308v1 | link |
2023-05-23 | Training Diffusion Models with Reinforcement Learning | Kevin Black et.al. | 2305.13301v2 | link |
2023-05-22 | DiffusionNER: Boundary Diffusion for Named Entity Recognition | Yongliang Shen et.al. | 2305.13298v1 | link |
2023-05-22 | U-DiT TTS: U-Diffusion Vision Transformer for Text-to-Speech | Xin Jing et.al. | 2305.13195v1 | null |
2023-05-22 | Policy Representation via Diffusion Probability Model for Reinforcement Learning | Long Yang et.al. | 2305.13122v1 | link |
2023-05-22 | Energy cascade in the Garrett-Munk spectrum of internal gravity waves | Yue Wu et.al. | 2305.13110v1 | null |
2023-05-19 | Chupa: Carving 3D Clothed Humans from Skinned Shape Priors using 2D Diffusion Probabilistic Models | Byungjun Kim et.al. | 2305.11870v1 | link |
2023-05-19 | Any-to-Any Generation via Composable Diffusion | Zineng Tang et.al. | 2305.11846v1 | link |
2023-05-19 | The probability flow ODE is provably fast | Sitan Chen et.al. | 2305.11798v1 | null |
2023-05-19 | Cinematic Mindscapes: High-quality Video Reconstruction from Brain Activity | Zijiao Chen et.al. | 2305.11675v1 | null |
2023-05-19 | Few-shot 3D Shape Generation | Jingyuan Zhu et.al. | 2305.11664v1 | null |
2023-05-19 | Text2NeRF: Text-Driven 3D Scene Generation with Neural Radiance Fields | Jingbo Zhang et.al. | 2305.11588v1 | link |
2023-05-19 | Brain Captioning: Decoding human brain activity into images and text | Matteo Ferrante et.al. | 2305.11560v1 | null |
2023-05-19 | Efficient Cross-Lingual Transfer for Chinese Stable Diffusion with Images as Pivots | Jinyi Hu et.al. | 2305.11540v1 | null |
2023-05-19 | Late-Constraint Diffusion Guidance for Controllable Image Synthesis | Chang Liu et.al. | 2305.11520v1 | link |
2023-05-19 | DiffuSIA: A Spiral Interaction Architecture for Encoder-Decoder Text Diffusion | Chao-Hong Tan et.al. | 2305.11517v1 | null |
2023-05-18 | UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild | Can Qin et.al. | 2305.11147v1 | link |
2023-05-18 | Blackout Diffusion: Generative Diffusion Models in Discrete-State Spaces | Javier E Santos et.al. | 2305.11089v1 | link |
2023-05-18 | Inspecting the Geographical Representativeness of Images from Text-to-Image Models | Abhipsa Basu et.al. | 2305.11080v1 | null |
2023-05-18 | Unsupervised Pansharpening via Low-rank Diffusion Model | Xiangyu Rui et.al. | 2305.10925v1 | link |
2023-05-18 | Structural Pruning for Diffusion Models | Gongfan Fang et.al. | 2305.10924v1 | link |
2023-05-18 | VideoFactory: Swap Attention in Spatiotemporal Diffusions for Text-to-Video Generation | Wenjing Wang et.al. | 2305.10874v1 | null |
2023-05-17 | FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention | Guangxuan Xiao et.al. | 2305.10431v1 | link |
2023-05-17 | Raising the Bar for Certified Adversarial Robustness with Diffusion Models | Thomas Altstidl et.al. | 2305.10388v1 | null |
2023-05-17 | A phase field model for droplets suspended in viscous liquids under the influence of electric fields | Yuzhe Qin et.al. | 2305.10296v1 | null |
2023-05-17 | Provably Correct Physics-Informed Neural Networks | Francisco Eiras et.al. | 2305.10157v1 | null |
2023-05-18 | Controllable Mind Visual Diffusion Model | Bohan Zeng et.al. | 2305.10135v2 | link |
2023-05-16 | Make-An-Animation: Large-Scale Text-conditional 3D Human Motion Generation | Samaneh Azadi et.al. | 2305.09662v1 | null |
2023-05-16 | FitMe: Deep Photorealistic 3D Morphable Model Avatars | Alexandros Lattas et.al. | 2305.09641v1 | null |
2023-05-16 | AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation | Tong Wu et.al. | 2305.09515v1 | link |
2023-05-16 | Discrete Diffusion Probabilistic Models for Symbolic Music Generation | Matthias Plasser et.al. | 2305.09489v1 | link |
2023-05-17 | Multi-Level Global Context Cross Consistency Model for Semi-Supervised Ultrasound Image Segmentation with Diffusion Model | Fenghe Tang et.al. | 2305.09447v2 | link |
2023-05-16 | Diffusion Dataset Generation: Towards Closing the Sim2Real Gap for Pedestrian Detection | Andrew Farley et.al. | 2305.09401v1 | null |
2023-05-17 | AMD: Autoregressive Motion Diffusion | Bo Han et.al. | 2305.09381v2 | null |
2023-05-15 | Laughing Matters: Introducing Laughing-Face Generation using Diffusion Models | Antoni Bigata Casademunt et.al. | 2305.08854v1 | link |
2023-05-15 | Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts | Yuyang Zhao et.al. | 2305.08850v1 | null |
2023-05-15 | The role of magnetic helicity when it is absent on average | Axel Brandenburg et.al. | 2305.08769v1 | null |
2023-05-15 | Diffusion-weighted SPECIAL improves the detection of J-coupled metabolites at ultra-high magnetic field | Jessie Mosso et.al. | 2305.08708v1 | null |
2023-05-15 | A Reproducible Extraction of Training Images from Diffusion Models | Ryan Webster et.al. | 2305.08694v1 | link |
2023-05-12 | Sound waves, diffusive transport, and wall slip in nanoconfined compressible fluids | Hannes Holey et.al. | 2305.07501v1 | null |
2023-05-12 | On a Voter Model with Context-Dependent Opinion Adoption | Luca Becchetti et.al. | 2305.07377v1 | null |
2023-05-12 | Experimental optimization of lensless digital holographic microscopy with rotating diffuser-based coherent noise reduction | Piotr Arcab et.al. | 2305.07373v1 | null |
2023-05-12 | Penguin huddling: a continuum model | Samuel J. Harris et.al. | 2305.07324v1 | link |
2023-05-15 | Phosphorus-Controlled Nanoepitaxy in the Asymmetric Growth of GaAs-InP Core-Shell Bent Nanowires | Spencer McDermott et.al. | 2305.07252v2 | null |
2023-05-12 | Optimal calibration of optical tweezers with arbitrary integration time and sampling frequencies -- A general framework | Laura Pérez-Garcéa et.al. | 2305.07245v1 | null |
2023-05-15 | Fully quantum algorithm for lattice Boltzmann methods with application to partial differential equations | Fatima Ezahra Chrit et.al. | 2305.07148v2 | link |
2023-05-11 | Exploiting Diffusion Prior for Real-World Image Super-Resolution | Jianyi Wang et.al. | 2305.07015v1 | link |
2023-05-11 | A method for automated regression test in scientific computing libraries: illustration with SPHinXsys | Bo Zhang et.al. | 2305.06970v1 | link |
2023-05-11 | CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model | Zhen Ye et.al. | 2305.06908v1 | link |
2023-05-11 | Null-text Guidance in Diffusion Models is Secretly a Cartoon-style Creator | Jing Zhao et.al. | 2305.06710v1 | null |
2023-05-11 | Evaluating Twitter's Algorithmic Amplification of Low-Trust Content: An Observational Study | Giulio Corsi et.al. | 2305.06125v2 | link |
2023-05-10 | Relightify: Relightable 3D Faces from a Single Image via Diffusion Models | Foivos Paraperas Papantoniou et.al. | 2305.06077v1 | null |
2023-05-10 | iEdit: Localised Text-guided Image Editing with Weak Supervision | Rumeysa Bodur et.al. | 2305.05947v1 | null |
2023-05-09 | Large Language Models Humanize Technology | Pratyush Kumar et.al. | 2305.05576v1 | null |
2023-05-09 | Style-A-Video: Agile Diffusion for Arbitrary Text-based Video Style Transfer | Nisha Huang et.al. | 2305.05464v1 | link |
2023-05-10 | Large Language Models Need Holistically Thought in Medical Conversational QA | Yixuan Weng et.al. | 2305.05410v2 | link |
2023-05-09 | The Multi-cluster Two-Wave Fading Model | Juan P. Pena-Martin et.al. | 2305.05342v1 | null |
2023-05-08 | DiffuseStyleGesture: Stylized Audio-Driven Co-Speech Gesture Generation with Diffusion Models | Sicheng Yang et.al. | 2305.04919v1 | link |
2023-05-08 | CaloClouds: Fast Geometry-Independent Highly-Granular Calorimeter Simulation | Erik Buhmann et.al. | 2305.04847v1 | link |
2023-05-08 | A Drop of Ink may Make a Million Think: The Spread of False Information in Large Language Models | Ning Bian et.al. | 2305.04812v1 | null |
2023-05-08 | Controllable Light Diffusion for Portraits | David Futschik et.al. | 2305.04745v1 | null |
2023-05-08 | A Closest Point Method for Surface PDEs with Interior Boundary Conditions for Geometry Processing | Nathan King et.al. | 2305.04711v1 | null |
2023-05-08 | ReGeneration Learning of Diffusion Models with Rich Prompts for Zero-Shot Image Translation | Yupei Lin et.al. | 2305.04651v1 | null |
2023-05-05 | Reflection of a Diffuser in a Liquid Interface | C. Silva et.al. | 2305.03682v1 | null |
2023-05-05 | Conditional Diffusion Feature Refinement for Continuous Sign Language Recognition | Leming Guo et.al. | 2305.03614v1 | null |
2023-05-05 | Data Curation for Image Captioning with Text-to-Image Generative Models | Wenyan Li et.al. | 2305.03610v1 | link |
2023-05-04 | Personalize Segment Anything Model with One Shot | Renrui Zhang et.al. | 2305.03048v1 | link |
2023-05-05 | Capacity Bounds for Vertically-Drifted First Arrival Position Channels under a Second-Moment Constraint | Yun-Feng Lo et.al. | 2305.02706v2 | null |
2023-05-03 | Nonlocal gravity wave turbulence in presence of condensate | A. O. Korotkevich et.al. | 2305.01930v1 | null |
2023-05-04 | DiffFacto: Controllable Part-Based 3D Point Cloud Generation with Cross Diffusion | Kiyohiro Nakayama et.al. | 2305.01921v2 | null |
2023-05-04 | The Impacts of Dimensionality, Diffusion, and Directedness on Intrinsic Cross-Model Simulation in Tile-Based Self-Assembly | Daniel Hader et.al. | 2305.01877v2 | null |
2023-05-03 | Multimodal Data Augmentation for Image Captioning using Diffusion Models | Changrong Xiao et.al. | 2305.01855v1 | link |
2023-05-02 | Unpaired Downscaling of Fluid Flows with Diffusion Bridges | Tobias Bischoff et.al. | 2305.01822v1 | link |
2023-05-02 | Multimodal Procedural Planning via Dual Text-Image Prompting | Yujie Lu et.al. | 2305.01795v1 | link |
2023-05-02 | DiffuSum: Generation Enhanced Extractive Summarization with Diffusion | Haopeng Zhang et.al. | 2305.01735v1 | link |
2023-05-02 | ContactArt: Learning 3D Interaction Priors for Category-level Articulated Object and Hand Poses Estimation | Zehao Zhu et.al. | 2305.01618v1 | null |
2023-05-02 | Adopting AI: How Familiarity Breeds Both Trust and Contempt | Michael C. Horowitz et.al. | 2305.01405v1 | null |
2023-05-02 | Long-Term Rhythmic Video Soundtracker | Jiashuo Yu et.al. | 2305.01319v1 | link |
2023-05-02 | DreamPaint: Few-Shot Inpainting of E-Commerce Items for Virtual Try-On without 3D Modeling | Mehmet Saygin Seyfioglu et.al. | 2305.01257v1 | null |
2023-05-02 | Solving Inverse Problems with Score-Based Generative Priors learned from Noisy Data | Asad Aali et.al. | 2305.01166v1 | null |
2023-05-02 | Geometric Latent Diffusion Models for 3D Molecule Generation | Minkai Xu et.al. | 2305.01140v1 | link |
2023-05-01 | Fractional and tempered fractional models for Reynolds-averaged Navier-Stokes equations | Pavan Pranjivan Mehta et.al. | 2305.00770v1 | null |
2023-05-01 | Diffusion Models for Time Series Applications: A Survey | Lequan Lin et.al. | 2305.00624v1 | null |
2023-04-30 | Class-Balancing Diffusion Models | Yiming Qin et.al. | 2305.00562v1 | link |
2023-04-30 | Towards Computational Architecture of Liberty: A Comprehensive Survey on Deep Learning for Generating Virtual Architecture in the Metaverse | Anqi Wang et.al. | 2305.00510v1 | null |
2023-04-28 | Scaling regimes in rapidly rotating thermal convection at extreme Rayleigh numbers | Jiaxing Song et.al. | 2304.14854v1 | null |
2023-04-28 | Simplified models of diffusion in radially-symmetric geometries | Luke P. Filippini et.al. | 2304.14632v1 | link |
2023-04-28 | MUDiff: Unified Diffusion for Complete Molecule Generation | Chenqing Hua et.al. | 2304.14621v1 | null |
2023-04-28 | Robust Gaussian Process Regression method for efficient reaction pathway optimization: application to surface processes | Wei Fang et.al. | 2304.14596v1 | null |
2023-04-28 | SceneGenie: Scene Graph Guided Diffusion Models for Image Synthesis | Azade Farshad et.al. | 2304.14573v1 | null |
2023-04-27 | It is all about where you start: Text-to-image generation with seed selection | Dvir Samuel et.al. | 2304.14530v1 | link |
2023-04-27 | Putting People in Their Place: Affordance-Aware Human Insertion into Scenes | Sumith Kulal et.al. | 2304.14406v1 | link |
2023-04-27 | Motion-Conditioned Diffusion Model for Controllable Video Synthesis | Tsai-Shien Chen et.al. | 2304.14404v1 | null |
2023-04-27 | Maximizing Model Generalization for Manufacturing with Self-Supervised Learning and Federated Learning | Matthew Russell et.al. | 2304.14398v1 | null |
2023-04-27 | Functional Diffusion Maps | María Barroso et.al. | 2304.14378v1 | link |
2023-04-27 | LDPC Decoders Prefer More Reliable Parity Bits: Unequal Data Protection Over BSC | Beyza Dabak et.al. | 2304.14278v1 | null |
2023-04-27 | DataComp: In search of the next generation of multimodal datasets | Samir Yitzhak Gadre et.al. | 2304.14108v1 | link |
2023-04-26 | Heuristic Barycenter Modeling of Fully Absorbing Receivers in Diffusive Molecular Communication Channels | Fardad Vakilipoor et.al. | 2304.13640v1 | null |
2023-04-26 | Identifying the structure patterns to govern the performance of localization in regulating innovation diffusion | Leyang Xue et.al. | 2304.13608v1 | null |
2023-04-26 | Bifractality of fractal scale-free networks | Jun Yamamoto et.al. | 2304.13438v1 | null |
2023-04-26 | Training-Free Location-Aware Text-to-Image Synthesis | Jiafeng Mao et.al. | 2304.13427v1 | null |
2023-04-25 | The Score-Difference Flow for Implicit Generative Modeling | Romann M. Weber et.al. | 2304.12906v1 | null |
2023-04-25 | Latent diffusion models for generative precipitation nowcasting with accurate uncertainty quantification | Jussi Leinonen et.al. | 2304.12891v1 | link |
2023-04-25 | Contrastive Energy Prediction for Exact Energy-Guided Diffusion Sampling in Offline Reinforcement Learning | Cheng Lu et.al. | 2304.12824v1 | link |
2023-04-25 | A Binary Annular Phase Mask to Regulate Spherical Aberration and Allow Super-Localization in Single-Particle Tracking over Extended Depth-of-Focus | Quentin Gresil et.al. | 2304.12774v1 | null |
2023-04-25 | Effect of trap states, ion migration and interfaces on carrier transport in single crystal, polycrystalline and thick film devices of halide perovskites CH $_3$NH$_3$PbX$_3$ (X= I, Br, Cl) | Mohd Warish et.al. | 2304.12701v1 | null |
2023-04-24 | Analyzing the neutron and |
Hiroshi Ito et.al. | 2304.12153v1 | null |
2023-04-24 | Efficient Halftoning via Deep Reinforcement Learning | Haitian Jiang et.al. | 2304.12152v1 | null |
2023-04-24 | Variational Diffusion Auto-encoder: Deep Latent Variable Model with Unconditional Diffusion Prior | Georgios Batzolis et.al. | 2304.12141v1 | null |
2023-04-24 | Customized Load Profiles Synthesis for Electricity Customers Based on Conditional Diffusion Models | Zhenyi Wang et.al. | 2304.12076v1 | null |
2023-04-24 | Improving Synthetically Generated Image Detection in Cross-Concept Settings | Pantelis Dogoulis et.al. | 2304.12053v1 | link |
2023-04-21 | BoDiffusion: Diffusing Sparse Observations for Full-Body Human Motion Synthesis | Angela Castillo et.al. | 2304.11118v1 | null |
2023-04-21 | Improved Diffusion-based Image Colorization via Piggybacked Models | Hanyuan Liu et.al. | 2304.11105v1 | null |
2023-04-21 | Perturbatively corrected ring-polymer instanton theory for accurate tunneling splittings | Joseph E. Lawrence et.al. | 2304.10963v1 | null |
2023-04-20 | Farm3D: Learning Articulated 3D Animals by Distilling 2D Diffusion | Tomas Jakab et.al. | 2304.10535v1 | null |
2023-04-20 | Nerfbusters: Removing Ghostly Artifacts from Casually Captured NeRFs | Frederik Warburg et.al. | 2304.10532v1 | link |
2023-04-20 | Collaborative Diffusion for Multi-Modal Face Generation and Editing | Ziqi Huang et.al. | 2304.10530v1 | link |
2023-04-20 | Prediction of the evolution of the nuclear reactor core parameters using artificial neural network | Krzysztof Palmi et.al. | 2304.10337v1 | null |
2023-04-20 | Avoiding methane emission rate underestimates when using the divergence method | Clayton Roberts et.al. | 2304.10303v1 | null |
2023-04-20 | Not Only Generative Art: Stable Diffusion for Content-Style Disentanglement in Art Analysis | Yankun Wu et.al. | 2304.10278v1 | link |
2023-04-19 | Irregular dependence on Stokes number and non-ergodic transport of heavy inertial particles in steady laminar flows | Anu V. S. Nath et.al. | 2304.09804v1 | null |
2023-04-19 | NeuralField-LDM: Scene Generation with Hierarchical Latent Diffusion Models | Seung Wook Kim et.al. | 2304.09787v1 | null |
2023-04-19 | Signatures of heterogeneity in the statistical structure of target state aligned ensembles | Nicolas Lenner et.al. | 2304.09719v1 | null |
2023-04-18 | Monte-Carlo method for incompressible fluid flows past obstacles | Vladislav Cherepanov et.al. | 2304.09152v1 | null |
2023-04-18 | On the seed population of solar energetic particles in the inner heliosphere | Nicolas Wijsen et.al. | 2304.09098v1 | null |
2023-04-18 | Construction of coarse-grained molecular dynamics with many-body non-Markovian memory | Liyao Lyu et.al. | 2304.09044v1 | null |
2023-04-18 | Look ATME: The Discriminator Mean Entropy Needs Attention | Edgardo Solano-Carrillo et.al. | 2304.09024v1 | link |
2023-04-18 | UPGPT: Universal Diffusion Model for Person Image Generation, Editing and Pose Transfer | Soon Yau Cheong et.al. | 2304.08870v1 | link |
2023-04-17 | Text2Performer: Text-Driven Human Video Generation | Yuming Jiang et.al. | 2304.08483v1 | link |
2023-04-18 | Latent-Shift: Latent Diffusion with Temporal Shift for Efficient Text-to-Video Generation | Jie An et.al. | 2304.08477v2 | null |
2023-04-17 | Synthetic Data from Diffusion Models Improves ImageNet Classification | Shekoofeh Azizi et.al. | 2304.08466v1 | null |
2023-04-17 | MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image Synthesis and Editing | Mingdeng Cao et.al. | 2304.08465v1 | link |
2023-04-17 | OVTrack: Open-Vocabulary Multiple Object Tracking | Siyuan Li et.al. | 2304.08408v1 | null |
2023-04-17 | Refusion: Enabling Large-Size Realistic Image Restoration with Latent-Space Diffusion Models | Ziwei Luo et.al. | 2304.08291v1 | link |
2023-04-17 | Solving stiff ordinary differential equations using physics informed neural networks (PINNs): simple recipes to improve training of vanilla-PINNs | Hubert Baty et.al. | 2304.08289v1 | link |
2023-04-14 | A Comparative Study on Generative Models for High Resolution Solar Observation Imaging | Mehdi Cherti et.al. | 2304.07169v1 | link |
2023-04-14 | Towards Controllable Diffusion Models via Reward-Guided Exploration | Hengtong Zhang et.al. | 2304.07132v1 | null |
2023-04-14 | Delta Denoising Score | Amir Hertz et.al. | 2304.07090v1 | null |
2023-04-14 | Memory Efficient Diffusion Probabilistic Models via Patch-based Generation | Shinei Arakawa et.al. | 2304.07087v1 | null |
2023-04-14 | DCFace: Synthetic Face Generation with Dual Condition Diffusion Model | Minchul Kim et.al. | 2304.07060v1 | link |
2023-04-14 | A Diffusion model for POI recommendation | Yifang Qin et.al. | 2304.07041v1 | link |
2023-04-13 | Expressive Text-to-Image Generation with Rich Text | Songwei Ge et.al. | 2304.06720v1 | null |
2023-04-13 | Single-Stage Diffusion NeRF: A Unified Approach to 3D Generation and Reconstruction | Hansheng Chen et.al. | 2304.06714v1 | link |
2023-04-13 | DiffusionRig: Learning Personalized Priors for Facial Appearance Editing | Zheng Ding et.al. | 2304.06711v1 | link |
2023-04-13 | Learning Controllable 3D Diffusion Models from Single-view Images | Jiatao Gu et.al. | 2304.06700v1 | null |
2023-04-13 | DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-Efficient Fine-Tuning | Enze Xie et.al. | 2304.06648v1 | null |
2023-04-12 | Continual Diffusion: Continual Customization of Text-to-Image Diffusion with C-LoRA | James Seale Smith et.al. | 2304.06027v1 | null |
2023-04-12 | DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion | Johanna Karras et.al. | 2304.06025v1 | null |
2023-04-12 | Probabilistic Human Mesh Recovery in 3D Scenes from Egocentric Views | Siwei Zhang et.al. | 2304.06024v1 | link |
2023-04-12 | SpectralDiff: Hyperspectral Image Classification with Spectral-Spatial Diffusion Models | Ning Chen et.al. | 2304.05961v1 | link |
2023-04-12 | Diffusion models with location-scale noise | Alexia Jolicoeur-Martineau et.al. | 2304.05907v1 | null |
2023-04-12 | Cancer-Net BCa-S: Breast Cancer Grade Prediction using Volumetric Deep Radiomic Features from Synthetic Correlated Diffusion Imaging | Chi-en Amy Tai et.al. | 2304.05899v1 | link |
2023-04-11 | HRS-Bench: Holistic, Reliable and Scalable Benchmark for Text-to-Image Models | Eslam Mohamed Bakr et.al. | 2304.05390v1 | link |
2023-04-11 | Diffusion Models for Constrained Domains | Nic Fishman et.al. | 2304.05364v1 | link |
2023-04-11 | Multi-scale Fusion Fault Diagnosis Method Based on Two-Dimensionaliztion Sequence in Complex Scenarios | Weiyang Jin et.al. | 2304.05198v1 | null |
2023-04-10 | A Cheaper and Better Diffusion Language Model with Soft-Masked Noise | Jiaao Chen et.al. | 2304.04746v1 | link |
2023-04-10 | Ambiguous Medical Image Segmentation using Diffusion Models | Aimon Rahman et.al. | 2304.04745v1 | link |
2023-04-10 | Sequential Recommendation with Diffusion Models | Hanwen Du et.al. | 2304.04541v1 | null |
2023-04-07 | Compressed Regression over Adaptive Networks | Marco Carpentiero et.al. | 2304.03638v1 | null |
2023-04-07 | Exploring Collaborative Distributed Diffusion-Based AI-Generated Content (AIGC) in Wireless Networks | Hongyang Du et.al. | 2304.03446v1 | link |
2023-04-06 | RoSteALS: Robust Steganography using Autoencoder Latent Space | Tu Bui et.al. | 2304.03400v1 | link |
2023-04-06 | Diffusion Models as Masked Autoencoders | Chen Wei et.al. | 2304.03283v1 | null |
2023-04-06 | Inst-Inpaint: Instructing to Remove Objects with Diffusion Models | Ahmet Burak Yildirim et.al. | 2304.03246v1 | link |
2023-04-06 | Face Animation with an Attribute-Guided Diffusion Model | Bohan Zeng et.al. | 2304.03199v1 | link |
2023-04-06 | SketchFFusion: Sketch-guided image editing with diffusion model | Weihang Mao et.al. | 2304.03174v1 | null |
2023-04-05 | Taming Encoder for Zero Fine-tuning Image Customization with Text-to-Image Diffusion Models | Xuhui Jia et.al. | 2304.02642v1 | null |
2023-04-05 | GenPhys: From Physical Processes to Generative Models | Ziming Liu et.al. | 2304.02637v1 | null |
2023-04-05 | An atlas of the heterogeneous viscoelastic brain with local power-law attenuation synthesised using Prony-series | Oisin Morrison et.al. | 2304.02610v1 | null |
2023-04-05 | Generative Novel View Synthesis with 3D-Aware Diffusion Models | Eric R. Chan et.al. | 2304.02602v1 | null |
2023-04-05 | Diffusion across a concentration step: Strongly nonmonotonic evolution into thermodynamic equilibrium | Hans R. Moser et.al. | 2304.02557v1 | null |
2023-04-04 | viz2viz: Prompt-driven stylized visualization generation using a diffusion model | Jiaqi Wu et.al. | 2304.01919v1 | null |
2023-04-04 | PODIA-3D: Domain Adaptation of 3D Generative Model Across Large Domain Gap Using Pose-Preserved Text-to-Image Diffusion | Gwanghyun Kim et.al. | 2304.01900v1 | null |
2023-04-04 | Trace and Pace: Controllable Pedestrian Animation via Guided Trajectory Diffusion | Davis Rempe et.al. | 2304.01893v1 | null |
2023-04-04 | Quantitative perfusion and water transport time model from multi b-value diffusion magnetic resonance imaging validated against neutron capture microspheres | M. Liu et.al. | 2304.01888v1 | null |
2023-04-04 | Adaptive learning of effective dynamics: Adaptive real-time, online modeling for complex systems | Ivica Kičić et.al. | 2304.01732v1 | link |
2023-04-03 | Learning to Read Braille: Bridging the Tactile Reality Gap with Diffusion Models | Carolina Higuera et.al. | 2304.01182v1 | link |
2023-04-03 | ReMoDiffuse: Retrieval-Augmented Motion Diffusion Model | Mingyuan Zhang et.al. | 2304.01116v1 | link |
2023-04-03 | ViT-DAE: Transformer-driven Diffusion Autoencoder for Histopathology Image Analysis | Xuan Xu et.al. | 2304.01053v1 | null |
2023-04-03 | DreamAvatar: Text-and-Shape Guided 3D Human Avatar Generation via Diffusion Models | Yukang Cao et.al. | 2304.00916v1 | link |
2023-03-31 | Sam Bond-Taylor et.al. | 2303.18242v1 | link | |
2023-03-31 | A Closer Look at Parameter-Efficient Tuning in Diffusion Models | Chendong Xiang et.al. | 2303.18181v1 | link |
2023-03-31 | One-shot Unsupervised Domain Adaptation with Personalized Diffusion Models | Yasser Benigmim et.al. | 2303.18080v1 | link |
2023-03-30 | AvatarCraft: Transforming Text into Neural Human Avatars with Parameterized Shape and Pose Control | Ruixiang Jiang et.al. | 2303.17606v1 | link |
2023-03-30 | Token Merging for Fast Stable Diffusion | Daniel Bolya et.al. | 2303.17604v1 | link |
2023-03-30 | Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models | Wen Wang et.al. | 2303.17599v1 | link |
2023-03-30 | Consistent View Synthesis with Pose-Guided Diffusion Models | Hung-Yu Tseng et.al. | 2303.17598v1 | null |
2023-03-30 | Forget-Me-Not: Learning to Forget in Text-to-Image Diffusion Models | Eric Zhang et.al. | 2303.17591v1 | link |
2023-03-30 | DDP: Diffusion Model for Dense Visual Prediction | Yuanfeng Ji et.al. | 2303.17559v1 | link |
2023-03-30 | DAE-Talker: High Fidelity Speech-Driven Talking Face Generation with Diffusion Autoencoder | Chenpng Du et.al. | 2303.17550v1 | null |
2023-03-30 | PAIR-Diffusion: Object-Level Image Editing with Structure-and-Appearance Paired Diffusion Models | Vidit Goel et.al. | 2303.17546v1 | link |
2023-03-29 | Physics-Driven Diffusion Models for Impact Sound Synthesis from Videos | Kun Su et.al. | 2303.16897v1 | null |
2023-03-30 | MDP: A Generalized Framework for Text-Guided Image Editing by Manipulating the Diffusion Path | Qian Wang et.al. | 2303.16765v2 | link |
2023-03-29 | 4D Facial Expression Diffusion Model | Kaifeng Zou et.al. | 2303.16611v1 | link |
2023-03-29 | WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models | Konstantina Nikolaidou et.al. | 2303.16576v1 | link |
2023-03-29 | Your Diffusion Model is Secretly a Zero-Shot Classifier | Alexander C. Li et.al. | 2303.16203v2 | link |
2023-03-28 | Visual Chain-of-Thought Diffusion Models | William Harvey et.al. | 2303.16187v1 | link |
2023-03-28 | Diffusion Maps for Group-Invariant Manifolds | Paulina Hoyos et.al. | 2303.16169v1 | null |
2023-03-28 | Novel View Synthesis of Humans using Differentiable Rendering | Guillaume Rochette et.al. | 2303.15880v1 | link |
2023-03-27 | The Stable Signature: Rooting Watermarks in Latent Diffusion Models | Pierre Fernandez et.al. | 2303.15435v1 | link |
2023-03-27 | Anti-DreamBooth: Protecting users from personalized text-to-image synthesis | Thanh Van Le et.al. | 2303.15433v1 | link |
2023-03-27 | Debiasing Scores and Prompts of 2D Diffusion for Robust Text-to-3D Generation | Susung Hong et.al. | 2303.15413v1 | link |
2023-03-27 | Training-free Style Transfer Emerges from h-space in Diffusion models | Jaeseok Jeong et.al. | 2303.15403v1 | null |
2023-03-27 | Exploring Continual Learning of Diffusion Models | Michał Zając et.al. | 2303.15342v1 | null |
2023-03-27 | Diffusion Models for Memory-efficient Processing of 3D Medical Images | Florentin Bieder et.al. | 2303.15288v1 | link |
2023-03-27 | Text-to-Image Diffusion Models are Zero-Shot Classifiers | Kevin Clark et.al. | 2303.15233v1 | null |
2023-03-24 | Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior | Junshu Tang et.al. | 2303.14184v1 | link |
2023-03-24 | MindDiffuser: Controlled Image Reconstruction from Human Brain Activity with Semantic and Structural Diffusion | Yizhuo Lu et.al. | 2303.14139v1 | null |
2023-03-24 | CIFAKE: Image Classification and Explainable Identification of AI-Generated Synthetic Images | Jordan J. Bird et.al. | 2303.14126v1 | null |
2023-03-24 | Electron transport measurements in liquid xenon with Xenoscope, a large-scale DARWIN demonstrator | L. Baudis et.al. | 2303.13963v1 | null |
2023-03-23 | Ablating Concepts in Text-to-Image Diffusion Models | Nupur Kumari et.al. | 2303.13516v1 | link |
2023-03-23 | ReVersion: Diffusion-Based Relation Inversion from Images | Ziqi Huang et.al. | 2303.13495v1 | link |
2023-03-23 | Scaling laws of two-dimensional incompressible turbulent transport | D. I. Palade et.al. | 2303.13457v1 | null |
2023-03-23 | Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators | Levon Khachatryan et.al. | 2303.13439v1 | link |
2023-03-23 | Medical diffusion on a budget: textual inversion for medical image generation | Bram de Wilde et.al. | 2303.13430v1 | null |
2023-03-23 | DDT: A Diffusion-Driven Transformer-based Framework for Human Mesh Recovery from a Video | Ce Zheng et.al. | 2303.13397v1 | null |
2023-03-23 | Audio Diffusion Model for Speech Synthesis: A Survey on Text To Speech and Speech Enhancement in Generative AI | Chenshuang Zhang et.al. | 2303.13336v1 | null |
2023-03-23 | Decentralized Adversarial Training over Graphs | Ying Cao et.al. | 2303.13326v1 | null |
2023-03-23 | Fourier Diffusion Models: A Method to Control MTF and NPS in Score-Based Stochastic Image Generation | Matthew Tivnan et.al. | 2303.13285v1 | null |
2023-03-22 | Diffuse-Denoise-Count: Accurate Crowd-Counting with Diffusion Models | Yasiru Ranasinghe et.al. | 2303.12790v1 | link |
2023-03-22 | Instruct-NeRF2NeRF: Editing 3D Scenes with Instructions | Ayaan Haque et.al. | 2303.12789v1 | null |
2023-03-22 | FeatureNeRF: Learning Generalizable NeRFs by Distilling Foundation Models | Jianglong Ye et.al. | 2303.12786v1 | null |
2023-03-22 | Effect of gamma radiation on electrical properties of diffusive memristor devices | D. P. Pattnaik et.al. | 2303.12762v1 | null |
2023-03-22 | Pix2Video: Video Editing using Image Diffusion | Duygu Ceylan et.al. | 2303.12688v1 | link |
2023-03-23 | Feature-Conditioned Cascaded Video Diffusion Models for Precise Echocardiogram Synthesis | Hadrien Reynaud et.al. | 2303.12644v2 | link |
2023-03-22 | A Perceptual Quality Assessment Exploration for AIGC Images | Zicheng Zhang et.al. | 2303.12618v1 | null |
2023-03-21 | Vox-E: Text-guided Voxel Editing of 3D Objects | Etai Sella et.al. | 2303.12048v1 | link |
2023-03-21 | Semantic Latent Space Regression of Diffusion Autoencoders for Vertebral Fracture Grading | Matthias Keicher et.al. | 2303.12031v1 | null |
2023-03-21 | Numerical simulation of self-oscillating catalytic reaction in a plug-flow reactor | N. V. Peskov et.al. | 2303.12022v1 | null |
2023-03-21 | 3D-CLFusion: Fast Text-to-3D Rendering with Contrastive Latent Diffusion | Yu-Jhe Li et.al. | 2303.11938v1 | null |
2023-03-21 | CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion | Geonmo Gu et.al. | 2303.11916v1 | link |
2023-03-21 | Projections of Model Spaces for Latent Graph Inference | Haitz Sáez de Ocáriz Borde et.al. | 2303.11754v1 | null |
2023-03-20 | Zero-1-to-3: Zero-shot One Image to 3D Object | Ruoshi Liu et.al. | 2303.11328v1 | link |
2023-03-20 | Localizing Object-level Shape Variations with Text-to-Image Diffusion Models | Or Patashnik et.al. | 2303.11306v1 | null |
2023-03-20 | SVDiff: Compact Parameter Space for Diffusion Fine-Tuning | Ligong Han et.al. | 2303.11305v1 | link |
2023-03-20 | AnimeDiffusion: Anime Face Line Drawing Colorization via Diffusion Models | Yu Cao et.al. | 2303.11137v1 | link |
2023-03-17 | A Recipe for Watermarking Diffusion Models | Yunqing Zhao et.al. | 2303.10137v1 | link |
2023-03-17 | Data-Centric Learning from Unlabeled Graphs with Diffusion Model | Gang Liu et.al. | 2303.10108v1 | link |
2023-03-17 | DialogPaint: A Dialog-based Image Editing Model | Jingxuan Wei et.al. | 2303.10073v1 | null |
2023-03-17 | GlueGen: Plug and Play Multi-modal Encoders for X-to-image Generation | Can Qin et.al. | 2303.10056v1 | link |
2023-03-17 | On the momentum diffusion over multiphase surfaces with meshless methods | Johannes C. Joubert et.al. | 2303.09978v1 | null |
2023-03-17 | Adversarial Counterfactual Visual Explanations | Guillaume Jeanneret et.al. | 2303.09962v1 | link |
2023-03-17 | Discovering mesoscopic descriptions of collective movement with neural stochastic modelling | Utkarsh Pratiush et.al. | 2303.09906v1 | link |
2023-03-16 | Efficient Diffusion Training via Min-SNR Weighting Strategy | Tiankai Hang et.al. | 2303.09556v1 | link |
2023-03-16 | Diffusion-HPC: Generating Synthetic Images with Realistic Humans | Zhenzhen Weng et.al. | 2303.09541v1 | link |
2023-03-17 | FateZero: Fusing Attentions for Zero-shot Text-based Video Editing | Chenyang Qi et.al. | 2303.09535v2 | link |
2023-03-16 | Andrey Voynov et.al. | 2303.09522v1 | null | |
2023-03-16 | DiffIR: Efficient Diffusion Model for Image Restoration | Bin Xia et.al. | 2303.09472v1 | link |
2023-03-16 | Unwrapping NPT simulations to calculate diffusion coefficients | Jakob Tómas Bullerjahn et.al. | 2303.09418v1 | null |
2023-03-17 | DINAR: Diffusion Inpainting of Neural Textures for One-Shot Human Avatars | David Svitov et.al. | 2303.09375v2 | link |
2023-03-15 | Stochastic Interpolants: A Unifying Framework for Flows and Diffusions | Michael S. Albergo et.al. | 2303.08797v1 | null |
2023-03-15 | Highly Personalized Text Embedding for Image Manipulation by Stable Diffusion | Inhwa Han et.al. | 2303.08767v1 | null |
2023-03-15 | Advanced Analysis of Radar Cross-Section Measurements in Reverberation Environment | Corentin Charlo et.al. | 2303.08751v1 | null |
2023-03-15 | DiffusionAD: Denoising Diffusion for Anomaly Detection | Hui Zhang et.al. | 2303.08730v1 | link |
2023-03-16 | ResDiff: Combining CNN and Diffusion Model for Image Super-Resolution | Shuyao Shang et.al. | 2303.08714v2 | null |
2023-03-15 | Zero-Shot Contrastive Loss for Text-Guided Diffusion Image Style Transfer | Serin Yang et.al. | 2303.08622v1 | link |
2023-03-14 | LayoutDM: Discrete Diffusion Model for Controllable Layout Generation | Naoto Inoue et.al. | 2303.08137v1 | link |
2023-03-14 | MeshDiffusion: Score-based Generative 3D Mesh Modeling | Zhen Liu et.al. | 2303.08133v1 | link |
2023-03-14 | Editing Implicit Assumptions in Text-to-Image Diffusion Models | Hadas Orgad et.al. | 2303.08084v1 | link |
2023-03-15 | Interpretable ODE-style Generative Diffusion Model via Force Field Construction | Weiyang Jin et.al. | 2303.08063v2 | null |
2023-03-14 | Edit-A-Video: Single Video Editing with Object-Aware Consistency | Chaehun Shin et.al. | 2303.07945v1 | null |
2023-03-15 | Controllable Mesh Generation Through Sparse Latent Point Diffusion Models | Zhaoyang Lyu et.al. | 2303.07938v2 | null |
2023-03-15 | Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D Generation | Junyoung Seo et.al. | 2303.07937v2 | link |
2023-03-13 | Erasing Concepts from Diffusion Models | Rohit Gandikota et.al. | 2303.07345v1 | link |
2023-03-14 | Parallel Vertex Diffusion for Unified Visual Grounding | Zesen Cheng et.al. | 2303.07216v2 | null |
2023-03-10 | GECCO: Geometrically-Conditioned Point Diffusion Models | Michał J. Tyszkiewicz et.al. | 2303.05916v1 | null |
2023-03-10 | Photon Diffusion in Microscale Solids | Avijit Das et.al. | 2303.05776v1 | null |
2023-03-10 | TrojDiff: Trojan Attacks on Diffusion Models with Diverse Targets | Weixin Chen et.al. | 2303.05762v1 | link |
2023-03-10 | Fast Diffusion Sampler for Inverse Problems by Geometric Decomposition | Hyungjin Chung et.al. | 2303.05754v1 | link |
2023-03-09 | Scaling up GANs for Text-to-Image Synthesis | Minguk Kang et.al. | 2303.05511v1 | null |
2023-03-09 | Resolving quantitative MRI model degeneracy with machine learning via training data distribution design | Michele Guerreri et.al. | 2303.05464v1 | null |
2023-03-09 | 3DGen: Triplane Latent Diffusion for Textured Mesh Generation | Anchit Gupta et.al. | 2303.05371v1 | null |
2023-03-09 | TGDataset: a Collection of Over One Hundred Thousand Telegram Channels | Massimo La Morgia et.al. | 2303.05345v1 | link |
2023-03-09 | Brain-Diffuser: Natural scene reconstruction from fMRI signals using generative latent diffusion | Furkan Ozcelik et.al. | 2303.05334v1 | link |
2023-03-08 | Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models | Jiarui Xu et.al. | 2303.04803v1 | link |
2023-03-08 | Multilevel Diffusion: Infinite Dimensional Score-Based Diffusion Models for Image Generation | Paul Hagemann et.al. | 2303.04772v1 | link |
2023-03-08 | Video-P2P: Video Editing with Cross-attention Control | Shaoteng Liu et.al. | 2303.04761v1 | null |
2023-03-08 | Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models | Chenfei Wu et.al. | 2303.04671v1 | link |
2023-03-08 | Diffusing Gaussian Mixtures for Generating Categorical Data | Florence Regol et.al. | 2303.04635v1 | link |
2023-03-08 | Connecting finite-time Lyapunov exponents with supersaturation and droplet dynamics in the bulk of a turbulent cloud | Vladyslav Pushenko et.al. | 2303.04632v1 | null |
2023-03-08 | Maritime transportation and people mobility in the early diffusion of COVID-19 in Croatia | Corentin Cot et.al. | 2303.04617v1 | null |
2023-03-07 | Diffusion Policy: Visuomotor Policy Learning via Action Diffusion | Cheng Chi et.al. | 2303.04137v1 | null |
2023-03-06 | Restoration-Degradation Beyond Linear Diffusions: A Non-Asymptotic Analysis For DDIM-Type Samplers | Sitan Chen et.al. | 2303.03384v1 | null |
2023-03-06 | StyO: Stylize Your Face in Only One-Shot | Bonan Li et.al. | 2303.03231v1 | null |
2023-03-03 | Unleashing Text-to-Image Diffusion Models for Visual Perception | Wenliang Zhao et.al. | 2303.02153v1 | link |
2023-03-03 | Multi-Agent Adversarial Training Using Diffusion Learning | Ying Cao et.al. | 2303.01936v1 | null |
2023-03-03 | CONTAIN: A Community-based Algorithm for Network Immunization | Özgur Coban et.al. | 2303.01934v1 | link |
2023-03-02 | Consistency Models | Yang Song et.al. | 2303.01469v1 | link |
2023-03-02 | Human Motion Diffusion as a Generative Prior | Yonatan Shafir et.al. | 2303.01418v1 | link |
2023-03-02 | Why (and When) does Local SGD Generalize Better than SGD? | Xinran Gu et.al. | 2303.01215v1 | link |
2023-03-01 | StraIT: Non-autoregressive Generation with Stratified Image Transformer | Shengju Qian et.al. | 2303.00750v1 | null |
2023-03-01 | Diffusing Graph Attention | Daniel Glickman et.al. | 2303.00613v1 | null |
2023-03-01 | Level Up the Deepfake Detection: a Method to Effectively Discriminate Images Generated by GAN Architectures and Diffusion Models | Luca Guarnera et.al. | 2303.00608v1 | null |
2023-03-01 | Unlimited-Size Diffusion Restoration | Yinhuai Wang et.al. | 2303.00354v1 | link |
2023-03-01 | Collage Diffusion | Vishnu Sarukkai et.al. | 2303.00262v1 | null |
2023-03-01 | Diffusion Probabilistic Fields | Peiye Zhuang et.al. | 2303.00165v1 | null |
2023-02-28 | Phase Field Modeling of Dictyostelium Discoideum Chemotaxis | Yunsong Zhang et.al. | 2302.14854v1 | null |
2023-02-28 | Monocular Depth Estimation using Diffusion Models | Saurabh Saxena et.al. | 2302.14816v1 | null |
2023-02-28 | Dissolving Is Amplifying: Towards Fine-Grained Anomaly Detection | Jian Shi et.al. | 2302.14696v1 | link |
2023-02-28 | Synthesizing Mixed-type Electronic Health Records using Diffusion Models | Taha Ceritli et.al. | 2302.14679v1 | null |
2023-02-28 | Detecting and Optimising Team Interactions in Software Development | Christian Zingg et.al. | 2302.14609v1 | null |
2023-02-28 | Can We Use Diffusion Probabilistic Models for 3D Motion Prediction? | Hyemin Ahn et.al. | 2302.14503v1 | null |
2023-02-27 | Buoyancy-driven attraction of active droplets | Yibo Chen et.al. | 2302.14008v1 | null |
2023-02-27 | Impact of reconstruction schemes on interpreting lattice Boltzmann results -- A study using the Taylor-Green vortex problem | Jianping Meng et.al. | 2302.13910v1 | null |
2023-02-27 | Differentially Private Diffusion Models Generate Useful Synthetic Images | Sahra Ghalebikesabi et.al. | 2302.13861v1 | null |
2023-02-27 | Denoising Diffusion Samplers | Francisco Vargas et.al. | 2302.13834v1 | null |
2023-02-24 | Modulating Pretrained Diffusion Models for Multimodal Image Synthesis | Cusuh Ham et.al. | 2302.12764v1 | null |
2023-02-24 | Physical interactions promote Turing patterns | Lucas Menou et.al. | 2302.12521v1 | null |
2023-02-24 | Flow instability and momentum exchange in separation control by a synthetic jet | Yoshiaki Abe et.al. | 2302.12496v1 | null |
2023-02-24 | Unsupervised Discovery of Semantic Latent Directions in Diffusion Models | Yong-Hyun Park et.al. | 2302.12469v1 | null |
2023-02-23 | To the Noise and Back: Diffusion for Shared Autonomy | Takuma Yoneda et.al. | 2302.12244v1 | null |
2023-02-23 | DiffusioNeRF: Regularizing Neural Radiance Fields with Denoising Diffusion Models | Jamie Wynn et.al. | 2302.12231v1 | link |
2023-02-23 | Designing an Encoder for Fast Personalization of Text-to-Image Models | Rinon Gal et.al. | 2302.12228v1 | null |
2023-02-23 | Metric-oriented Speech Enhancement using Diffusion Probabilistic Model | Chen Chen et.al. | 2302.11989v1 | null |
2023-02-22 | Uncovering Bias in Face Generation Models | Cristian Muñoz et.al. | 2302.11562v1 | null |
2023-02-22 | Reduce, Reuse, Recycle: Compositional Generation with Energy-Based Diffusion Models and MCMC | Yilun Du et.al. | 2302.11552v1 | link |
2023-02-22 | Scaling Robot Learning with Semantically Imagined Experience | Tianhe Yu et.al. | 2302.11550v1 | null |
2023-02-22 | Aligned Diffusion Schrödinger Bridges | Vignesh Ram Somnath et.al. | 2302.11419v1 | link |
2023-02-22 | Entity-Level Text-Guided Image Manipulation | Yikai Wang et.al. | 2302.11383v1 | link |
2023-02-22 | An agent-based model of the 2020 international policy diffusion in response to the COVID-19 pandemic with particle filter | Yannick Oswald et.al. | 2302.11277v1 | link |
2023-02-21 | Provable Copyright Protection for Generative Models | Nikhil Vyas et.al. | 2302.10870v1 | null |
2023-02-21 | Learning 3D Photography Videos via Self-supervised Diffusion on Single Images | Xiaodong Wang et.al. | 2302.10781v1 | null |
2023-02-21 | On Calibrating Diffusion Probabilistic Models | Tianyu Pang et.al. | 2302.10688v1 | link |
2023-02-21 | Luke Melas-Kyriazi et.al. | 2302.10668v1 | link | |
2023-02-21 | RealFusion: 360° Reconstruction of Any Object from a Single Image | Luke Melas-Kyriazi et.al. | 2302.10663v1 | null |
2023-02-21 | Diffusion Models and Semi-Supervised Learners Benefit Mutually with Few Labels | Zebin You et.al. | 2302.10586v1 | link |
2023-02-20 | Towards Universal Fake Image Detectors that Generalize Across Generative Models | Utkarsh Ojha et.al. | 2302.10174v1 | link |
2023-02-20 | Cross-domain Compositing with Pretrained Diffusion Models | Roy Hachnochi et.al. | 2302.10167v1 | link |
2023-02-20 | NerfDiff: Single-image View Synthesis with NeRF-guided Distillation from 3D-aware Diffusion | Jiatao Gu et.al. | 2302.10109v1 | null |
2023-02-20 | DINOISER: Diffused Conditional Sequence Learning by Manipulating Noises | Jiasheng Ye et.al. | 2302.10025v1 | link |
2023-02-17 | Consistent Diffusion Models: Mitigating Sampling Drift by Learning to be Consistent | Giannis Daras et.al. | 2302.09057v1 | link |
2023-02-17 | MiDi: Mixed Graph and 3D Denoising Diffusion for Molecule Generation | Clement Vignac et.al. | 2302.09048v1 | link |
2023-02-17 | LDFA: Latent Diffusion Face Anonymization for Self-driving Applications | Marvin Klemp et.al. | 2302.08931v1 | null |
2023-02-17 | Multi-unit Auction over a Social Network | Yuan Fang et.al. | 2302.08924v1 | null |
2023-02-17 | Unraveling the Variations of the Society of England and Wales through Diffusion Maps Analysis on Census 2011 | Gezhi Xiu et.al. | 2302.08701v1 | null |
2023-02-16 | Text-driven Visual Synthesis with Latent Diffusion Prior | Ting-Hsuan Liao et.al. | 2302.08510v1 | null |
2023-02-16 | T2I-Adapter: Learning Adapters to Dig out More Controllable Ability for Text-to-Image Diffusion Models | Chong Mou et.al. | 2302.08453v1 | link |
2023-02-16 | Explicit Diffusion of Gaussian Mixture Model Based Image Priors | Martin Zach et.al. | 2302.08411v1 | null |
2023-02-16 | Boundary Guided Mixing Trajectory for Semantic Control with Diffusion Models | Ye Zhu et.al. | 2302.08357v1 | link |
2023-02-15 | Dataset Interfaces: Diagnosing Model Failures Using Controllable Counterfactual Generation | Joshua Vendrow et.al. | 2302.07865v1 | link |
2023-02-15 | Denoising Diffusion Probabilistic Models for Robust Image Super-Resolution in the Wild | Hshmat Sahak et.al. | 2302.07864v1 | null |
2023-02-15 | Data Forensics in Diffusion Models: A Systematic Analysis of Membership Privacy | Derui Zhu et.al. | 2302.07801v1 | null |
2023-02-15 | Video Probabilistic Diffusion Models in Projected Latent Space | Sihyun Yu et.al. | 2302.07685v1 | null |
2023-02-14 | Where to Diffuse, How to Diffuse, and How to Get Back: Automated Learning for Multivariate Diffusions | Raghav Singhal et.al. | 2302.07261v1 | null |
2023-02-14 | Score Approximation, Estimation and Distribution Recovery of Diffusion Models on Low-Dimensional Data | Minshuo Chen et.al. | 2302.07194v1 | null |
2023-02-14 | Universal Guidance for Diffusion Models | Arpit Bansal et.al. | 2302.07121v1 | link |
2023-02-14 | Differential privacy diffusion auction of homogeneous items | Fengjuan Jia et.al. | 2302.07072v1 | null |
2023-02-14 | Direct numerical simulations of the Taylor-Green Vortex interacting with a hydrogen diffusion flame: Reynolds number and non-unity Lewis number effects | Yifan Xu et.al. | 2302.07006v1 | null |
2023-02-13 | Raising the Cost of Malicious AI-Powered Image Editing | Hadi Salman et.al. | 2302.06588v1 | link |
2023-02-13 | Preconditioned Score-based Generative Models | Li Zhang et.al. | 2302.06504v1 | link |
2023-02-13 | Technical Note: PDE-constrained Optimization Formulation for Tumor Growth Model Calibration | Baoshan Liang et.al. | 2302.06445v1 | null |
2023-02-13 | ContrasInver: Voxel-wise Contrastive Semi-supervised Learning for Seismic Inversion | Yimin Dou et.al. | 2302.06441v1 | null |
2023-02-13 | Interplay between advective, diffusive, and active barriers in Rayleigh-Bénard flow | Nikolas Aksamit et.al. | 2302.06319v1 | null |
2023-02-10 | Example-Based Sampling with Diffusion Models | Bastien Doignies et.al. | 2302.05116v1 | null |
2023-02-09 | UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models | Wenliang Zhao et.al. | 2302.04867v1 | link |
2023-02-09 | RelightableHands: Efficient Neural Relighting of Articulated Hand Models | Shun Iwase et.al. | 2302.04866v1 | null |
2023-02-09 | Is This Loss Informative? Speeding Up Textual Inversion with Deterministic Objective Evaluation | Anton Voronov et.al. | 2302.04841v1 | link |
2023-02-09 | Better Diffusion Models Further Improve Adversarial Training | Zekai Wang et.al. | 2302.04638v1 | link |
2023-02-09 | Adversarial Example Does Good: Preventing Painting Imitation from Diffusion Models via Adversarial Examples | Chumeng Liang et.al. | 2302.04578v1 | link |
2023-02-08 | PFGM++: Unlocking the Potential of Physics-Inspired Generative Models | Yilun Xu et.al. | 2302.04265v1 | link |
2023-02-08 | GLAZE: Protecting Artists from Style Mimicry by Text-to-Image Models | Shawn Shan et.al. | 2302.04222v1 | null |
2023-02-08 | Policy Evaluation in Decentralized POMDPs with Belief Sharing | Mert Kayaalp et.al. | 2302.04151v1 | link |
2023-02-08 | Dimensional lattice Boltzmann method for transport phenomena simulation without conversion to lattice units | Ivan Talão Martins et.al. | 2302.04120v1 | null |
2023-02-07 | Long Horizon Temperature Scaling | Andy Shih et.al. | 2302.03686v1 | link |
2023-02-07 | Hard Prompts Made Easy: Gradient-Based Discrete Optimization for Prompt Tuning and Discovery | Yuxin Wen et.al. | 2302.03668v1 | link |
2023-02-07 | HumanMAC: Masked Motion Completion for Human Motion Prediction | Ling-Hao Chen et.al. | 2302.03665v1 | link |
2023-02-07 | Graph Generation with Destination-Driven Diffusion Mixture | Jaehyeong Jo et.al. | 2302.03596v1 | link |
2023-02-06 | Zero-shot Image-to-Image Translation | Gaurav Parmar et.al. | 2302.03027v1 | link |
2023-02-06 | Structure and Content-Guided Video Synthesis with Diffusion Models | Patrick Esser et.al. | 2302.03011v1 | null |
2023-02-03 | AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners | Zhixuan Liang et.al. | 2302.01877v1 | link |
2023-02-03 | TEXTure: Text-Guided Texturing of 3D Shapes | Elad Richardson et.al. | 2302.01721v1 | link |
2023-02-03 | Learning End-to-End Channel Coding with Diffusion Models | Muah Kim et.al. | 2302.01714v1 | null |
2023-02-03 | A Lipschitz Bandits Approach for Continuous Hyperparameter Optimization | Yasong Feng et.al. | 2302.01539v1 | null |
2023-02-02 | Dreamix: Video Diffusion Models are General Video Editors | Eyal Molad et.al. | 2302.01329v1 | null |
2023-02-02 | Are Diffusion Models Vulnerable to Membership Inference Attacks? | Jinhao Duan et.al. | 2302.01316v1 | link |
2023-02-01 | Stable Target Field for Reduced Variance Score Estimation in Diffusion Models | Yilun Xu et.al. | 2302.00670v1 | link |
2023-01-31 | Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models | Hila Chefer et.al. | 2301.13826v1 | link |
2023-01-30 | Extracting Training Data from Diffusion Models | Nicholas Carlini et.al. | 2301.13188v1 | null |
2023-01-30 | Shape-aware Text-driven Layered Video Editing | Yao-Chih Lee et.al. | 2301.13173v1 | null |
2023-01-30 | GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis | Ming Tao et.al. | 2301.12959v1 | link |
2023-01-30 | ERA-Solver: Error-Robust Adams Solver for Fast Sampling of Diffusion Probabilistic Models | Shengmeng Li et.al. | 2301.12935v1 | null |
2023-01-30 | PromptMix: Text-to-image diffusion models enhance the performance of lightweight networks | Arian Bakhtiarnia et.al. | 2301.12914v1 | null |
2023-01-27 | Moûsai: Text-to-Music Generation with Long-Context Latent Diffusion | Flavio Schneider et.al. | 2301.11757v1 | link |
2023-01-27 | Diffusion Models as Artists: Are we Closing the Gap between Humans and Machines? | Victor Boutin et.al. | 2301.11722v1 | link |
2023-01-26 | simple diffusion: End-to-end diffusion for high resolution images | Emiel Hoogeboom et.al. | 2301.11093v1 | null |
2023-01-26 | On the Importance of Noise Scheduling for Diffusion Models | Ting Chen et.al. | 2301.10972v1 | null |
2023-01-25 | Imitating Human Behaviour with Diffusion Models | Tim Pearce et.al. | 2301.10677v1 | link |
2023-01-24 | Bipartite Graph Diffusion Model for Human Interaction Generation | Baptiste Chopin et.al. | 2301.10134v1 | link |
2023-01-24 | DiffMotion: Speech-Driven Gesture Synthesis Using Denoising Diffusion Model | Fan Zhang et.al. | 2301.10047v1 | link |
2023-01-24 | Membership Inference of Diffusion Models | Hailong Hu et.al. | 2301.09956v1 | link |
2023-01-23 | LEGO-Net: Learning Regular Rearrangements of Objects in Rooms | Qiuhong Anna Wei et.al. | 2301.09629v1 | null |
2023-01-23 | Evaluation of Light Collection from Highly Scattering Media using Wavelength-Shifting Fibers | Andrew Wilhelm et.al. | 2301.09608v1 | null |
2023-01-23 | StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis | Axel Sauer et.al. | 2301.09515v1 | link |
2023-01-23 | DIFFormer: Scalable (Graph) Transformers Induced by Energy Constrained Diffusion | Qitian Wu et.al. | 2301.09474v1 | link |
2023-01-19 | Dif-Fusion: Towards High Color Fidelity in Infrared and Visible Image Fusion with Diffusion Models | Jun Yue et.al. | 2301.08072v1 | null |
2023-01-18 | Targeted Image Reconstruction by Sampling Pre-trained Diffusion Model | Jiageng Zheng et.al. | 2301.07557v1 | null |
2023-01-17 | GLIGEN: Open-Set Grounded Text-to-Image Generation | Yuheng Li et.al. | 2301.07093v1 | link |
2023-01-13 | In BLOOM: Creativity and Affinity in Artificial Lyrics and Art | Evan Crothers et.al. | 2301.05402v1 | link |
2023-01-12 | Guiding Text-to-Image Diffusion Model Towards Grounded Generation | Ziyi Li et.al. | 2301.05221v1 | null |
2023-01-12 | Thompson Sampling with Diffusion Generative Prior | Yu-Guan Hsieh et.al. | 2301.05182v1 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-04-02 | Sketch3D: Style-Consistent Guidance for Sketch-to-3D Generation | Wangguandong Zheng et.al. | 2404.01843v1 | null |
2024-04-02 | FashionEngine: Interactive Generation and Editing of 3D Clothed Humans | Tao Hu et.al. | 2404.01655v1 | null |
2024-04-01 | Categorical semiotics: Foundations for Knowledge Integration | Carlos Leandro et.al. | 2404.01526v1 | null |
2024-04-01 | Can Biases in ImageNet Models Explain Generalization? | Paul Gavrikov et.al. | 2404.01509v1 | link |
2024-04-02 | GDA: Generalized Diffusion for Robust Test-time Adaptation | Yun-Yun Tsai et.al. | 2404.00095v2 | null |
2024-03-29 | Optimal Communication for Classic Functions in the Coordinator Model and Beyond | Hossein Esfandiari et.al. | 2403.20307v1 | null |
2024-03-29 | Sketch-to-Architecture: Generative AI-aided Architectural Design | Pengzhi Li et.al. | 2403.20186v1 | null |
2024-03-28 | Dealing with Missing Modalities in Multimodal Recommendation: a Feature Propagation-based Approach | Daniele Malitesta et.al. | 2403.19841v1 | null |
2024-03-28 | TASR: A Novel Trust-Aware Stackelberg Routing Algorithm to Mitigate Traffic Congestion | Doris E. M. Brown et.al. | 2403.19831v1 | null |
2024-03-26 | Neural Attributed Community Search at Billion Scale | Jianwei Wang et.al. | 2403.18874v1 | null |
2024-03-27 | A Path Towards Legal Autonomy: An interoperable and explainable approach to extracting, transforming, loading and computing legal information using large language models, expert systems and Bayesian networks | Axel Constant et.al. | 2403.18537v1 | null |
2024-03-27 | U-Sketch: An Efficient Approach for Sketch to Image Diffusion Models | Ilias Mitsouras et.al. | 2403.18425v1 | null |
2024-03-27 | ECNet: Effective Controllable Text-to-Image Diffusion Models | Sicheng Li et.al. | 2403.18417v1 | null |
2024-03-26 | Search and Society: Reimagining Information Access for Radical Futures | Bhaskar Mitra et.al. | 2403.17901v1 | null |
2024-03-26 | ExpressEdit: Video Editing with Natural Language and Sketching | Bekzat Tilekbay et.al. | 2403.17693v1 | null |
2024-03-26 | Equipping Sketch Patches with Context-Aware Positional Encoding for Graphic Sketch Representation | Sicong Zang et.al. | 2403.17525v1 | null |
2024-03-25 | On Policy Reuse: An Expressive Language for Representing and Executing General Policies that Call Other Policies | Blai Bonet et.al. | 2403.16824v1 | null |
2024-03-25 | CodeS: Natural Language to Code Repository via Multi-Layer Sketch | Daoguang Zan et.al. | 2403.16443v1 | link |
2024-03-24 | Combined Task and Motion Planning Via Sketch Decompositions (Extended Version with Supplementary Material) | Magí Dalmau-Moreno et.al. | 2403.16277v1 | null |
2024-03-22 | Efficiently Estimating Mutual Information Between Attributes Across Tables | Aécio Santos et.al. | 2403.15553v1 | null |
2024-03-22 | Fourier Transform-based Estimators for Data Sketches | Seth Pettie et.al. | 2403.15366v1 | null |
2024-03-25 | Multimodal-Conditioned Latent Diffusion Models for Fashion Image Editing | Alberto Baldrati et.al. | 2403.14828v2 | link |
2024-03-21 | Object-Centric Domain Randomization for 3D Shape Reconstruction in the Wild | Junhyeong Cho et.al. | 2403.14539v1 | null |
2024-03-21 | External Knowledge Enhanced 3D Scene Generation from Sketch | Zijie Wu et.al. | 2403.14121v1 | null |
2024-03-20 | Towards an extension of Fault Trees in the Predictive Maintenance Scenario | Roberta De Fazio et.al. | 2403.13785v1 | null |
2024-03-25 | Diagrammatic Instructions to Specify Spatial Objectives and Constraints with Applications to Mobile Base Placement | Qilin Sun et.al. | 2403.12465v2 | null |
2024-03-18 | Towards a Theory of Pragmatic Information | Edward D. Weinberger et.al. | 2403.12324v1 | null |
2024-03-17 | Stylized Face Sketch Extraction via Generative Prior with Limited Data | Kwan Yun et.al. | 2403.11263v1 | link |
2024-03-16 | RETINAQA : A Knowledge Base Question Answering Model Robust to both Answerable and Unanswerable Questions | Prayushi Faldu et.al. | 2403.10849v1 | null |
2024-03-15 | Animate Your Motion: Turning Still Images into Dynamic Videos | Mingxiao Li et.al. | 2403.10179v1 | null |
2024-03-14 | What Sketch Explainability Really Means for Downstream Tasks | Hmrishav Bandyopadhyay et.al. | 2403.09480v1 | null |
2024-03-14 | SketchINR: A First Look into Sketches as Implicit Neural Representations | Hmrishav Bandyopadhyay et.al. | 2403.09344v1 | null |
2024-03-14 | Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset | Hugo Laurençon et.al. | 2403.09029v1 | null |
2024-03-13 | ARtVista: Gateway To Empower Anyone Into Artist | Trong-Vu Hoang et.al. | 2403.08876v1 | null |
2024-03-13 | HAIFIT: Human-Centered AI for Fashion Image Translation | Jianan Jiang et.al. | 2403.08651v1 | link |
2024-03-13 | Sketch2Manga: Shaded Manga Screening from Sketch with Diffusion Models | Jian Lin et.al. | 2403.08266v1 | null |
2024-03-12 | It's All About Your Sketch: Democratising Sketch Control in Diffusion Models | Subhadeep Koley et.al. | 2403.07234v1 | link |
2024-03-12 | You'll Never Walk Alone: A Sketch and Text Duet for Fine-Grained Image Retrieval | Subhadeep Koley et.al. | 2403.07222v1 | null |
2024-03-12 | Text-to-Image Diffusion Models are Great Sketch-Photo Matchmakers | Subhadeep Koley et.al. | 2403.07214v1 | null |
2024-03-11 | How to Handle Sketch-Abstraction in Sketch-Based Image Retrieval? | Subhadeep Koley et.al. | 2403.07203v1 | null |
2024-03-11 | Enhancing Image Caption Generation Using Reinforcement Learning with Human Feedback | Adarsh N L et.al. | 2403.06735v1 | null |
2024-03-08 | Data-Dependent LSH for the Earth Mover's Distance | Rajesh Jayaram et.al. | 2403.05041v1 | null |
2024-03-07 | A challenge in A(G)I, cybernetics revived in the Ouroboros Model as one algorithm for all thinking | Knud Thomsen et.al. | 2403.04292v1 | null |
2024-03-06 | NoiseCollage: A Layout-Aware Text-to-Image Diffusion Model Based on Noise Cropping and Merging | Takahiro Shirakawa et.al. | 2403.03485v1 | link |
2024-03-07 | DLP-GAN: learning to draw modern Chinese landscape photos with generative adversarial network | Xiangquan Gui et.al. | 2403.03456v2 | null |
2024-03-05 | SmartSantander: IoT Experimentation over a Smart City Testbed | Luis Sanchez et.al. | 2403.03196v1 | null |
2024-03-05 | CoGenesis: A Framework Collaborating Large and Small Language Models for Secure Context-Aware Instruction Following | Kaiyan Zhang et.al. | 2403.03129v1 | null |
2024-03-05 | RT-Sketch: Goal-Conditioned Imitation Learning from Hand-Drawn Sketches | Priya Sundaresan et.al. | 2403.02709v1 | null |
2024-03-02 | Euclidean distance compression via deep random features | Brett Leroux et.al. | 2403.01327v1 | null |
2024-02-29 | CoMeT: Count-Min-Sketch-based Row Tracking to Mitigate RowHammer at Low Cost | F. Nisa Bostanci et.al. | 2402.18769v1 | link |
2024-02-28 | DynaWarp -- Efficient, large-scale log storage and retrieval | Julian Reichinger et.al. | 2402.18355v1 | null |
2024-02-28 | Block and Detail: Scaffolding Sketch-to-Image Generation | Vishnu Sarukkai et.al. | 2402.18116v1 | null |
2024-02-27 | Decremental |
Deeksha Adil et.al. | 2402.17929v1 | null |
2024-02-27 | Surgment: Segmentation-enabled Semantic Search and Creation of Visual Question and Feedback to Support Video-Based Surgery Learning | Jingying Wang et.al. | 2402.17903v1 | null |
2024-02-27 | CAD-SIGNet: CAD Language Inference from Point Clouds using Layer-wise Sketch Instance Guided Attention | Mohammad Sadil Khan et.al. | 2402.17678v1 | null |
2024-02-27 | CustomSketching: Sketch Concept Extraction for Sketch-based Image Synthesis and Editing | Chufeng Xiao et.al. | 2402.17624v1 | null |
2024-02-27 | Equivariant ideals of polynomials | Arka Ghosh et.al. | 2402.17604v1 | null |
2024-02-25 | Convolution and Cross-Correlation of Count Sketches Enables Fast Cardinality Estimation of Multi-Join Queries | Mike Heddes et.al. | 2402.15953v1 | link |
2024-02-23 | Genie: Generative Interactive Environments | Jake Bruce et.al. | 2402.15391v1 | null |
2024-02-22 | Semantic Image Synthesis with Unconditional Generator | Jungwoo Chae et.al. | 2402.14395v1 | null |
2024-02-21 | Sketching AI Concepts with Capabilities and Examples: AI Innovation in the Intensive Care Unit | Nur Yildirim et.al. | 2402.13437v1 | null |
2024-02-20 | Quantitative causality, causality-guided scientific discovery, and causal machine learning | X. San Liang et.al. | 2402.13427v1 | null |
2024-02-20 | Almost-Tight Bounds on Preserving Cuts in Classes of Submodular Hypergraphs | Sanjeev Khanna et.al. | 2402.13151v1 | null |
2024-02-17 | Be Persistent: Towards a Unified Solution for Mitigating Shortcuts in Deep Learning | Hadi M. Dolatabadi et.al. | 2402.11237v1 | null |
2024-02-17 | Automated Optimization of Parameterized Data-Plane Programs with Parasol | Mary Hogan et.al. | 2402.11155v1 | null |
2024-02-13 | Sampling Space-Saving Set Sketches | Homin K. Lee et.al. | 2402.08604v1 | link |
2024-02-13 | One-to-many Reconstruction of 3D Geometry of cultural Artifacts using a synthetically trained Generative Model | Thomas Pöllabauer et.al. | 2402.08310v1 | null |
2024-02-13 | Epistemic Power, Objectivity and Gender in AI Ethics Labor: Legitimizing Located Complaints | David Gray Widder et.al. | 2402.08171v1 | null |
2024-02-13 | Randomized Algorithms for Symmetric Nonnegative Matrix Factorization | Koby Hayashi et.al. | 2402.08134v1 | null |
2024-02-10 | Guided Sketch-Based Program Induction by Search Gradients | Ahmad Ayaz Amin et.al. | 2402.06990v1 | null |
2024-02-09 | Squidgets: Sketch-based Widget Design and Direct Manipulation of 3D Scene | Joonho Kim et.al. | 2402.06795v1 | null |
2024-02-08 | InkSight: Offline-to-Online Handwriting Conversion by Learning to Read and Write | Blagoj Mitrevski et.al. | 2402.05804v1 | null |
2024-02-08 | A Concept for Reconstructing Stucco Statues from historic Sketches using synthetic Data only | Thomas Pöllabauer et.al. | 2402.05593v1 | null |
2024-02-06 | Gradient Sketches for Training Data Attribution and Studying the Loss Landscape | Andrea Schioppa et.al. | 2402.03994v1 | null |
2024-02-06 | 3Doodle: Compact Abstraction of Objects with 3D Strokes | Changwoon Choi et.al. | 2402.03690v1 | null |
2024-02-05 | Computing Generic Fibres of Polynomial Ideals with FGLM and Hensel Lifting | Jérémy Berthomieu et.al. | 2402.03144v1 | null |
2024-02-03 | Zero-shot sketch-based remote sensing image retrieval based on multi-level and attention-guided tokenization | Bo Yang et.al. | 2402.02141v1 | link |
2024-02-02 | Solitons, dispersive shock waves and Noel Fredrick Smyth | Saleh Baqer et.al. | 2402.01332v1 | null |
2024-02-01 | Deep Robot Sketching: An application of Deep Q-Learning Networks for human-like sketching | Raul Fernandez-Fernandez et.al. | 2402.00676v1 | null |
2024-02-01 | High-Quality Medical Image Generation from Free-hand Sketch | Quan Huu Cap et.al. | 2402.00353v1 | null |
2024-01-31 | On The Power of Subtle Expressive Cues in the Perception of Human Affects | Ezgi Dede et.al. | 2401.18013v1 | null |
2024-02-04 | Fine-Grained Zero-Shot Learning: Advances, Challenges, and Prospects | Jingcai Guo et.al. | 2401.17766v2 | link |
2024-01-31 | Estimating Diffusion Degree on Graph Streams | Vinit Ramesh Gore et.al. | 2401.17611v1 | null |
2024-01-31 | Topology-Aware Latent Diffusion for 3D Shape Generation | Jiangbei Hu et.al. | 2401.17603v1 | null |
2024-01-29 | FPGA Technology Mapping Using Sketch-Guided Program Synthesis | Gus Henry Smith et.al. | 2401.16526v1 | null |
2024-01-29 | Bridging Generative and Discriminative Models for Unified Visual Perception with Diffusion Priors | Shiyin Dong et.al. | 2401.16459v1 | null |
2024-01-25 | Incremental Proof Development in Dafny with Module-Based Induction | Son Ho et.al. | 2401.16233v1 | null |
2024-01-26 | Sketch and Refine: Towards Fast and Accurate Lane Detection | Chao Chen et.al. | 2401.14729v1 | link |
2024-01-27 | Sketch2NeRF: Multi-view Sketch-guided Text-to-3D Generation | Minglin Chen et.al. | 2401.14257v2 | null |
2024-01-22 | PatternPortrait: Draw Me Like One of Your Scribbles | Sabine Wieluch et.al. | 2401.13001v1 | null |
2024-01-22 | Automated Completion of Statements and Proofs in Synthetic Geometry: an Approach based on Constraint Solving | Salwa Tabet Gonzalez et.al. | 2401.11898v1 | null |
2024-01-18 | Sketch-Guided Constrained Decoding for Boosting Blackbox Large Language Models without Logit Access | Saibo Geng et.al. | 2401.09967v1 | null |
2024-01-21 | Towards Identifiable Unsupervised Domain Translation: A Diversified Distribution Matching Approach | Sagar Shrestha et.al. | 2401.09671v2 | null |
2024-01-12 | Masked Attribute Description Embedding for Cloth-Changing Person Re-identification | Chunlei Peng et.al. | 2401.05646v2 | link |
2024-01-11 | DrawTalking: Building Interactive Worlds by Sketching and Speaking | Karl Toby Rosenberg et.al. | 2401.05631v1 | null |
2024-01-10 | Modality-Aware Representation Learning for Zero-shot Sketch-based Image Retrieval | Eunyi Lyou et.al. | 2401.04860v1 | null |
2024-01-09 | Content-Conditioned Generation of Stylized Free hand Sketches | Jiajun Liu et.al. | 2401.04739v1 | null |
2024-01-09 | Representative Feature Extraction During Diffusion Process for Sketch Extraction with One Example | Kwan Yun et.al. | 2401.04362v1 | null |
2024-01-08 | Flowmind2Digital: The First Comprehensive Flowmind Recognition and Conversion Approach | Huanyu Liu et.al. | 2401.03742v1 | link |
2024-01-05 | FedNS: A Fast Sketching Newton-Type Algorithm for Federated Learning | Jian Li et.al. | 2401.02734v1 | link |
2024-01-02 | ColorizeDiffusion: Adjustable Sketch Colorization with Reference Image and Text | Dingkun Yan et.al. | 2401.01456v1 | link |
2024-01-01 | Free-form Shape Modeling in XR: A Systematic Review | Shounak Chatterjee et.al. | 2401.00924v1 | null |
2024-01-01 | DiffMorph: Text-less Image Morphing with Diffusion Models | Shounak Chatterjee et.al. | 2401.00739v1 | null |
2023-12-31 | SynCDR : Training Cross Domain Retrieval Models with Synthetic Data | Samarth Mishra et.al. | 2401.00420v1 | link |
2023-12-31 | Multi-Granularity Representation Learning for Sketch-based Dynamic Face Image Retrieval | Liang Wang et.al. | 2401.00371v1 | link |
2023-12-28 | A randomized algorithm to solve reduced rank operator regression | Giacomo Turri et.al. | 2312.17348v1 | link |
2024-01-03 | SVGDreamer: Text Guided SVG Generation with Diffusion Model | Ximing Xing et.al. | 2312.16476v2 | link |
2023-12-22 | Generative AI and the History of Architecture | Joern Ploennigs et.al. | 2312.15106v1 | null |
2023-12-22 | A Modular Approach to Metatheoretic Reasoning for Extensible Languages | Dawn Michaelson et.al. | 2312.14374v1 | null |
2023-12-21 | On the Hardness of Analyzing Quantum Programs Quantitatively | Martin Avanzini et.al. | 2312.13657v1 | null |
2023-12-18 | Open Vocabulary Semantic Scene Sketch Understanding | Ahmed Bourouis et.al. | 2312.12463v1 | null |
2023-12-19 | Sketch Vision: Artificial Intelligence with Sight for Imagination | Demircan Tas et.al. | 2312.12270v1 | null |
2023-12-19 | Brush Your Text: Synthesize Any Scene Text on Images via Diffusion Model | Lingjun Zhang et.al. | 2312.12232v1 | link |
2023-12-19 | CreativeConnect: Supporting Reference Recombination for Graphic Design Ideation with Generative AI | DaEun Choi et.al. | 2312.11949v1 | null |
2023-12-16 | Symmetrical Bidirectional Knowledge Alignment for Zero-Shot Sketch-Based Image Retrieval | Decheng Liu et.al. | 2312.10320v1 | link |
2023-12-15 | Sketch and shift: a robust decoder for compressive clustering | Ayoub Belhadji et.al. | 2312.09940v1 | null |
2023-12-15 | Structural Information Guided Multimodal Pre-training for Vehicle-centric Perception | Xiao Wang et.al. | 2312.09812v1 | link |
2023-12-14 | Matching Noisy Keys for Obfuscation | Charlie Dickens et.al. | 2312.08981v1 | null |
2023-12-14 | Solving Dense Linear Systems Faster than via Preconditioning | Michał Dereziński et.al. | 2312.08893v1 | null |
2023-12-13 | Enhance Sketch Recognition's Explainability via Semantic Component-Level Parsing | Guangming Zhu et.al. | 2312.07875v1 | link |
2023-12-12 | Improved Frequency Estimation Algorithms with and without Predictions | Anders Aamand et.al. | 2312.07535v1 | null |
2023-12-09 | BARET : Balanced Attention based Real image Editing driven by Target-text Inversion | Yuming Qiao et.al. | 2312.05482v1 | null |
2023-12-07 | Optimal Multi-Pass Lower Bounds for MST in Dynamic Streams | Sepehr Assadi et.al. | 2312.04674v1 | null |
2023-12-07 | Deep3DSketch: 3D modeling from Free-hand Sketches with View- and Structural-Aware Adversarial Training | Tianrun Chen et.al. | 2312.04435v1 | null |
2023-12-07 | DemoCaricature: Democratising Caricature Generation with a Rough Sketch | Dar-Yen Chen et.al. | 2312.04364v1 | null |
2023-12-07 | Doodle Your 3D: From Abstract Freehand Sketches to Precise 3D Shapes | Hmrishav Bandyopadhyay et.al. | 2312.04043v1 | null |
2023-12-06 | CAFE: Towards Compact, Adaptive, and Fast Embedding for Large-scale Recommendation Models | Hailin Zhang et.al. | 2312.03256v1 | link |
2023-12-05 | SEVA: Leveraging sketches to evaluate alignment between human and machine visual abstraction | Kushin Mukherjee et.al. | 2312.03035v1 | link |
2023-12-08 | FreestyleRet: Retrieving Images from Style-Diversified Queries | Hao Li et.al. | 2312.02428v2 | link |
2023-12-04 | CLIPDrawX: Primitive-based Explanations for Text Guided Sketch Synthesis | Nityanand Mathur et.al. | 2312.02345v1 | null |
2023-12-03 | Uncertainty-biased molecular dynamics for learning uniformly accurate interatomic potentials | Viktor Zaverkin et.al. | 2312.01416v1 | null |
2023-11-30 | Sketch Input Method Editor: A Comprehensive Dataset and Methodology for Systematic Input Recognition | Guangming Zhu et.al. | 2311.18254v1 | link |
2023-11-29 | Analyzing Query Optimizer Performance in the Presence and Absence of Cardinality Estimates | Asoke Datta et.al. | 2311.17293v1 | null |
2023-11-28 | Time- and Communication-Efficient Overlay Network Construction via Gossip | Fabien Dufoulon et.al. | 2311.17115v1 | null |
2023-11-28 | SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models | Yuwei Guo et.al. | 2311.16933v1 | null |
2023-11-28 | ContextSeg: Sketch Semantic Segmentation by Querying the Context with Attention | Jiawei Wang et.al. | 2311.16682v1 | null |
2023-11-28 | Text-Driven Image Editing via Learnable Regions | Yuanze Lin et.al. | 2311.16432v1 | link |
2023-11-27 | MAST: Model-Agnostic Sparsified Training | Yury Demidovich et.al. | 2311.16086v1 | link |
2023-11-26 | Sketch Video Synthesis | Yudian Zheng et.al. | 2311.15306v1 | link |
2023-11-25 | A unified framework for learning with nonlinear model classes from arbitrary linear samples | Ben Adcock et.al. | 2311.14886v1 | null |
2023-11-24 | Data-to-Text Bilingual Generation | Guy Lapalme et.al. | 2311.14808v1 | link |
2023-11-24 | One Pass Streaming Algorithm for Super Long Token Attention Approximation in Sublinear Space | Raghav Addanki et.al. | 2311.14652v1 | null |
2023-11-21 | Breathing Life Into Sketches Using Text-to-Video Priors | Rinon Gal et.al. | 2311.13608v1 | null |
2023-11-22 | Adaptive Sampling for Deep Learning via Efficient Nonparametric Proxies | Shabnam Daghaghi et.al. | 2311.13583v1 | null |
2023-11-21 | From Concept to Manufacturing: Evaluating Vision-Language Models for Engineering Design | Cyril Picard et.al. | 2311.12668v1 | null |
2023-11-19 | AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort | Wen Wang et.al. | 2311.11243v1 | null |
2023-11-17 | Scaling TabPFN: Sketching and Feature Selection for Tabular Prior-Data Fitted Networks | Benjamin Feuer et.al. | 2311.10609v1 | null |
2023-11-09 | Chain of Images for Intuitively Reasoning | Fanxu Meng et.al. | 2311.09241v1 | link |
2023-11-14 | Language and Sketching: An LLM-driven Interactive Multimodal Multitask Robot Navigation Framework | Weiqin Zu et.al. | 2311.08244v1 | null |
2023-11-13 | Fast and Space-Efficient Parallel Algorithms for Influence Maximization | Letong Wang et.al. | 2311.07554v1 | link |
2023-11-13 | Sketch-based Video Object Segmentation: Benchmark and Analysis | Ruolin Yang et.al. | 2311.07261v1 | null |
2023-11-09 | General Policies, Subgoal Structure, and Planning Width | Blai Bonet et.al. | 2311.05490v1 | null |
2023-11-09 | Control3D: Towards Controllable Text-to-3D Generation | Yang Chen et.al. | 2311.05461v1 | null |
2023-11-08 | Prompt Sketching for Large Language Models | Luca Beurer-Kellner et.al. | 2311.04954v1 | null |
2023-11-07 | DeepPatent2: A Large-Scale Benchmarking Corpus for Technical Drawing Understanding | Kehinde Ajayi et.al. | 2311.04098v1 | link |
2023-11-06 | Sketching methods with small window guarantee using minimum decycling sets | Guillaume Marçais et.al. | 2311.03592v1 | link |
2023-11-05 | Sketching Multidimensional Time Series for Fast Discord Mining | Chin-Chia Michael Yeh et.al. | 2311.03393v1 | null |
2023-11-03 | Neural Collage Transfer: Artistic Reconstruction via Material Manipulation | Ganghun Lee et.al. | 2311.02202v1 | link |
2023-11-06 | RT-Trajectory: Robotic Task Generalization via Hindsight Trajectory Sketches | Jiayuan Gu et.al. | 2311.01977v2 | null |
2023-11-03 | Hardness of Low Rank Approximation of Entrywise Transformed Matrix Products | Tamas Sarlos et.al. | 2311.01960v1 | null |
2023-11-03 | Towards Concept-Aware Large Language Models | Chen Shani et.al. | 2311.01866v1 | link |
2023-11-07 | inkn'hue: Enhancing Manga Colorization from Multiple Priors with Alignment Multi-Encoder VAE | Tawin Jiramahapokee et.al. | 2311.01804v2 | link |
2023-10-31 | Progress and outlook on advanced fly scans based on Mamba | Peng-Cheng Li et.al. | 2310.20106v1 | link |
2023-10-30 | The Expressibility of Polynomial based Attention Scheme | Zhao Song et.al. | 2310.20051v1 | null |
2023-10-29 | Sketching Algorithms for Sparse Dictionary Learning: PTAS and Turnstile Streaming | Gregory Dexter et.al. | 2310.19068v1 | null |
2023-10-29 | Customize StyleGAN with One Hand Sketch | Shaocong Zhang et.al. | 2310.18949v1 | null |
2023-10-28 | Deep3DSketch+: Obtaining Customized 3D Model by Single Free-Hand Sketch through Deep Learning | Ying Zang et.al. | 2310.18609v1 | null |
2023-10-27 | Deep3DSketch++: High-Fidelity 3D Modeling from Single Free-hand Sketches | Ying Zang et.al. | 2310.18178v1 | null |
2023-10-27 | Reality3DSketch: Rapid 3D Modeling of Objects from Single Freehand Sketches | Tianrun Chen et.al. | 2310.18148v1 | null |
2023-10-27 | On General Language Understanding | David Schlangen et.al. | 2310.18038v1 | null |
2023-10-27 | Sketching and Streaming for Dictionary Compression | Ruben Becker et.al. | 2310.17980v1 | null |
2023-10-26 | Skill-Mix: a Flexible and Expandable Family of Evaluations for AI models | Dingli Yu et.al. | 2310.17567v1 | null |
2023-10-24 | Emergent Communication in Interactive Sketch Question Answering | Zixing Lei et.al. | 2310.15597v1 | link |
2023-10-24 | Fast multiplication of random dense matrices with fixed sparse matrices | Tianyu Liang et.al. | 2310.15419v1 | link |
2023-10-18 | A Comprehensive Survey on Vector Database: Storage and Retrieval Technique, Challenge | Yikun Han et.al. | 2310.11703v1 | null |
2023-10-17 | Matrix Compression via Randomized Low Rank and Low Precision Factorization | Rajarshi Saha et.al. | 2310.11028v1 | link |
2023-10-16 | HairCLIPv2: Unifying Hair Editing via Proxy Feature Blending | Tianyi Wei et.al. | 2310.10651v1 | link |
2023-10-16 | Visual Data-Type Understanding does not emerge from Scaling Vision-Language Models | Vishaal Udandarao et.al. | 2310.08577v2 | link |
2023-10-12 | Visualizing a Nondeterministic to Deterministic Finite-State Machine Transformation | Tijana Minic et.al. | 2310.08248v1 | link |
2023-10-11 | On |
Yu Chen et.al. | 2310.07857v1 | null |
2023-10-10 | SketchBodyNet: A Sketch-Driven Multi-faceted Decoder Network for 3D Human Reconstruction | Fei Wang et.al. | 2310.06577v1 | link |
2023-10-15 | HyperLips: Hyper Control Lips with High Resolution Decoder for Talking Face Generation | Yaosen Chen et.al. | 2310.05720v3 | link |
2023-10-09 | Logic-guided Deep Reinforcement Learning for Stock Trading | Zhiming Li et.al. | 2310.05551v1 | null |
2023-10-08 | Transforming Pixels into a Masterpiece: AI-Powered Art Restoration using a Novel Distributed Denoising CNN (DDCNN) | Sankar B. et.al. | 2310.05270v1 | null |
2023-10-06 | Hanging in there: Prenatal origins of antigravity homeostasis in humans | Nicholas M. Wilkinson et.al. | 2310.04168v1 | null |
2023-10-06 | Deterministic Clustering in High Dimensional Spaces: Sketches and Approximation | Vincent Cohen-Addad et.al. | 2310.04076v1 | null |
2023-10-05 | Matrix Completion from One-Bit Dither Samples | Arian Eamaz et.al. | 2310.03224v1 | null |
2023-10-04 | Streaming Euclidean |
Vincent Cohen-Addad et.al. | 2310.02882v1 | null |
2023-10-04 | On the tilt of the Earth's polar axis ( |
V. Courtillot et.al. | 2310.02768v1 | null |
2023-10-03 | View-Independent Adjoint Light Tracing for Lighting Design Optimization | Lukas Lipp et.al. | 2310.02043v1 | null |
2023-10-03 | Randomized Dimension Reduction with Statistical Guarantees | Yijun Dong et.al. | 2310.01739v1 | null |
2023-10-02 | PolySketchFormer: Fast Transformers via Sketches for Polynomial Kernels | Praneeth Kacham et.al. | 2310.01655v1 | null |
2023-09-29 | Toward Operationalizing Pipeline-aware ML Fairness: A Research Agenda for Developing Practical Guidelines and Tools | Emily Black et.al. | 2309.17337v1 | null |
2023-09-28 | Sketch2CADScript: 3D Scene Reconstruction from 2D Sketch using Visual Transformer and Rhino Grasshopper | Hong-Bin Yang et.al. | 2309.16850v1 | null |
2023-09-28 | Multi-Modal Financial Time-Series Retrieval Through Latent Space Projections | Tom Bamford et.al. | 2309.16741v1 | null |
2023-09-28 | Language models in molecular discovery | Nikita Janakarajan et.al. | 2309.16235v1 | null |
2023-10-01 | Sampling Methods for Inner Product Sketching | Majid Daliri et.al. | 2309.16157v2 | link |
2023-09-27 | Fast Locality Sensitive Hashing with Theoretical Guarantee | Zongyuan Tan et.al. | 2309.15479v1 | null |
2023-09-25 | Guess & Sketch: Language Model Guided Transpilation | Celine Lee et.al. | 2309.14396v1 | null |
2023-09-22 | Deep3DSketch+: Rapid 3D Modeling from Single Free-hand Sketches | Tianrun Chen et.al. | 2309.13006v1 | null |
2023-09-22 | Visualization According to Statisticians: An Interview Study on the Role of Visualization for Inferential Statistics | Eric Newburger et.al. | 2309.12684v1 | null |
2023-09-22 | Towards medhub: A Self-Service Platform for Analysts and Physicians | Markus Höhn et.al. | 2309.11234v2 | null |
2023-09-20 | An Empirical Study of Malicious Code In PyPI Ecosystem | Wenbo Guo et.al. | 2309.11021v1 | link |
2023-09-19 | An overview of some mathematical techniques and problems linking 3D vision to 3D printing | Emiliano Cristiani et.al. | 2309.10549v1 | null |
2023-09-19 | Learning Orbitally Stable Systems for Diagrammatically Teaching | Weiming Zhi et.al. | 2309.10298v1 | null |
2023-09-18 | Completeness Thresholds for Memory Safety: Unbounded Guarantees via Bounded Proofs (Extended Abstract) | Tobias Reinhard et.al. | 2309.09731v1 | null |
2023-09-18 | Applying Security Testing Techniques to Automotive Engineering | Irdin Pekaric et.al. | 2309.09647v1 | null |
2023-09-15 | Active Learning for Fine-Grained Sketch-Based Image Retrieval | Himanshu Thakur et.al. | 2309.08743v1 | null |
2023-09-15 | Beyond Domain Gap: Exploiting Subjectivity in Sketch-Based Person Retrieval | Kejun Lin et.al. | 2309.08372v1 | link |
2023-09-14 | Landscape-Sketch-Step: An AI/ML-Based Metaheuristic for Surrogate Optimization Problems | Rafael Monteiro et.al. | 2309.07936v1 | link |
2023-09-12 | Grounded Language Acquisition From Object and Action Imagery | James Robert Kubricht et.al. | 2309.06335v1 | null |
2023-09-12 | OmniSketch: Efficient Multi-Dimensional High-Velocity Stream Analytics with Arbitrary Predicates | Wieger R. Punter et.al. | 2309.06051v1 | null |
2023-09-12 | GA-Sketching: Shape Modeling from Multi-View Sketching with Geometry-Aligned Deep Implicit Functions | Jie Zhou et.al. | 2309.05946v1 | link |
2023-09-11 | Photodetachment dynamics using nonlocal dicrete-state-in-continuum model | Martin Čížek et.al. | 2309.05830v1 | null |
2023-09-10 | Streaming Semidefinite Programs: |
Zhao Song et.al. | 2309.05135v1 | null |
2023-09-08 | Receiving an algorithmic recommendation based on documentary filmmaking techniques | Samuel Gantier et.al. | 2309.04184v1 | null |
2023-09-07 | Learning from Demonstration via Probabilistic Diagrammatic Teaching | Weiming Zhi et.al. | 2309.03835v1 | null |
2023-09-07 | Adjacency Sketches in Adversarial Environments | Moni Naor et.al. | 2309.03728v1 | null |
2023-09-06 | An Evaluation of Software Sketches | Roy Friedman et.al. | 2309.03045v1 | null |
2023-09-03 | Business Process Text Sketch Automation Generation Using Large Language Model | Rui Zhu et.al. | 2309.01071v1 | null |
2023-09-02 | Online Adaptive Mahalanobis Distance Estimation | Lianke Qin et.al. | 2309.01030v1 | null |
2023-09-01 | Randomized Polar Codes for Anytime Distributed Machine Learning | Burak Bartan et.al. | 2309.00682v1 | null |
2023-09-01 | Human-Inspired Facial Sketch Synthesis with Dynamic Adaptation | Fei Gao et.al. | 2309.00216v1 | link |
2023-08-31 | Terrain Diffusion Network: Climatic-Aware Terrain Generation with Geological Sketch Guidance | Zexin Hu et.al. | 2308.16725v1 | null |
2023-08-30 | Surrogate-based Autotuning for Randomized Sketching Algorithms in Regression Problems | Younghyun Cho et.al. | 2308.15720v1 | null |
2023-08-27 | SketchDreamer: Interactive Text-Augmented Creative Sketch Ideation | Zhiyu Qu et.al. | 2308.14191v1 | link |
2023-08-25 | WorldSmith: Iterative and Expressive Prompting for World Building with a Generative AI | Hai Dang et.al. | 2308.13355v1 | null |
2023-08-25 | Bridging the Gap: Fine-to-Coarse Sketch Interpolation Network for High-Quality Animation Sketch Inbetweening | Jiaming Shen et.al. | 2308.13273v1 | null |
2023-08-21 | Geo-Sketcher: Rapid 3D Geological Modeling using Geological and Topographic Map Sketches | Ronan Amorim et.al. | 2308.12152v1 | null |
2023-08-24 | Bayesian Learning for Dynamic Target Localization with Human-provided Spatial Information | Min-Won Seo et.al. | 2308.11839v2 | null |
2023-08-22 | MatFuse: Controllable Material Generation with Diffusion Models | Giuseppe Vecchio et.al. | 2308.11408v1 | link |
2023-08-22 | Minwise-Independent Permutations with Insertion and Deletion of Features | Rameshwar Pratap et.al. | 2308.11240v1 | null |
2023-08-28 | Large Language Models for Software Engineering: A Systematic Literature Review | Xinyi Hou et.al. | 2308.10620v2 | null |
2023-08-16 | Freedom of Speech and AI Output | Eugene Volokh et.al. | 2308.08673v1 | null |
2023-08-16 | Painter: Teaching Auto-regressive Language Models to Draw Sketches | Reza Pourreza et.al. | 2308.08520v1 | null |
2023-08-15 | Inversion-by-Inversion: Exemplar-based Sketch-to-Photo Synthesis via Stochastic Differential Equations without Training | Ximing Xing et.al. | 2308.07665v1 | link |
2023-08-11 | Masked-Attention Diffusion Guidance for Spatially Controlling Text-to-Image Generation | Yuki Endo et.al. | 2308.06027v1 | link |
2023-08-11 | Uncertainty-Aware Cross-Modal Transfer Network for Sketch-Based 3D Shape Retrieval | Yiyang Cai et.al. | 2308.05948v1 | null |
2023-08-20 | The Fast and the Private: Task-based Dataset Search | Zezhou Huang et.al. | 2308.05637v2 | null |
2023-08-12 | LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation | Leigang Qu et.al. | 2308.05095v2 | null |
2023-08-10 | Apple Vision Pro for Healthcare: "The Ultimate Display"? -- Entering the Wonderland of Precision | Jan Egger et.al. | 2308.04313v3 | null |
2023-08-08 | Iterative Sketching for Secure Coded Regression | Neophytos Charalambides et.al. | 2308.04185v1 | null |
2023-08-06 | Gradient Coding through Iterative Block Leverage Score Sampling | Neophytos Charalambides et.al. | 2308.03096v1 | null |
2023-08-05 | Sketch and Text Guided Diffusion Model for Colored Point Cloud Generation | Zijie Wu et.al. | 2308.02874v1 | null |
2023-08-07 | SoK: The Ghost Trilemma | S. Mukherjee et.al. | 2308.02202v2 | null |
2023-08-07 | BEVControl: Accurately Controlling Street-view Elements with Multi-perspective Consistency via BEV Sketch Layout | Kairui Yang et.al. | 2308.01661v3 | null |
2023-08-03 | PPI-NET: End-to-End Parametric Primitive Inference | Liang Wang et.al. | 2308.01521v1 | null |
2023-08-01 | Neural approximation of Wasserstein distance via a universal architecture for symmetric and factorwise group invariant functions | Samantha Chen et.al. | 2308.00273v1 | null |
2023-08-01 | CONSTRUCT: A Program Synthesis Approach for Reconstructing Control Algorithms from Embedded System Binaries in Cyber-Physical Systems | Ali Shokri et.al. | 2308.00250v1 | null |
2023-07-30 | RealityCanvas: Augmented Reality Sketching for Embedded and Responsive Scribble Animation Effects | Zhijie Xia et.al. | 2307.16116v1 | link |
2023-07-25 | Federated Heavy Hitter Recovery under Linear Sketching | Adria Gascon et.al. | 2307.13347v1 | null |
2023-07-24 | Learning Dense Correspondences between Photos and Sketches | Xuanchen Lu et.al. | 2307.12967v1 | null |
2023-07-18 | Semi-supervised Cycle-GAN for face photo-sketch translation in the wild | Chaofeng Chen et.al. | 2307.10281v1 | null |
2023-07-14 | Volumetric Wireframe Parsing from Neural Attraction Fields | Nan Xue et.al. | 2307.10206v1 | link |
2023-07-17 | Multi-Domain Learning with Modulation Adapters | Ekaterina Iakovleva et.al. | 2307.08528v1 | null |
2023-07-16 | InkSight: Leveraging Sketch Interaction for Documenting Chart Findings in Computational Notebooks | Yanna Lin et.al. | 2307.07922v1 | null |
2023-07-13 | Connectivity Labeling for Multiple Vertex Failures | Merav Parter et.al. | 2307.06276v2 | null |
2023-07-10 | Some Preliminary Steps Towards Metaverse Logic | Antonio L. Furtado et.al. | 2307.05574v1 | null |
2023-07-11 | A "Game of Like" : Online Social Network Sharing As Strategic Interaction | Emmanuel J. Genot et.al. | 2307.05063v1 | null |
2023-07-11 | Diffusion idea exploration for art generation | Nikhil Verma et.al. | 2307.04978v1 | null |
2023-07-08 | Sketch-A-Shape: Zero-Shot Sketch-to-3D Shape Generation | Aditya Sanghi et.al. | 2307.03869v1 | null |
2023-07-06 | Wireless Multi-Agent Generative AI: From Connected Intelligence to Collective Intelligence | Hang Zou et.al. | 2307.02757v1 | null |
2023-07-04 | Text + Sketch: Image Compression at Ultra Low Rates | Eric Lei et.al. | 2307.01944v1 | link |
2023-07-03 | Digital Twin-Empowered Communications: A New Frontier of Wireless Networks | Lina Bariah et.al. | 2307.00973v1 | null |
2023-07-04 | SketchMetaFace: A Learning-based Sketching Interface for High-fidelity 3D Character Face Modeling | Zhongjin Luo et.al. | 2307.00804v2 | null |
2023-06-27 | Cartesian institutions with evidence: Data and system modelling with diagrammatic constraints and generalized sketches | Zinovy Diskin et.al. | 2306.16284v1 | null |
2023-06-26 | Towards Optimal Effective Resistance Estimation | Rajat Vadiraj Dwaraknath et.al. | 2306.14820v1 | null |
2023-06-26 | DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models | Ximing Xing et.al. | 2306.14685v1 | link |
2023-06-25 | ALBUS: a Probabilistic Monitoring Algorithm to Counter Burst-Flood Attacks | Simon Scherrer et.al. | 2306.14328v1 | null |
2023-06-24 | Full Automation of Goal-driven LLM Dialog Threads with And-Or Recursors and Refiner Oracles | Paul Tarau et.al. | 2306.14077v1 | link |
2023-06-21 | PrivSketch: A Private Sketch-based Frequency Estimation Protocol for Data Streams | Ying Li et.al. | 2306.12144v1 | null |
2023-06-20 | Computing a human-like reaction time metric from stable recurrent vision models | Lore Goetschalckx et.al. | 2306.11582v1 | null |
2023-06-23 | 3D VR Sketch Guided 3D Shape Prototyping and Exploration | Ling Luo et.al. | 2306.10830v2 | link |
2023-06-19 | Shape Guided Gradient Voting for Domain Generalization | Jiaqi Xu et.al. | 2306.10809v1 | null |
2023-06-15 | Private Federated Frequency Estimation: Adapting to the Hardness of the Instance | Jingfeng Wu et.al. | 2306.09396v1 | null |
2023-06-15 | Conditional Human Sketch Synthesis with Explicit Abstraction Control | Dar-Yen Chen et.al. | 2306.09274v1 | null |
2023-06-15 | Behaviorally Typed State Machines in TypeScript for Heterogeneous Swarms | Roland Kuhn et.al. | 2306.09068v1 | link |
2023-06-15 | Interleaving Pre-Trained Language Models and Large Language Models for Zero-Shot NL2SQL Generation | Zihui Gu et.al. | 2306.08891v1 | link |
2023-06-14 | Zero-Shot 3D Shape Sketch View Similarity and Retrieval | Gianluca Berardi et.al. | 2306.08541v1 | null |
2023-06-14 | Probing the unfolded configurations of a |
Albert Ardevol et.al. | 2306.08429v1 | null |
2023-06-14 | CLIPXPlore: Coupled CLIP and Shape Spaces for 3D Shape Exploration | Jingyu Hu et.al. | 2306.08226v1 | null |
2023-06-13 | AniFaceDrawing: Anime Portrait Exploration during Your Sketching | Zhengyu Huang et.al. | 2306.07476v1 | null |
2023-06-15 | Strokes2Surface: Recovering Curve Networks From 4D Architectural Design Sketches | S. Rasoulzadeh et.al. | 2306.07220v2 | link |
2023-06-11 | Learning the Positions in CountSketch | Yi Li et.al. | 2306.06611v1 | null |
2023-06-09 | SENS: Sketch-based Implicit Neural Shape Modeling | Alexandre Binninger et.al. | 2306.06088v1 | null |
2023-06-09 | Sketch2Stress: Sketching with Structural Stress Awareness | Deng Yu et.al. | 2306.05911v1 | null |
2023-06-09 | Sketch Beautification: Learning Part Beautification and Structure Refinement for Sketches of Man-made Objects | Deng Yu et.al. | 2306.05832v1 | null |
2023-06-05 | Tracking Evolving labels using Cone based Oracles | Aditya Acharya et.al. | 2306.03306v1 | null |
2023-06-09 | Explicit Construction of q-ary 2-deletion Correcting Codes with Low Redundancy | Shu Liu et.al. | 2306.02868v2 | null |
2023-06-06 | VideoComposer: Compositional Video Synthesis with Motion Controllability | Xiang Wang et.al. | 2306.02018v2 | null |
2023-06-07 | Cross Modal Data Discovery over Structured and Unstructured Data Lakes | Mohamed Y. Eltabakh et.al. | 2306.00932v2 | link |
2023-06-01 | Towards Interactive Image Inpainting via Sketch Refinement | Chang Liu et.al. | 2306.00407v1 | link |
2023-06-01 | Faster Robust Tensor Power Method for Arbitrary Order | Yichuan Deng et.al. | 2306.00406v1 | null |
2023-05-31 | Knowledge Base Question Answering for Space Debris Queries | Paul Darm et.al. | 2305.19734v1 | link |
2023-05-30 | A Recipe for Efficient SBIR Models: Combining Relative Triplet Loss with Batch Normalization and Knowledge Distillation | Omar Seddati et.al. | 2305.18988v1 | null |
2023-05-30 | DiffSketching: Sketch Control Image Synthesis with Diffusion Models | Qiang Wang et.al. | 2305.18812v1 | link |
2023-05-30 | Generalization Bounds for Magnitude-Based Pruning via Sparse Matrix Sketching | Etash Kumar Guha et.al. | 2305.18789v1 | null |
2023-05-29 | Controllable Text-to-Image Generation with GPT-4 | Tianjun Zhang et.al. | 2305.18583v1 | null |
2023-05-29 | ANPL: Compiling Natural Programs with Interactive Decomposition | Di Huang et.al. | 2305.18498v1 | link |
2023-05-30 | TaleCrafter: Interactive Story Visualization with Multiple Characters | Yuan Gong et.al. | 2305.18247v2 | link |
2023-05-27 | Pruning at Initialization -- A Sketching Perspective | Noga Bar et.al. | 2305.17559v1 | null |
2023-05-27 | On the Noise Sensitivity of the Randomized SVD | Elad Romanov et.al. | 2305.17435v1 | link |
2023-05-26 | BIG-C: a Multimodal Multi-Purpose Dataset for Bemba | Claytone Sikasote et.al. | 2305.17202v1 | link |
2023-05-26 | CARAMEL: A Succinct Read-Only Lookup Table via Compressed Static Functions | Benjamin Coleman et.al. | 2305.16545v1 | null |
2023-05-25 | SketchOGD: Memory-Efficient Continual Learning | Benjamin Wright et.al. | 2305.16424v1 | link |
2023-05-24 | DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion Models | Sungnyun Kim et.al. | 2305.15194v1 | link |
2023-05-23 | Distributed CONGEST Algorithms against Mobile Adversaries | Orr Fischer et.al. | 2305.14300v1 | null |
2023-05-19 | MaGIC: Multi-modality Guided Image Completion | Yongsheng Yu et.al. | 2305.11818v1 | null |
2023-05-19 | MIDI-Draw: Sketching to Control Melody Generation | Tashi Namgyal et.al. | 2305.11605v1 | null |
2023-05-17 | Data Extraction via Semantic Regular Expression Synthesis | Qiaochu Chen et.al. | 2305.10401v1 | null |
2023-05-15 | Scalable and Robust Tensor Ring Decomposition for Large-scale Data | Yicong He et.al. | 2305.09044v1 | null |
2023-05-15 | Validity Constraints for Data Analysis Workflows | Florian Schintke et.al. | 2305.08409v1 | null |
2023-05-15 | Fast and Efficient Matching Algorithm with Deadline Instances | Zhao Song et.al. | 2305.08353v1 | null |
2023-05-15 | Approximation and Progressive Display of Multiverse Analyses | Yang Liu et.al. | 2305.08323v1 | null |
2023-05-11 | Enabling Programming Thinking in Large Language Models Toward Code Generation | Jia Li et.al. | 2305.06599v1 | null |
2023-05-12 | Searching Mobile App Screens via Text + Doodle | Soumik Mohian et.al. | 2305.06165v2 | link |
2023-05-10 | Sketching the Future (STF): Applying Conditional Control Techniques to Text-to-Video Models | Rohan Dhesikan et.al. | 2305.05845v1 | link |
2023-05-09 | Adapt and Align to Improve Zero-Shot Sketch-Based Image Retrieval | Shiyin Dong et.al. | 2305.05144v1 | null |
2023-05-08 | Behavioural Types for Local-First Software | Roland Kuhn et.al. | 2305.04848v1 | null |
2023-05-09 | Locally Attentional SDF Diffusion for Controllable 3D Shape Generation | Xin-Yang Zheng et.al. | 2305.04461v2 | null |
2023-05-08 | Oblivious algorithms for the Max- |
Noah G. Singer et.al. | 2305.04438v1 | null |
2023-05-05 | Towards Feminist Intersectional XAI: From Explainability to Response-Ability | Goda Klumbyte et.al. | 2305.03375v1 | null |
2023-05-04 | Program Synthesis for Robot Learning from Demonstrations | Noah Patton et.al. | 2305.03129v1 | null |
2023-05-04 | HAISTA-NET: Human Assisted Instance Segmentation Through Attention | Muhammed Korkmaz et.al. | 2305.03105v1 | null |
2023-05-04 | Controllable Visual-Tactile Synthesis | Ruihan Gao et.al. | 2305.03051v1 | link |
2023-05-02 | A Survey of Methods for Converting Unstructured Data to CSG Models | Pierre-Alain Fayolle et.al. | 2305.01220v1 | null |
2023-05-01 | IndoorSim-to-OutdoorReal: Learning to Navigate Outdoors without any Outdoor Experience | Joanne Truong et.al. | 2305.01098v1 | null |
2023-05-01 | Design and Evaluation of a Bioinspired Tendon-Driven 3D-Printed Robotic Eye with Active Vision Capabilities | Hamid Osooli et.al. | 2305.01076v1 | link |
2023-05-01 | semantic neural model approach for face recognition from sketch | Chandana Navuluri et.al. | 2305.01058v1 | null |
2023-04-25 | Bridging graph data models: RDF, RDF-star, and property graphs as directed acyclic graphs | Ewout Gelling et.al. | 2304.13097v1 | link |
2023-04-25 | DualSlide: Global-to-Local Sketching Interface for Slide Content and Layout Design | Jiahao Weng et.al. | 2304.12506v1 | null |
2023-04-23 | SketchXAI: A First Look at Explainability for Human Sketches | Zhiyu Qu et.al. | 2304.11744v1 | null |
2023-04-22 | (Vector) Space is Not the Final Frontier: Product Search as Program Synthesis | Jacopo Tagliabue et.al. | 2304.11473v1 | null |
2023-04-21 | The centaur programmer -- How Kasparov's Advanced Chess spans over to the software development of the future | Pedro Alves et.al. | 2304.11172v1 | null |
2023-04-19 | StyleDEM: a Versatile Model for Authoring Terrains | Simon Perche et.al. | 2304.09626v1 | null |
2023-04-19 | Sensitivity estimation for differentially private query processing | Meifan Zhang et.al. | 2304.09546v1 | null |
2023-04-19 | A Protocol for Cast-as-Intended Verifiability with a Second Device | Johannes Müller et.al. | 2304.09456v1 | null |
2023-04-18 | Optimal Eigenvalue Approximation via Sketching | William Swartworth et.al. | 2304.09281v1 | null |
2023-04-18 | GUILGET: GUI Layout GEneration with Transformer | Andrey Sobolevsky et.al. | 2304.09012v1 | link |
2023-04-18 | Coefficient Synthesis for Threshold Automata | A. R. Balasubramanian et.al. | 2304.08917v1 | null |
2023-04-18 | Online fair division with arbitrary entitlements | Kushagra Chatterjee et.al. | 2304.08864v1 | null |
2023-04-17 | Learning Geometry-aware Representations by Sketching | Hyundo Lee et.al. | 2304.08204v1 | null |
2023-04-15 | Learned Interpolation for Better Streaming Quantile Approximation with Worst-Case Guarantees | Nicholas Schiefer et.al. | 2304.07652v1 | null |
2023-04-15 | Remembering Ludwig Dmitrievich Faddeev, our lifelong partner in mathematical physics | Daniel Sternheimer et.al. | 2304.07577v1 | null |
2023-04-14 | Pool Inference Attacks on Local Differential Privacy: Quantifying the Privacy Guarantees of Apple's Count Mean Sketch in Practice | Andrea Gadotti et.al. | 2304.07134v1 | null |
2023-04-14 | On deterministic, constant memory triangular searches on the integer lattice | J. Alfredo Cruz-Carlon et.al. | 2304.07033v1 | null |
2023-04-13 | Learning Controllable 3D Diffusion Models from Single-view Images | Jiatao Gu et.al. | 2304.06700v1 | null |
2023-04-13 | On streaming approximation algorithms for constraint satisfaction problems | Noah G. Singer et.al. | 2304.06664v1 | null |
2023-04-13 | Solving Tensor Low Cycle Rank Approximation | Yichuan Deng et.al. | 2304.06594v1 | null |
2023-04-12 | TextANIMAR: Text-based 3D Animal Fine-Grained Retrieval | Trung-Nghia Le et.al. | 2304.06053v1 | null |
2023-04-12 | SketchANIMAR: Sketch-based 3D Animal Fine-Grained Retrieval | Trung-Nghia Le et.al. | 2304.05731v1 | null |
2023-04-10 | Identity-Guided Collaborative Learning for Cloth-Changing Person Reidentification | Zan Gao et.al. | 2304.04400v1 | null |
2023-04-09 | On Extend-Only Directed Posets and Derived Byzantine-Tolerant Replicated Data Types (Extended Version) | Florian Jacob et.al. | 2304.04318v1 | null |
2023-04-07 | ChiroDiff: Modelling chirographic data with Diffusion Models | Ayan Das et.al. | 2304.03785v1 | null |
2023-04-06 | SketchFFusion: Sketch-guided image editing with diffusion model | Weihang Mao et.al. | 2304.03174v1 | null |
2023-04-06 | LSketch: A Label-Enabled Graph Stream Sketch Toward Time-Sensitive Queries | Yiling Zeng et.al. | 2304.02897v1 | null |
2023-04-05 | Tracing and Visualizing Human-ML/AI Collaborative Processes through Artifacts of Data Work | Jennifer Rogers and et.al. | 2304.02699v1 | null |
2023-04-05 | Beyond Summarization: Designing AI Support for Real-World Expository Writing Tasks | Zejiang Shen et.al. | 2304.02623v1 | null |
2023-04-05 | Optimal Sketching Bounds for Sparse Linear Regression | Tung Mai et.al. | 2304.02261v1 | null |
2023-04-05 | LogoNet: a fine-grained network for instance-level logo sketch retrieval | Binbin Feng et.al. | 2304.02214v1 | link |
2023-04-04 | Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing | Alberto Baldrati et.al. | 2304.02051v1 | link |
2023-04-02 | Sketch-based Video Object Localization | Sangmin Woo et.al. | 2304.00450v1 | link |
2023-03-31 | Almost Linear Constant-Factor Sketching for |
Alexander Munteanu et.al. | 2304.00051v1 | link |
2023-03-30 | If At First You Don't Succeed: Test Time Re-ranking for Zero-shot, Cross-domain Retrieval | Finlay G. C. Hudson et.al. | 2303.17703v1 | null |
2023-03-30 | Methods and advancement of content-based fashion image retrieval: A Review | Amin Muhammad Shoib et.al. | 2303.17371v1 | null |
2023-03-29 | Sketch-an-Anchor: Sub-epoch Fast Model Adaptation for Zero-shot Sketch-based Image Retrieval | Leo Sampaio Ferraz Ribeiro et.al. | 2303.16769v1 | null |
2023-03-28 | Visual Chain-of-Thought Diffusion Models | William Harvey et.al. | 2303.16187v1 | link |
2023-03-27 | What Can Human Sketches Do for Object Detection? | Pinaki Nath Chowdhury et.al. | 2303.15149v1 | null |
2023-03-25 | Zero-Shot Everything Sketch-Based Image Retrieval, and in Explainable Style | Fengyin Lin et.al. | 2303.14348v1 | link |
2023-03-24 | Feature Space Sketching for Logistic Regression | Gregory Dexter et.al. | 2303.14284v1 | null |
2023-03-24 | Exploiting Unlabelled Photos for Stronger Fine-Grained SBIR | Aneeshan Sain et.al. | 2303.13779v1 | null |
2023-03-24 | The First Computer Program | Raúl Rojas et.al. | 2303.13740v1 | null |
2023-03-28 | CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained or Not | Aneeshan Sain et.al. | 2303.13440v3 | null |
2023-03-23 | Defining Quality Requirements for a Trustworthy AI Wildflower Monitoring Platform | Petra Heck et.al. | 2303.13151v1 | null |
2023-03-22 | Evaluation of Sketch-Based and Semantic-Based Modalities for Mockup Generation | Tommaso Calò et.al. | 2303.12709v1 | null |
2023-03-22 | An Extended Study of Human-like Behavior under Adversarial Training | Paul Gavrikov et.al. | 2303.12669v1 | link |
2023-03-24 | RaBit: Parametric Modeling of 3D Biped Cartoon Characters with a Topological-consistent Dataset | Zhongjin Luo et.al. | 2303.12564v2 | null |
2023-03-21 | Roots and Requirements for Collaborative AI | Mark Stefik et.al. | 2303.12040v1 | null |
2023-03-23 | Sketch2Saliency: Learning to Detect Salient Objects from Human Drawings | Ayan Kumar Bhunia et.al. | 2303.11502v2 | null |
2023-03-20 | Automatic Measures for Evaluating Generative Design Methods for Architects | Eric Yeh et.al. | 2303.11483v1 | null |
2023-03-20 | Picture that Sketch: Photorealistic Image Generation from Abstract Sketches | Subhadeep Koley et.al. | 2303.11162v1 | null |
2023-03-20 | On the Maximal Independent Sets of |
Leran Ma et.al. | 2303.10926v1 | link |
2023-03-19 | SKED: Sketch-guided Text-based 3D Editing | Aryan Mikaeili et.al. | 2303.10735v1 | null |
2023-03-19 | Trainable Projected Gradient Method for Robust Fine-tuning | Junjiao Tian et.al. | 2303.10720v1 | link |
2023-03-19 | EduVis: Workshop on Visualization Education, Literacy, and Activities | Mandy Keck et.al. | 2303.10708v1 | null |
2023-03-19 | SECAD-Net: Self-Supervised CAD Reconstruction by Learning Sketch-Extrude Operations | Pu Li et.al. | 2303.10613v1 | link |
2023-03-17 | PersonalTailor: Personalizing 2D Pattern Design from 3D Garment Point Clouds | Anran Qi et.al. | 2303.09695v1 | null |
2023-03-15 | Query-guided Attention in Vision Transformers for Localizing Objects Using a Single Sketch | Aditay Tripathi et.al. | 2303.08784v1 | null |
2023-03-15 | RIS-Enabled Smart Wireless Environments: Deployment Scenarios, Network Architecture, Bandwidth and Area of Influence | George C. Alexandropoulos et.al. | 2303.08505v1 | null |
2023-03-14 | Data-Free Sketch-Based Image Retrieval | Abhra Chaudhuri et.al. | 2303.07775v1 | link |
2023-03-13 | Can Workers Meaningfully Consent to Workplace Wellbeing Technologies? | Shreya Chowdhary et.al. | 2303.07242v1 | null |
2023-03-13 | An Improved Sample Complexity for Rank-1 Matrix Sensing | Yichuan Deng et.al. | 2303.06895v1 | null |
2023-03-10 | StyleGANEX: StyleGAN-Based Manipulation Beyond Cropped Aligned Faces | Shuai Yang et.al. | 2303.06146v1 | link |
2023-03-08 | Sketching with Spherical Designs for Noisy Data Fitting on Spheres | Shao-Bo Lin et.al. | 2303.04550v1 | null |
2023-03-08 | Models of symbol emergence in communication: a conceptual review and a guide for avoiding local minima | Julian Zubek et.al. | 2303.04544v1 | null |
2023-03-07 | Introspective Cross-Attention Probing for Lightweight Transfer of Pre-trained Models | Yonatan Dukler et.al. | 2303.04105v1 | null |
2023-03-06 | Data Portraits: Recording Foundation Model Training Data | Marc Marone et.al. | 2303.03919v1 | null |
2023-03-07 | Sketch-based Medical Image Retrieval | Kazuma Kobayashi et.al. | 2303.03633v1 | null |
2023-03-06 | Model Sketching: Centering Concepts in Early-Stage Machine Learning Model Design | Michelle S. Lam et.al. | 2303.02884v1 | link |
2023-03-05 | Text2Face: A Multi-Modal 3D Face Model | Will Rowan et.al. | 2303.02688v1 | null |
2023-03-03 | Graph-based Extreme Feature Selection for Multi-class Classification Tasks | Shir Friedman et.al. | 2303.01792v1 | null |
2023-03-02 | Coresets for Clustering in Geometric Intersection Graphs | Sayan Bandyapadhyay et.al. | 2303.01400v1 | null |
2023-03-01 | Sketch2Cloth: Sketch-based 3D Garment Generation with Unsigned Distance Fields | Yi He et.al. | 2303.00167v1 | null |
2023-02-26 | Towards Human-Bot Collaborative Software Architecting with ChatGPT | Aakash Ahmad et.al. | 2302.14600v1 | link |
2023-02-28 | On-the-Fly Communication-and-Computing for Distributed Tensor Decomposition Over MIMO Channels | Xu Chen et.al. | 2302.14297v1 | null |
2023-02-27 | Capstone: A Capability-based Foundation for Trustless Secure Memory Access (Extended Version) | Jason Zhijingcheng Yu et.al. | 2302.13863v1 | null |
2023-02-27 | Evaluation of Automatically Constructed Word Meaning Explanations | Marie Stará et.al. | 2302.13625v1 | null |
2023-02-26 | Scalable Weight Reparametrization for Efficient Transfer Learning | Byeonggeun Kim et.al. | 2302.13435v1 | null |
2023-02-24 | Modulating Pretrained Diffusion Models for Multimodal Image Synthesis | Cusuh Ham et.al. | 2302.12764v1 | null |
2023-02-23 | Using Colors and Sketches to Count Subgraphs in a Streaming Graph | Shirin Handjani et.al. | 2302.12210v1 | null |
2023-02-24 | A Scalable Space-efficient In-database Interpretability Framework for Embedding-based Semantic SQL Queries | Prabhakar Kudva et.al. | 2302.12178v2 | null |
2023-02-22 | A Reference Architecture for Observability and Compliance of Cloud Native Applications | William Pourmajidi et.al. | 2302.11617v1 | null |
2023-02-20 | Ontology-aware Network for Zero-shot Sketch-based Image Retrieval | Haoxiang Zhang et.al. | 2302.10040v1 | null |
2023-02-22 | Composer: Creative and Controllable Image Synthesis with Composable Conditions | Lianghua Huang et.al. | 2302.09778v2 | link |
2023-02-16 | Rejecting Cognitivism: Computational Phenomenology for Deep Learning | Pierre Beckmann et.al. | 2302.09071v1 | null |
2023-02-14 | DiffFaceSketch: High-Fidelity Face Image Synthesis with Sketch-Guided Latent Diffusion Model | Yichen Peng et.al. | 2302.06908v1 | link |
2023-02-14 | Text-Guided Scene Sketch-to-Photo Synthesis | AprilPyone MaungMaung et.al. | 2302.06883v1 | null |
2023-02-14 | Make Your Brief Stroke Real and Stereoscopic: 3D-Aware Simplified Sketch to Portrait Generation | Yasheng Sun et.al. | 2302.06857v1 | null |
2023-02-13 | SkCoder: A Sketch-based Approach for Automatic Code Generation | Jia Li et.al. | 2302.06144v1 | link |
2023-02-13 | Learning to Scale Temperature in Masked Self-Attention for Image Inpainting | Xiang Zhou et.al. | 2302.06130v1 | null |
2023-02-11 | An Evaluation Algorithm for Datalog with Equality | Martin E. Bidlingmaier et.al. | 2302.05792v1 | link |
2023-02-11 | Sketch Less Face Image Retrieval: A New Challenge | Dawei Dai et.al. | 2302.05576v1 | link |
2023-02-10 | MaskSketch: Unpaired Structure-guided Masked Image Generation | Dina Bashkirova et.al. | 2302.05496v1 | link |
2023-02-10 | Count-min sketch with variable number of hash functions: an experimental study | Éric Fusy et.al. | 2302.05245v1 | null |
2023-02-10 | Fast Gumbel-Max Sketch and its Applications | Yuanming Zhang et.al. | 2302.05176v1 | null |
2023-02-09 | Projection-free Online Exp-concave Optimization | Dan Garber et.al. | 2302.04859v1 | null |
2023-02-09 | Locally consistent decomposition of strings with applications to edit distance sketching | Sudatta Bhattacharya et.al. | 2302.04475v1 | null |
2023-02-06 | Sketching Robot Programs On the Fly | David Porfirio et.al. | 2302.03088v1 | null |
2023-02-05 | Leaving Reality to Imagination: Robust Classification via Generated Datasets | Hritik Bansal et.al. | 2302.02503v1 | link |
2023-02-04 | An Effective and Differentially Private Protocol for Secure Distributed Cardinality Estimation | Pinghui Wang et.al. | 2302.02158v1 | null |
2023-02-04 | Sketch-Flip-Merge: Mergeable Sketches for Private Distinct Counting | Jonathan Hehir et.al. | 2302.02056v1 | null |
2023-02-01 | A Nearly-Optimal Bound for Fast Regression with |
Zhao Song et.al. | 2302.00248v1 | null |
2023-01-31 | FLAME: A small language model for spreadsheet formulas | Harshit Joshi et.al. | 2301.13779v1 | null |
2023-01-30 | Streaming Anomaly Detection | Siddharth Bhatia et.al. | 2301.13199v1 | link |
2023-01-29 | BERT-based Authorship Attribution on the Romanian Dataset called ROST | Sanda-Maria Avram et.al. | 2301.12500v1 | null |
2023-01-26 | Synesthetic Dice: Sensors, Actuators, And Mappings | Albrecht Kurze et.al. | 2301.11436v1 | null |
2023-01-26 | Cut and Learn for Unsupervised Object Detection and Instance Segmentation | Xudong Wang et.al. | 2301.11320v1 | link |
2023-01-25 | Reflective Artificial Intelligence | Peter R. Lewis et.al. | 2301.10823v1 | null |
2023-01-25 | Distilling Text into Circuits | Vincent Wang-Mascianica et.al. | 2301.10595v1 | null |
2023-01-24 | Capacity Analysis of Vector Symbolic Architectures | Kenneth L. Clarkson et.al. | 2301.10352v1 | null |
2023-01-20 | Improving Sketch Colorization using Adversarial Segmentation Consistency | Samet Hicsonmez et.al. | 2301.08590v1 | link |
2023-01-19 | On Finite Blocklength Lossy Source Coding | Lin Zhou et.al. | 2301.07871v1 | null |
2023-01-17 | Vision Based Machine Learning Algorithms for Out-of-Distribution Generalisation | Hamza Riaz et.al. | 2301.06975v1 | null |
2023-01-17 | Distribution Aligned Feature Clustering for Zero-Shot Sketch-Based Image Retrieval | Yuchen Wu et.al. | 2301.06685v1 | null |
2023-01-16 | A Distributed Palette Sparsification Theorem | Maxime Flin et.al. | 2301.06457v1 | null |
2023-01-14 | Weighted Minwise Hashing Beats Linear Sketching for Inner Product Estimation | Aline Bessa et.al. | 2301.05811v1 | null |
2023-01-06 | Better Differentially Private Approximate Histograms and Heavy Hitters using the Misra-Gries Sketch | Christian Janos Lebeda et.al. | 2301.02457v1 | null |
2023-01-03 | EQUI-VOCAL: Synthesizing Queries for Compositional Video Events from Limited User Interactions [Technical Report] | Enhao Zhang et.al. | 2301.00929v1 | link |
2023-01-17 | Algorithms for Massive Data -- Lecture Notes | Nicola Prezza et.al. | 2301.00754v2 | null |
2022-12-28 | Modular termination verification with a higher-order concurrent separation logic (Intermediate report) | Justus Fasse et.al. | 2212.14126v1 | null |
2022-12-22 | A Domain-Extensible Compiler with Controllable Automation of Optimisations | Thomas Koehler et.al. | 2212.12035v1 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-04-03 | Neural Radiance Fields with Torch Units | Bingnan Ni et.al. | 2404.02617v1 | null |
2024-04-03 | TCLC-GS: Tightly Coupled LiDAR-Camera Gaussian Splatting for Surrounding Autonomous Driving Scenes | Cheng Zhao et.al. | 2404.02410v1 | null |
2024-04-03 | APC2Mesh: Bridging the gap from occluded building façades to full 3D models | Perpetual Hope Akwensi et.al. | 2404.02391v1 | null |
2024-04-01 | Neural Implicit Representation for Building Digital Twins of Unknown Articulated Objects | Yijia Weng et.al. | 2404.01440v1 | link |
2024-04-01 | NVINS: Robust Visual Inertial Navigation Fused with NeRF-augmented Camera Pose Regressor and Uncertainty Quantification | Juyeop Han et.al. | 2404.01400v1 | null |
2024-04-01 | FPGA-Accelerated Correspondence-free Point Cloud Registration with PointNet Features | Keisuke Sugiura et.al. | 2404.01237v1 | null |
2024-04-02 | Few-shot point cloud reconstruction and denoising via learned Guassian splats renderings and fine-tuned diffusion features | Pietro Bonazzi et.al. | 2404.01112v2 | null |
2024-03-30 | DiffHuman: Probabilistic Photorealistic 3D Reconstruction of Humans | Akash Sengupta et.al. | 2404.00485v1 | null |
2024-03-30 | 3DGSR: Implicit Surface Reconstruction with 3D Gaussian Splatting | Xiaoyang Lyu et.al. | 2404.00409v1 | null |
2024-03-29 | Sparse Views, Near Light: A Practical Paradigm for Uncalibrated Point-light Photometric Stereo | Mohammed Brahimi et.al. | 2404.00098v1 | null |
2024-03-29 | NeSLAM: Neural Implicit Mapping and Self-Supervised Feature Tracking With Depth Completion and Denoising | Tianchen Deng et.al. | 2403.20034v1 | link |
2024-03-28 | CoherentGS: Sparse Novel View Synthesis with Coherent 3D Gaussians | Avinash Paliwal et.al. | 2403.19495v1 | null |
2024-03-30 | Total-Decom: Decomposed 3D Scene Reconstruction with Minimal Interaction | Xiaoyang Lyu et.al. | 2403.19314v2 | link |
2024-03-28 | Neural Fields for 3D Tracking of Anatomy and Surgical Instruments in Monocular Laparoscopic Video Clips | Beerend G. A. Gerats et.al. | 2403.19265v1 | null |
2024-04-01 | WALT3D: Generating Realistic Training Data from Time-Lapse Imagery for Reconstructing Dynamic Objects under Occlusion | Khiem Vuong et.al. | 2403.19022v2 | null |
2024-03-29 | Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction | Qiuhong Shen et.al. | 2403.18795v2 | null |
2024-03-29 | SplatFace: Gaussian Splat Face Reconstruction Leveraging an Optimizable Surface | Jiahao Luo et.al. | 2403.18784v2 | null |
2024-03-27 | Breaking the Limitations with Sparse Inputs by Variational Frameworks (BLIss) in Terahertz Super-Resolution 3D Reconstruction | Yiyao Zhang et.al. | 2403.18776v1 | link |
2024-03-27 | SAT-NGP : Unleashing Neural Graphics Primitives for Fast Relightable Transient-Free 3D reconstruction from Satellite Imagery | Camille Billouard et.al. | 2403.18711v1 | null |
2024-03-26 | EgoLifter: Open-world 3D Segmentation for Egocentric Perception | Qiao Gu et.al. | 2403.18118v1 | null |
2024-03-25 | Creating a Digital Twin of Spinal Surgery: A Proof of Concept | Jonas Hein et.al. | 2403.16736v1 | null |
2024-03-25 | Spike-NeRF: Neural Radiance Field Based On Spike Camera | Yijia Guo et.al. | 2403.16410v1 | null |
2024-03-25 | Elite360D: Towards Efficient 360 Depth Estimation via Semantic- and Distance-Aware Bi-Projection Fusion | Hao Ai et.al. | 2403.16376v1 | null |
2024-03-24 | latentSplat: Autoencoding Variational Gaussians for Fast Generalizable 3D Reconstruction | Christopher Wewer et.al. | 2403.16292v1 | null |
2024-03-23 | UPNeRF: A Unified Framework for Monocular 3D Object Reconstruction and Pose Estimation | Yuliang Guo et.al. | 2403.15705v1 | null |
2024-03-22 | FastCAD: Real-Time CAD Retrieval and Alignment from Scans and Videos | Florian Langer et.al. | 2403.15161v1 | null |
2024-03-22 | Recent Trends in 3D Reconstruction of General Non-Rigid Scenes | Raza Yunus et.al. | 2403.15064v1 | null |
2024-03-21 | Hyperspectral Neural Radiance Fields | Gerry Chen et.al. | 2403.14839v1 | null |
2024-03-21 | GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation | Yinghao Xu et.al. | 2403.14621v1 | link |
2024-03-21 | Isotropic Gaussian Splatting for Real-Time Radiance Field Rendering | Yuanhao Gong et.al. | 2403.14244v1 | null |
2024-03-21 | Leveraging Thermal Modality to Enhance Reconstruction in Low-Light Conditions | Jiacong Xu et.al. | 2403.14053v1 | null |
2024-03-20 | T-Pixel2Mesh: Combining Global and Local Transformer for 3D Mesh Generation from a Single Image | Shijie Zhang et.al. | 2403.13663v1 | null |
2024-03-20 | MULAN-WC: Multi-Robot Localization Uncertainty-aware Active NeRF with Wireless Coordination | Weiying Wang et.al. | 2403.13348v1 | null |
2024-03-19 | GVGEN: Text-to-3D Generation with Volumetric Representation | Xianglong He et.al. | 2403.12957v1 | null |
2024-03-19 | PostoMETRO: Pose Token Enhanced Mesh Transformer for Robust 3D Human Mesh Recovery | Wendi Yang et.al. | 2403.12473v1 | null |
2024-03-18 | LN3Diff: Scalable Latent Neural Fields Diffusion for Speedy 3D Generation | Yushi Lan et.al. | 2403.12019v1 | null |
2024-03-18 | GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image | Xiao Fu et.al. | 2403.12013v1 | null |
2024-03-18 | SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image using Latent Video Diffusion | Vikram Voleti et.al. | 2403.12008v1 | null |
2024-03-18 | GNeRP: Gaussian-guided Neural Reconstruction of Reflective Objects with Noisy Polarization Priors | LI Yang et.al. | 2403.11899v1 | null |
2024-03-18 | OpenOcc: Open Vocabulary 3D Scene Reconstruction via Occupancy Representation | Haochen Jiang et.al. | 2403.11796v1 | null |
2024-03-18 | Fed3DGS: Scalable 3D Gaussian Splatting with Federated Learning | Teppei Suzuki et.al. | 2403.11460v1 | link |
2024-03-18 | BAGS: Building Animatable Gaussian Splatting from a Monocular Video with Diffusion Priors | Tingyang Zhang et.al. | 2403.11427v1 | null |
2024-03-17 | Creating Seamless 3D Maps Using Radiance Fields | Sai Tarun Sathyan et.al. | 2403.11364v1 | null |
2024-03-17 | Recent Advances in 3D Gaussian Splatting | Tong Wu et.al. | 2403.11134v1 | null |
2024-03-17 | Omni-Recon: Towards General-Purpose Neural Radiance Fields for Versatile 3D Applications | Yonggan Fu et.al. | 2403.11131v1 | null |
2024-03-16 | Ctrl123: Consistent Novel View Synthesis via Closed-Loop Transcription | Hongxiang Zhao et.al. | 2403.10953v1 | null |
2024-03-15 | SCILLA: SurfaCe Implicit Learning for Large Urban Area, a volumetric hybrid solution | Hala Djeghim et.al. | 2403.10344v1 | null |
2024-03-15 | FDGaussian: Fast Gaussian Splatting from Single Image via Geometric-aware Diffusion Model | Qijun Feng et.al. | 2403.10242v1 | null |
2024-03-15 | Den-SOFT: Dense Space-Oriented Light Field DataseT for 6-DOF Immersive Experience | Xiaohang Yu et.al. | 2403.09973v1 | null |
2024-03-14 | MARVIS: Motion & Geometry Aware Real and Virtual Image Segmentation | Jiayi Wu et.al. | 2403.09850v1 | link |
2024-03-14 | Relaxing Accurate Initialization Constraint for 3D Gaussian Splatting | Jaewoo Jung et.al. | 2403.09413v1 | link |
2024-03-13 | 3DFIRES: Few Image 3D REconstruction for Scenes with Hidden Surface | Linyi Jin et.al. | 2403.08768v1 | null |
2024-03-13 | Refractive COLMAP: Refractive Structure-from-Motion Revisited | Mengkun She et.al. | 2403.08640v1 | null |
2024-03-12 | Q-SLAM: Quadric Representations for Monocular SLAM | Chensheng Peng et.al. | 2403.08125v1 | null |
2024-03-11 | Bayesian Diffusion Models for 3D Shape Reconstruction | Haiyang Xu et.al. | 2403.06973v1 | null |
2024-03-08 | DITTO: Dual and Integrated Latent Topologies for Implicit 3D Reconstruction | Jaehyeok Shim et.al. | 2403.05005v1 | null |
2024-03-11 | Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed | Yifan Wang et.al. | 2403.04765v2 | null |
2024-03-08 | Finding Waldo: Towards Efficient Exploration of NeRF Scene Spaces | Evangelos Skartados et.al. | 2403.04508v2 | null |
2024-03-07 | CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoors Object Detection from Multi-view Images | Guanlin Shen et.al. | 2403.04198v1 | link |
2024-03-05 | Pooling Image Datasets With Multiple Covariate Shift and Imbalance | Sotirios Panagiotis Chytas et.al. | 2403.02598v1 | null |
2024-03-04 | TripoSR: Fast 3D Object Reconstruction from a Single Image | Dmitry Tochilkin et.al. | 2403.02151v1 | link |
2024-03-03 | A Novel Dynamic Light-Section 3D Reconstruction Method for Wide-Range Sensing | Mengjuan Chen et.al. | 2403.01374v1 | null |
2024-03-08 | G3DR: Generative 3D Reconstruction in ImageNet | Pradyumna Reddy et.al. | 2403.00939v2 | link |
2024-03-01 | DISORF: A Distributed Online NeRF Training and Rendering Framework for Mobile Robots | Chunlin Li et.al. | 2403.00228v1 | null |
2024-03-05 | VEnvision3D: A Synthetic Perception Dataset for 3D Multi-Task Model Research | Jiahao Zhou et.al. | 2402.19059v2 | null |
2024-02-27 | Sora Generates Videos with Stunning Geometrical Consistency | Xuanyi Li et.al. | 2402.17403v1 | null |
2024-02-27 | CharNeRF: 3D Character Generation from Concept Art | Eddy Chu et.al. | 2402.17115v1 | null |
2024-02-26 | DreamUp3D: Object-Centric Generative Models for Single-View 3D Scene Understanding and Real-to-Sim Transfer | Yizhe Wu et.al. | 2402.16308v1 | null |
2024-02-25 | GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction | Xiao Chen et.al. | 2402.16174v1 | null |
2024-02-24 | A Generative Machine Learning Model for Material Microstructure 3D Reconstruction and Performance Evaluation | Yilin Zheng et.al. | 2402.15815v1 | null |
2024-02-22 | Cameras as Rays: Pose Estimation via Ray Diffusion | Jason Y. Zhang et.al. | 2402.14817v1 | null |
2024-02-22 | Workspace Analysis for Laparoscopic Rectal Surgery : A Preliminary Study | Alexandra Thomieres et.al. | 2402.14386v1 | null |
2024-02-22 | MVD |
Xin-Yang Zheng et.al. | 2402.14253v1 | null |
2024-02-20 | MVDiffusion++: A Dense High-resolution Multi-view Diffusion Model for Single or Sparse-view 3D Object Reconstruction | Shitao Tang et.al. | 2402.12712v1 | null |
2024-02-25 | A Robust Error-Resistant View Selection Method for 3D Reconstruction | Shaojie Zhang et.al. | 2402.11431v2 | null |
2024-02-17 | Dense Matchers for Dense Tracking | Tomáš Jelínek et.al. | 2402.11287v1 | null |
2024-02-17 | DiffPoint: Single and Multi-view Point Cloud Reconstruction with ViT Based Diffusion Model | Yu Feng et.al. | 2402.11241v1 | null |
2024-02-15 | Evaluating NeRFs for 3D Plant Geometry Reconstruction in Field Conditions | Muhammad Arbab Arshad et.al. | 2402.10344v1 | null |
2024-02-15 | GES: Generalized Exponential Splatting for Efficient Radiance Field Rendering | Abdullah Hamdi et.al. | 2402.10128v1 | link |
2024-02-14 | PC-NeRF: Parent-Child Neural Radiance Fields Using Sparse LiDAR Frames in Autonomous Driving Environments | Xiuzhong Hu et.al. | 2402.09325v1 | link |
2024-02-14 | DUDF: Differentiable Unsigned Distance Fields with Hyperbolic Scaling | Miguel Fainstein et.al. | 2402.08876v1 | null |
2024-02-13 | IM-3D: Iterative Multiview Diffusion and Reconstruction for High-Quality 3D Generation | Luke Melas-Kyriazi et.al. | 2402.08682v1 | null |
2024-02-20 | Camera Calibration through Geometric Constraints from Rotation and Projection Matrices | Muhammad Waleed et.al. | 2402.08437v2 | link |
2024-02-09 | Neural Rendering based Urban Scene Reconstruction for Autonomous Driving | Shihao Shen et.al. | 2402.06826v1 | null |
2024-02-07 | Carousel phase retrieval algorithm for 3D coherent X-ray diffraction imaging | Fangzhou Ai et.al. | 2402.05283v1 | link |
2024-02-06 | EscherNet: A Generative Model for Scalable View Synthesis | Xin Kong et.al. | 2402.03908v1 | link |
2024-02-09 | MoD-SLAM: Monocular Dense Mapping for Unbounded 3D Scene Reconstruction | Heng Zhou et.al. | 2402.03762v2 | null |
2024-02-05 | Denoising Diffusion via Image-Based Rendering | Titas Anciukevicius et.al. | 2402.03445v1 | null |
2024-02-02 | Di-NeRF: Distributed NeRF for Collaborative Learning with Unknown Relative Poses | Mahboubeh Asadi et.al. | 2402.01485v1 | null |
2024-02-02 | DeepAAT: Deep Automated Aerial Triangulation for Fast UAV-based Mapping | Zequan Chen et.al. | 2402.01134v1 | link |
2024-02-01 | Enhanced fringe-to-phase framework using deep learning | Won-Hoe Kim et.al. | 2402.00977v1 | null |
2024-02-01 | Diffusion-based Light Field Synthesis | Ruisheng Gao et.al. | 2402.00575v1 | null |
2024-01-31 | Local Feature Matching Using Deep Learning: A Survey | Shibiao Xu et.al. | 2401.17592v1 | link |
2024-01-30 | Self-Supervised Representation Learning for Nerve Fiber Distribution Patterns in 3D-PLI | Alexander Oberstrass et.al. | 2401.17207v1 | null |
2024-01-30 | Physical Priors Augmented Event-Based 3D Reconstruction | Jiaxu Wang et.al. | 2401.17121v1 | link |
2024-01-30 | OmniSCV: An Omnidirectional Synthetic Image Generator for Computer Vision | Bruno Berenguel-Baeta et.al. | 2401.17061v1 | link |
2024-01-29 | Domain adaptation strategies for 3D reconstruction of the lumbar spine using real fluoroscopy data | Sascha Jecklin et.al. | 2401.16027v1 | null |
2024-01-29 | 2L3: Lifting Imperfect Generated 2D Images into Accurate 3D | Yizheng Chen et.al. | 2401.15841v1 | null |
2024-01-28 | Multi-Person 3D Pose Estimation from Multi-View Uncalibrated Depth Cameras | Yu-Jhe Li et.al. | 2401.15616v1 | null |
2024-01-26 | 3D Reconstruction and New View Synthesis of Indoor Environments based on a Dual Neural Radiance Field | Zhenyu Bao et.al. | 2401.14726v1 | link |
2024-01-25 | TIFu: Tri-directional Implicit Function for High-Fidelity 3D Character Reconstruction | Byoungsung Lim et.al. | 2401.14565v1 | null |
2024-01-25 | Range-Agnostic Multi-View Depth Estimation With Keyframe Selection | Andrea Conti et.al. | 2401.14401v1 | link |
2024-01-25 | pix2gestalt: Amodal Segmentation by Synthesizing Wholes | Ege Ozguroglu et.al. | 2401.14398v1 | link |
2024-01-25 | GauU-Scene: A Scene Reconstruction Benchmark on Large Scale 3D Reconstruction Dataset Using Gaussian Splatting | Butian Xiong et.al. | 2401.14032v1 | null |
2024-01-24 | EndoGaussians: Single View Dynamic Gaussian Splatting for Deformable Endoscopic Tissues Reconstruction | Yangsen Chen et.al. | 2401.13352v1 | null |
2024-01-23 | IRIS: Inverse Rendering of Indoor Scenes from Low Dynamic Range Images | Zhi-Hao Lin et.al. | 2401.12977v1 | null |
2024-01-21 | A Survey on African Computer Vision Datasets, Topics and Researchers | Abdul-Hakeem Omotayo et.al. | 2401.11617v1 | null |
2024-01-21 | Multi-View Neural 3D Reconstruction of Micro-/Nanostructures with Atomic Force Microscopy | Shuo Chen et.al. | 2401.11541v1 | link |
2024-01-21 | Deformable Endoscopic Tissues Reconstruction with Gaussian Splatting | Lingting Zhu et.al. | 2401.11535v1 | link |
2024-01-17 | POE: Acoustic Soft Robotic Proprioception for Omnidirectional End-effectors | Uksang Yoo et.al. | 2401.09382v1 | null |
2024-01-16 | Learning Implicit Representation for Reconstructing Articulated Objects | Hao Zhang et.al. | 2401.08809v1 | null |
2024-01-20 | Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis | Zhenhui Ye et.al. | 2401.08503v2 | null |
2024-01-16 | S3M: Semantic Segmentation Sparse Mapping for UAVs with RGB-D Camera | Thanh Nguyen Canh et.al. | 2401.08134v1 | null |
2024-01-12 | 3D Reconstruction of Interacting Multi-Person in Clothing from a Single Image | Junuk Cha et.al. | 2401.06415v1 | null |
2024-01-12 | SD-MVS: Segmentation-Driven Deformation Multi-View Stereo with Spherical Refinement and EM optimization | Zhenlong Yuan et.al. | 2401.06385v1 | null |
2024-01-12 | Surgical-DINO: Adapter Learning of Foundation Models for Depth Estimation in Endoscopic Surgery | Beilei Cui et.al. | 2401.06013v2 | link |
2024-01-10 | Structure from Duplicates: Neural Inverse Graphics from a Pile of Objects | Tianhang Cheng et.al. | 2401.05236v1 | link |
2024-01-07 | RHOBIN Challenge: Reconstruction of Human Object Interaction | Xianghui Xie et.al. | 2401.04143v1 | null |
2024-01-08 | AGG: Amortized Generative 3D Gaussians for Single Image to 3D | Dejia Xu et.al. | 2401.04099v1 | null |
2024-01-08 | A Survey on 3D Gaussian Splatting | Guikun Chen et.al. | 2401.03890v1 | null |
2024-01-03 | S3Net: Innovating Stereo Matching and Semantic Segmentation with a Single-Branch Semantic Stereo Network in Satellite Epipolar Imagery | Qingyuan Yang et.al. | 2401.01643v1 | link |
2023-12-29 | Informative Rays Selection for Few-Shot Neural Radiance Fields | Marco Orsingher et.al. | 2312.17561v1 | null |
2023-12-28 | Toward Semantic Scene Understanding for Fine-Grained 3D Modeling of Plants | Mohamad Qadri et.al. | 2312.17110v1 | null |
2023-12-28 | Learning Spatially Collaged Fourier Bases for Implicit Neural Representation | Jason Chun Lok Li et.al. | 2312.17018v1 | null |
2023-12-27 | In-Hand 3D Object Reconstruction from a Monocular RGB Video | Shijian Jiang et.al. | 2312.16425v1 | null |
2023-12-24 | SUNDIAL: 3D Satellite Understanding through Direct, Ambient, and Complex Lighting Decomposition | Nikhil Behari et.al. | 2312.16215v1 | null |
2023-12-24 | A theory of volumetric representations for opaque solids | Bailey Miller et.al. | 2312.15406v1 | null |
2023-12-22 | Pola4All: survey of polarimetric applications and an open-source toolkit to analyze polarization | Joaquin Rodriguez et.al. | 2312.14697v1 | link |
2023-12-22 | Scalable 3D Reconstruction From Single Particle X-Ray Diffraction Images Based on Online Machine Learning | Jay Shenoy et.al. | 2312.14432v1 | null |
2023-12-21 | PlatoNeRF: 3D Reconstruction in Plato's Cave via Single-View Two-Bounce Lidar | Tzofi Klinghoffer et.al. | 2312.14239v1 | null |
2023-12-21 | 3D Pose Estimation of Two Interacting Hands from a Monocular Event Camera | Christen Millerdurai et.al. | 2312.14157v1 | null |
2023-12-21 | DUSt3R: Geometric 3D Vision Made Easy | Shuzhe Wang et.al. | 2312.14132v1 | link |
2023-12-21 | Anatomical basis of sex differences in human post-myocardial infarction ECG phenotypes identified by novel automated torso-cardiac 3D reconstruction | Hannah J. Smith et.al. | 2312.13976v1 | null |
2023-12-21 | SyncDreamer for 3D Reconstruction of Endangered Animal Species with NeRF and NeuS | Ahmet Haydar Ornek et.al. | 2312.13832v1 | null |
2023-12-21 | Visual Tomography: Physically Faithful Volumetric Models of Partially Translucent Objects | David Nakath et.al. | 2312.13494v1 | null |
2023-12-20 | UniSDF: Unifying Neural Representations for High-Fidelity 3D Reconstruction of Complex Scenes with Reflections | Fangjinhua Wang et.al. | 2312.13285v1 | null |
2023-12-20 | Splatter Image: Ultra-Fast Single-View 3D Reconstruction | Stanislaw Szymanowicz et.al. | 2312.13150v1 | link |
2023-12-21 | pixelSplat: 3D Gaussian Splats from Image Pairs for Scalable Generalizable 3D Reconstruction | David Charatan et.al. | 2312.12337v2 | link |
2023-12-19 | EVI-SAM: Robust, Real-time, Tightly-coupled Event-Visual-Inertial State Estimation and 3D Dense Mapping | Weipeng Guan et.al. | 2312.11911v1 | link |
2023-12-17 | Primitive-based 3D Human-Object Interaction Modelling and Programming | Siqi Liu et.al. | 2312.10714v1 | null |
2023-12-16 | Triplane Meets Gaussian Splatting: Fast and Generalizable Single-View 3D Reconstruction with Transformers | Zi-Xin Zou et.al. | 2312.09147v2 | null |
2023-12-14 | Living Scenes: Multi-object Relocalization and Reconstruction in Changing 3D Environments | Liyuan Zhu et.al. | 2312.09138v1 | null |
2023-12-14 | Scene 3-D Reconstruction System in Scattering Medium | Zhuoyifan Zhang et.al. | 2312.09005v1 | null |
2023-12-11 | Gaussian Splatting SLAM | Hidenobu Matsuki et.al. | 2312.06741v1 | null |
2023-12-10 | UNeR3D: Versatile and Scalable 3D RGB Point Cloud Generation from 2D Images in Unsupervised Reconstruction | Hongbin Lin et.al. | 2312.06706v1 | null |
2023-12-10 | SuperPrimitive: Scene Reconstruction at a Primitive Level | Kirill Mazur et.al. | 2312.05889v1 | null |
2023-12-11 | Nuvo: Neural UV Mapping for Unruly 3D Representations | Pratul P. Srinivasan et.al. | 2312.05283v1 | null |
2023-12-08 | Fine Dense Alignment of Image Bursts through Camera Pose and Depth Estimation | Bruno Lecouat et.al. | 2312.05190v1 | null |
2023-12-08 | SuperNormal: Neural Surface Reconstruction via Multi-View Normal Integration | Xu Cao et.al. | 2312.04803v1 | null |
2023-12-07 | FitDiff: Robust monocular 3D facial shape and reflectance estimation using Diffusion Models | Stathis Galanakis et.al. | 2312.04465v1 | null |
2023-12-06 | Gaussian-Flow: 4D Reconstruction with Dynamic 3D Gaussian Particle | Youtian Lin et.al. | 2312.03431v1 | null |
2023-12-06 | Evaluating the point cloud of individual trees generated from images based on Neural Radiance fields (NeRF) method | Hongyu Huang et.al. | 2312.03372v1 | null |
2023-12-06 | RING-NeRF: A Versatile Architecture based on Residual Implicit Neural Grids | Doriand Petit et.al. | 2312.03357v1 | null |
2023-12-05 | ReconFusion: 3D Reconstruction with Diffusion Priors | Rundi Wu et.al. | 2312.02981v1 | null |
2023-12-05 | R3D-SWIN:Use Shifted Window Attention for Single-View 3D Reconstruction | Chenhuan Li et.al. | 2312.02725v1 | null |
2023-12-05 | DreaMo: Articulated 3D Reconstruction From A Single Casual Video | Tao Tu et.al. | 2312.02617v1 | null |
2023-12-05 | Prompt2NeRF-PIL: Fast NeRF Generation via Pretrained Implicit Latent | Jianmeng Liu et.al. | 2312.02568v1 | null |
2023-12-03 | Slice3D: Multi-Slice, Occlusion-Revealing, Single View 3D Reconstruction | Yizhi Wang et.al. | 2312.02221v1 | null |
2023-12-04 | Steerers: A framework for rotation equivariant keypoint descriptors | Georg Bökman et.al. | 2312.02152v1 | link |
2023-12-04 | iMatching: Imperative Correspondence Learning | Zitong Zhan et.al. | 2312.02141v1 | null |
2023-12-04 | Light Field Imaging in the Restrictive Object Space based on Flexible Angular Plane | Ping Zhou et.al. | 2312.01761v1 | null |
2023-12-02 | RNb-NeuS: Reflectance and Normal-based Multi-View 3D Reconstruction | Baptiste Brument et.al. | 2312.01215v1 | link |
2023-12-05 | Self-Evolving Neural Radiance Fields | Jaewoo Jung et.al. | 2312.01003v2 | link |
2023-12-01 | NeuSG: Neural Implicit Surface Reconstruction with 3D Gaussian Splatting Guidance | Hanlin Chen et.al. | 2312.00846v1 | null |
2023-12-01 | UAVs and Birds: Enhancing Short-Range Navigation through Budgerigar Flight Studies | Md. Mahmudur Rahman et.al. | 2312.00597v1 | null |
2023-11-30 | Learning One-Shot 4D Head Avatar Synthesis using Synthetic Data | Yu Deng et.al. | 2311.18729v1 | null |
2023-11-30 | Multi-task learning with cross-task consistency for improved depth estimation in colonoscopy | Pedro Esteban Chavarrias Solano et.al. | 2311.18664v1 | null |
2023-11-30 | HOLD: Category-agnostic 3D Reconstruction of Interacting Hands and Objects from Video | Zicong Fan et.al. | 2311.18448v1 | link |
2023-11-29 | Volumetric Cloud Field Reconstruction | Jacob Lin et.al. | 2311.17657v1 | null |
2023-11-30 | REF |
Wooseok Kim et.al. | 2311.17116v2 | link |
2023-11-28 | Multi-Scale 3D Gaussian Splatting for Anti-Aliased Rendering | Zhiwen Yan et.al. | 2311.17089v1 | null |
2023-11-28 | Surf-D: High-Quality Surface Generation for Arbitrary Topologies using Diffusion Models | Zhengming Yu et.al. | 2311.17050v1 | null |
2023-11-28 | Gradient-based Local Next-best-view Planning for Improved Perception of Targeted Plant Nodes | Akshay K. Burusa et.al. | 2311.16759v1 | null |
2023-11-28 | RGBGrasp: Image-based Object Grasping by Capturing Multiple Views during Robot Arm Movement with Neural Radiance Fields | Chang Liu et.al. | 2311.16592v1 | null |
2023-11-28 | Rethinking Directional Integration in Neural Radiance Fields | Congyue Deng et.al. | 2311.16504v1 | null |
2023-11-27 | Weakly-Supervised 3D Reconstruction of Clothed Humans via Normal Maps | Jane Wu et.al. | 2311.16042v1 | null |
2023-11-27 | SiTH: Single-view Textured Human Reconstruction with Image-Conditioned Diffusion | Hsuan-I Ho et.al. | 2311.15855v1 | null |
2023-11-26 | Obj-NeRF: Extract Object NeRFs from Multi-view Images | Zhiyi Li et.al. | 2311.15291v1 | null |
2023-11-25 | Multi-task Planar Reconstruction with Feature Warping Guidance | Luan Wei et.al. | 2311.14981v1 | link |
2023-11-24 | RSB-Pose: Robust Short-Baseline Binocular 3D Human Pose Estimation with Occlusion Handling | Xiaoyue Wan et.al. | 2311.14242v1 | null |
2023-11-23 | GigaPose: Fast and Robust Novel Object Pose Estimation via One Correspondence | Van Nguyen Nguyen et.al. | 2311.14155v1 | link |
2023-11-23 | MonoNav: MAV Navigation via Monocular Depth Estimation and Reconstruction | Nathaniel Simon et.al. | 2311.14100v1 | null |
2023-11-23 | DRIFu: Differentiable Rendering and Implicit Function-based Single-View 3D Reconstruction | Zijian Kuang et.al. | 2311.13199v2 | link |
2023-11-22 | Differentiable Radio Frequency Ray Tracing for Millimeter-Wave Sensing | Xingyu Chen et.al. | 2311.13182v1 | null |
2023-11-21 | Physics-guided Shape-from-Template: Monocular Video Perception through Neural Surrogate Models | David Stotko et.al. | 2311.12796v1 | link |
2023-11-20 | Mixing-Denoising Generalizable Occupancy Networks | Amine Ouasfi et.al. | 2311.12125v1 | null |
2023-11-23 | PF-LRM: Pose-Free Large Reconstruction Model for Joint Pose and Shape Prediction | Peng Wang et.al. | 2311.12024v2 | null |
2023-11-19 | GaussianDiffusion: 3D Gaussian Splatting for Denoising Diffusion Probabilistic Models with Structured Noise | Xinhai Li et.al. | 2311.11221v1 | null |
2023-11-18 | LOSTU: Fast, Scalable, and Uncertainty-Aware Triangulation | Sébastien Henry et.al. | 2311.11171v1 | null |
2023-11-18 | Invariant-based Mapping of Space During General Motion of an Observer | Juan D. Yepes et.al. | 2311.11130v1 | null |
2023-11-16 | DSR-Diff: Depth Map Super-Resolution with Diffusion Model | Yuan Shi et.al. | 2311.09919v1 | null |
2023-11-18 | EvaSurf: Efficient View-Aware Implicit Textured Surface Reconstruction on Mobile Devices | Jingnan Gao et.al. | 2311.09806v2 | null |
2023-11-14 | DynamicSurf: Dynamic Neural RGB-D Surface Reconstruction with an Optimizable Feature Grid | Mirgahney Mohamed et.al. | 2311.08159v1 | null |
2023-11-13 | Liangchen Li et.al. | 2311.07044v1 | null | |
2023-11-11 | 3DFusion, A real-time 3D object reconstruction pipeline based on streamed instance segmented data | Xi Sun et.al. | 2311.06659v1 | null |
2023-11-09 | ConRad: Image Constrained Radiance Fields for 3D Generation from a Single Image | Senthil Purushwalkam et.al. | 2311.05230v1 | null |
2023-11-08 | Implicit Neural Representations for Breathing-compensated Volume Reconstruction in Robotic Ultrasound Aorta Screening | Yordanka Velikova et.al. | 2311.04999v1 | null |
2023-11-08 | LRM: Large Reconstruction Model for Single Image to 3D | Yicong Hong et.al. | 2311.04400v1 | null |
2023-11-07 | High-fidelity 3D Reconstruction of Plants using Neural Radiance Field | Kewei Hu et.al. | 2311.04154v1 | null |
2023-11-07 | DeepPatent2: A Large-Scale Benchmarking Corpus for Technical Drawing Understanding | Kehinde Ajayi et.al. | 2311.04098v1 | link |
2023-11-05 | MuSHRoom: Multi-Sensor Hybrid Room Dataset for Joint 3D Reconstruction and Novel View Synthesis | Xuqian Ren et.al. | 2311.02778v1 | null |
2023-11-05 | Fast Point-cloud to Mesh Reconstruction for Deformable Object Tracking | Elham Amin Mansour et.al. | 2311.02749v1 | null |
2023-11-05 | IPVNet: Learning Implicit Point-Voxel Features for Open-Surface 3D Reconstruction | Mohammad Samiul Arshad et.al. | 2311.02552v1 | link |
2023-11-02 | CADSim: Robust and Scalable in-the-wild 3D Reconstruction for Controllable Sensor Simulation | Jingkang Wang et.al. | 2311.01447v1 | null |
2023-11-02 | Look at Robot Base Once: Hand-Eye Calibration with Point Clouds of Robot Base Leveraging Learning-Based 3D Vision | Leihui Li et.al. | 2311.01335v1 | link |
2023-11-02 | Joint 3D Shape and Motion Estimation from Rolling Shutter Light-Field Images | Hermes McGriff et.al. | 2311.01292v1 | link |
2023-11-01 | Single-view 3D Scene Reconstruction with High-fidelity Shape and Texture | Yixin Chen et.al. | 2311.00457v1 | null |
2023-10-31 | Deep Compressed Learning for 3D Seismic Inversion | Maayan Gelboim et.al. | 2311.00107v1 | null |
2023-10-31 | Refined Equivalent Pinhole Model for Large-scale 3D Reconstruction from Spaceborne CCD Imagery | Hong Danyang et.al. | 2310.20117v1 | null |
2023-10-29 | 3DMiner: Discovering Shapes from Large-Scale Unannotated Image Datasets | Ta-Ying Cheng et.al. | 2310.19188v1 | null |
2023-10-25 | Open-NeRF: Towards Open Vocabulary NeRF Decomposition | Hao Zhang et.al. | 2310.16383v1 | null |
2023-10-23 | Novel-View Acoustic Synthesis from 3D Reconstructed Rooms | Byeongjoo Ahn et.al. | 2310.15130v1 | link |
2023-10-23 | Interaction-Driven Active 3D Reconstruction with Object Interiors | Zihao Yan et.al. | 2310.14700v1 | null |
2023-10-23 | VQ-NeRF: Vector Quantization Enhances Implicit Neural Representations | Yiying Yang et.al. | 2310.14487v1 | null |
2023-10-22 | A Quantitative Evaluation of Dense 3D Reconstruction of Sinus Anatomy from Monocular Endoscopic Video | Jan Emily Mangulabnan et.al. | 2310.14364v1 | null |
2023-10-20 | Single-view 3D reconstruction via inverse procedural modeling | Albert Garifullin et.al. | 2310.13373v1 | null |
2023-10-20 | UE4-NeRF:Neural Radiance Field for Real-Time Rendering of Large-Scale Scene | Jiaming Gu et.al. | 2310.13263v1 | null |
2023-10-19 | Real space iterative reconstruction for vector tomography (RESIRE-V) | Minh Pham et.al. | 2310.12513v1 | link |
2023-10-18 | ShapeGraFormer: GraFormer-Based Network for Hand-Object Reconstruction from a Single Depth Map | Ahmed Tawfik Aboukhadra et.al. | 2310.11811v1 | null |
2023-10-17 | Learning Neural Implicit through Volume Rendering with Attentive Depth Fusion Priors | Pengchong Hu et.al. | 2310.11598v1 | null |
2023-10-17 | Field Robot for High-throughput and High-resolution 3D Plant Phenotyping | Felix Esser et.al. | 2310.11516v1 | null |
2023-10-16 | In-Situ Single Particle Reconstruction Reveals 3D Evolution of PtNi Nanocatalysts During Heating | Yi-Chi Wang et.al. | 2310.10253v1 | null |
2023-10-15 | Tabletop Transparent Scene Reconstruction via Epipolar-Guided Optical Flow with Monocular Depth Completion Prior | Xiaotong Chen et.al. | 2310.09956v1 | null |
2023-10-15 | CBARF: Cascaded Bundle-Adjusting Neural Radiance Fields from Imperfect Camera Poses | Hongyu Fu et.al. | 2310.09776v1 | null |
2023-10-12 | Implicit Shape and Appearance Priors for Few-Shot Full Head Reconstruction | Pol Caselles et.al. | 2310.08784v1 | null |
2023-10-13 | PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm | Haoyi Zhu et.al. | 2310.08586v2 | link |
2023-10-12 | Consistent123: Improve Consistency for One Image to 3D Object Synthesis | Haohan Weng et.al. | 2310.08092v1 | null |
2023-10-10 | SketchBodyNet: A Sketch-Driven Multi-faceted Decoder Network for 3D Human Reconstruction | Fei Wang et.al. | 2310.06577v1 | link |
2023-10-08 | Experiences with CAMRE: Single-Device Collaborative Adaptive Mixed Reality Environment | Hung-Jui Guo et.al. | 2310.04996v1 | null |
2023-10-02 | PC-NeRF: Parent-Child Neural Radiance Fields under Partial Sensor Data Loss in Autonomous Driving Environments | Xiuzhong Hu et.al. | 2310.00874v1 | link |
2023-10-01 | Enabling Neural Radiance Fields (NeRF) for Large-scale Aerial Images -- A Multi-tiling Approaching and the Geometry Assessment of NeRF | Ningli Xu et.al. | 2310.00530v1 | null |
2023-09-29 | 3D Reconstruction in Noisy Agricultural Environments: A Bayesian Optimization Perspective for View Planning | Athanasios Bacharis et.al. | 2310.00145v1 | null |
2023-09-29 | Effect of structure-based training on 3D localization precision and quality | Armin Abdehkakha et.al. | 2309.17265v1 | null |
2023-09-28 | Sketch2CADScript: 3D Scene Reconstruction from 2D Sketch using Visual Transformer and Rhino Grasshopper | Hong-Bin Yang et.al. | 2309.16850v1 | null |
2023-09-29 | 3D Reconstruction with Generalizable Neural Fields using Scene Priors | Yang Fu et.al. | 2309.15164v2 | null |
2023-09-26 | Combining optical diffraction tomography with imaging flow cytometry for characterizing morphology, hemoglobin content, and membrane deformability of live red blood cells | Yu-Hsiang Chang et.al. | 2309.15131v1 | null |
2023-09-26 | PHRIT: Parametric Hand Representation with Implicit Template | Zhisheng Huang et.al. | 2309.14916v1 | null |
2023-09-26 | Unsupervised Reconstruction of 3D Human Pose Interactions From 2D Poses Alone | Peter Hardy et.al. | 2309.14865v1 | null |
2023-09-26 | 3D Density-Gradient based Edge Detection on Neural Radiance Fields (NeRFs) for Geometric Reconstruction | Miriam Jäger et.al. | 2309.14800v1 | null |
2023-09-23 | MP-MVS: Multi-Scale Windows PatchMatch and Planar Prior Multi-View Stereo | Rongxuan Tan et.al. | 2309.13294v1 | link |
2023-09-22 | NeRRF: 3D Reconstruction and View Synthesis for Transparent and Specular Objects with Neural Refractive-Reflective Fields | Xiaoxue Chen et.al. | 2309.13039v1 | link |
2023-09-25 | Language-driven Object Fusion into Neural Radiance Fields with Pose-Conditioned Dataset Updates | Ka Chun Shum et.al. | 2309.11281v2 | link |
2023-09-19 | PLVS: A SLAM System with Points, Lines, Volumetric Mapping, and 3D Incremental Segmentation | Luigi Freda et.al. | 2309.10896v1 | link |
2023-09-19 | SHOWMe: Benchmarking Object-agnostic Hand-Object 3D Reconstruction | Anilkumar Swamy et.al. | 2309.10748v1 | null |
2023-09-18 | Improving Neural Indoor Surface Reconstruction with Mask-Guided Adaptive Consistency Constraints | Xinyi Yu et.al. | 2309.09739v1 | null |
2023-09-18 | Robust Geometry-Preserving Depth Estimation Using Differentiable Rendering | Chi Zhang et.al. | 2309.09724v1 | null |
2023-09-17 | Uncertainty-aware 3D Object-Level Mapping with Deep Shape Priors | Ziwei Liao et.al. | 2309.09118v1 | null |
2023-09-13 | Exploiting Multiple Priors for Neural 3D Indoor Reconstruction | Federico Lincetto et.al. | 2309.07021v1 | null |
2023-09-12 | Semantic and Articulated Pedestrian Sensing Onboard a Moving Vehicle | Maria Priisalu et.al. | 2309.06313v1 | null |
2023-09-11 | A survey on real-time 3D scene reconstruction with SLAM methods in embedded systems | Quentin Picard et.al. | 2309.05349v1 | null |
2023-09-07 | A Food Package Recognition and Sorting System Based on Structured Light and Deep Learning | Xuanzhi Liu et.al. | 2309.03704v1 | null |
2023-09-06 | SADIR: Shape-Aware Diffusion Models for 3D Image Reconstruction | Nivetha Jayakumar et.al. | 2309.03335v1 | null |
2023-09-06 | Sparse 3D Reconstruction via Object-Centric Ray Sampling | Llukman Cerkezi et.al. | 2309.03008v1 | link |
2023-09-06 | Multi-log grasping using reinforcement learning and virtual visual servoing | Erik Wallin et.al. | 2309.02997v1 | null |
2023-09-06 | LightNeuS: Neural Surface Reconstruction in Endoscopy using Illumination Decline | Víctor M. Batlle et.al. | 2309.02777v1 | null |
2023-09-05 | GO-SLAM: Global Optimization for Consistent 3D Instant Reconstruction | Youmin Zhang et.al. | 2309.02436v1 | link |
2023-09-05 | Doppelgangers: Learning to Disambiguate Images of Similar Structures | Ruojin Cai et.al. | 2309.02420v1 | link |
2023-09-05 | TiAVox: Time-aware Attenuation Voxels for Sparse-view 4D DSA Reconstruction | Zhenghong Zhou et.al. | 2309.02318v1 | null |
2023-09-05 | Iterative Superquadric Recomposition of 3D Objects from Multiple Views | Stephan Alaniz et.al. | 2309.02102v1 | link |
2023-09-01 | Dense Voxel 3D Reconstruction Using a Monocular Event Camera | Haodong Chen et.al. | 2309.00385v1 | null |
2023-08-24 | Improving NeRF Quality by Progressive Camera Placement for Unrestricted Navigation in Complex Environments | Georgios Kopanas et.al. | 2309.00014v1 | null |
2023-08-29 | Intensity correlation holography for remote phase sensing and 3D imaging | Guillaume Thekkadath et.al. | 2308.15619v1 | null |
2023-08-28 | R3D3: Dense 3D Reconstruction of Dynamic Scenes from Multiple Cameras | Aron Schmied et.al. | 2308.14713v1 | null |
2023-08-27 | Sparse3D: Distilling Multiview-Consistent Diffusion for Object Reconstruction from Sparse Views | Zi-Xin Zou et.al. | 2308.14078v1 | null |
2023-08-26 | HoloPOCUS: Portable Mixed-Reality 3D Ultrasound Tracking, Reconstruction and Overlay | Kian Wei Ng et.al. | 2308.13823v1 | null |
2023-08-25 | Textureless Deformable Surface Reconstruction with Invisible Markers | Xinyuan Li et.al. | 2308.13678v1 | null |
2023-08-23 | ARF-Plus: Controlling Perceptual Factors in Artistic Radiance Fields for 3D Scene Stylization | Wenzhao Li et.al. | 2308.12452v1 | null |
2023-08-21 | Coordinate Quantized Neural Implicit Representations for Multi-view Reconstruction | Sijia Jiang et.al. | 2308.11025v1 | link |
2023-08-19 | Root Pose Decomposition Towards Generic Non-rigid 3D Reconstruction with Monocular Videos | Yikai Wang et.al. | 2308.10089v1 | null |
2023-08-19 | TSAR-MVS: Textureless-aware Segmentation and Correlative Refinement Guided Multi-View Stereo | Zhenlong Yuan et.al. | 2308.09990v1 | null |
2023-08-19 | A Theory of Topological Derivatives for Inverse Rendering of Geometry | Ishit Mehta et.al. | 2308.09865v1 | null |
2023-08-18 | O^2-Recon: Completing 3D Reconstruction of Occluded Objects in the Scene with a Pre-trained 2D Diffusion Model | Yubin Hu et.al. | 2308.09591v1 | link |
2023-08-17 | A Fusion of Variational Distribution Priors and Saliency Map Replay for Continual 3D Reconstruction | Sanchar Palit et.al. | 2308.08812v1 | null |
2023-08-17 | Long-Range Grouping Transformer for Multi-View 3D Reconstruction | Liying Yang et.al. | 2308.08724v1 | link |
2023-08-16 | DeDoDe: Detect, Don't Describe -- Describe, Don't Detect for Local Feature Matching | Johan Edstedt et.al. | 2308.08479v1 | link |
2023-08-17 | ObjectSDF++: Improved Object-Compositional Neural Implicit Surfaces | Qianyi Wu et.al. | 2308.07868v2 | link |
2023-08-15 | CCD-3DR: Consistent Conditioning in Diffusion for Single-Image 3D Reconstruction | Yan Di et.al. | 2308.07837v1 | null |
2023-08-15 | Multi-view 3D Face Reconstruction Based on Flame | Wenzhuo Zheng et.al. | 2308.07551v1 | null |
2023-08-14 | A One Stop 3D Target Reconstruction and multilevel Segmentation Method | Jiexiong Xu et.al. | 2308.06974v1 | link |
2023-08-11 | Efficient Large-scale AUV-based Visual Seafloor Mapping | Mengkun She et.al. | 2308.06147v1 | null |
2023-08-10 | PlankAssembly: Robust 3D Reconstruction from Three Orthographic Views with Learnt Shape Programs | Wentao Hu et.al. | 2308.05744v1 | link |
2023-08-10 | HGDNet: A Height-Hierarchy Guided Dual-Decoder Network for Single View Building Extraction and Height Estimation | Chaoran Lu et.al. | 2308.05387v1 | null |
2023-08-07 | Learning Photometric Feature Transform for Free-form Object Scan | Xiang Feng et.al. | 2308.03492v1 | null |
2023-08-04 | Reconstructing Three-Dimensional Models of Interacting Humans | Mihai Fieraru et.al. | 2308.01854v2 | link |
2023-08-02 | HANDAL: A Dataset of Real-World Manipulable Object Categories with Pose Annotations, Affordances, and Reconstructions | Andrew Guo et.al. | 2308.01477v1 | null |
2023-08-15 | Tirtha -- An Automated Platform to Crowdsource Images and Create 3D Models of Heritage Sites | Jyotirmaya Shivottam et.al. | 2308.01246v2 | link |
2023-08-02 | Stereo Visual Odometry with Deep Learning-Based Point and Line Feature Matching using an Attention Graph Neural Network | Shenbagaraj Kannapiran et.al. | 2308.01125v1 | null |
2023-08-01 | Body Knowledge and Uncertainty Modeling for Monocular 3D Human Body Reconstruction | Yufei Zhang et.al. | 2308.00799v1 | null |
2023-07-31 | Onboard View Planning of a Flying Camera for High Fidelity 3D Reconstruction of a Moving Actor | Qingyuan Jiang et.al. | 2308.00134v1 | link |
2023-07-21 | Autonomous Electron Tomography Reconstruction with Machine Learning | William Millsaps et.al. | 2308.00099v1 | null |
2023-07-31 | Towards Head Computed Tomography Image Reconstruction Standardization with Deep Learning Assisted Automatic Detection | Bowen Zheng et.al. | 2307.16440v1 | null |
2023-07-27 | FS-Depth: Focal-and-Scale Depth Estimation from a Single Image in Unseen Indoor Scene | Chengrui Wei et.al. | 2307.14624v1 | null |
2023-07-27 | Physically Plausible 3D Human-Scene Reconstruction from Monocular RGB Image using an Adversarial Learning Approach | Sandika Biswas et.al. | 2307.14570v1 | null |
2023-07-27 | Creative Birds: Self-Supervised Single-View 3D Style Transfer | Renke Wang et.al. | 2307.14127v2 | link |
2023-07-24 | CarPatch: A Synthetic Benchmark for Radiance Field Evaluation on Vehicle Components | Davide Di Nucci et.al. | 2307.12718v1 | null |
2023-07-24 | VIRD: Immersive Match Video Analysis for High-Performance Badminton Coaching | Tica Lin et.al. | 2307.12539v1 | link |
2023-07-23 | LIST: Learning Implicitly from Spatial Transformers for Single-View 3D Reconstruction | Mohammad Samiul Arshad et.al. | 2307.12194v1 | link |
2023-07-22 | Replay: Multi-modal Multi-view Acted Videos for Casual Holography | Roman Shapovalov et.al. | 2307.12067v1 | link |
2023-07-20 | SimCol3D -- 3D Reconstruction during Colonoscopy Challenge | Anita Rau et.al. | 2307.11261v1 | link |
2023-07-14 | Transient Neural Radiance Fields for Lidar View Synthesis and 3D Reconstruction | Anagh Malik et.al. | 2307.09555v1 | null |
2023-07-18 | NU-MCC: Multiview Compressive Coding with Neighborhood Decoder and Repulsive UDF | Stefan Lionar et.al. | 2307.09112v1 | link |
2023-07-16 | Enforcing Topological Interaction between Implicit Surfaces via Uniform Sampling | Hieu Le et.al. | 2307.08716v1 | null |
2023-07-13 | Bag of Views: An Appearance-based Approach to Next-Best-View Planning for 3D Reconstruction | Sara Hatami Gazani et.al. | 2307.05832v2 | link |
2023-07-11 | 3D detection of roof sections from a single satellite image and application to LOD2-building reconstruction | Johann Lussange et.al. | 2307.05409v1 | null |
2023-07-08 | MAP-NBV: Multi-agent Prediction-guided Next-Best-View Planning for Active 3D Object Reconstruction | Harnaik Dhami et.al. | 2307.04004v1 | null |
2023-07-07 | Depth Estimation Analysis of Orthogonally Divergent Fisheye Cameras with Distortion Removal | Matvei Panteleev et.al. | 2307.03602v1 | null |
2023-07-07 | RGB-D Mapping and Tracking in a Plenoxel Radiance Field | Andreas L. Teigen et.al. | 2307.03404v1 | link |
2023-07-04 | User-Friendly Safety Monitoring System for Manufacturing Cobots | Ye-Ji Mun et.al. | 2307.01886v1 | null |
2023-06-29 | One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization | Minghua Liu et.al. | 2306.16928v1 | link |
2023-06-23 | LightGlue: Local Feature Matching at Light Speed | Philipp Lindenberger et.al. | 2306.13643v1 | link |
2023-06-24 | 3D Reconstruction of Spherical Images based on Incremental Structure from Motion | San Jiang et.al. | 2306.12770v2 | link |
2023-06-26 | Infinite Photorealistic Worlds using Procedural Generation | Alexander Raistrick et.al. | 2306.09310v2 | null |
2023-06-15 | NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations | Varun Jampani et.al. | 2306.09109v1 | link |
2023-06-15 | Enhancing Neural Rendering Methods with Image Augmentations | Juan C. Pérez et.al. | 2306.08904v1 | null |
2023-06-14 | Learning to Predict Scene-Level Implicit 3D from Posed RGBD Data | Nilesh Kulkarni et.al. | 2306.08671v1 | null |
2023-06-13 | Viewset Diffusion: (0-)Image-Conditioned 3D Generative Models from 2D Data | Stanislaw Szymanowicz et.al. | 2306.07881v1 | null |
2023-06-12 | Reconstructing Heterogeneous Cryo-EM Molecular Structures by Decomposing Them into Polymer Chains | Bongjin Koo et.al. | 2306.07274v1 | null |
2023-06-10 | 3D reconstruction using Structure for Motion | Kshitij Karnawat et.al. | 2306.06360v1 | link |
2023-06-15 | NERFBK: A High-Quality Benchmark for NERF-Based 3D Reconstruction | Ali Karami et.al. | 2306.06300v2 | link |
2023-06-12 | Neural Haircut: Prior-Guided Strand-Based Hair Reconstruction | Vanessa Sklyarova et.al. | 2306.05872v2 | link |
2023-06-08 | 2D Supervised Monocular 3D Object Detection by Global-to-Local 3D Reconstruction | Jiawei He et.al. | 2306.05418v1 | null |
2023-06-08 | Enhance-NeRF: Multiple Performance Evaluation for Neural Radiance Fields | Qianqiu Tan et.al. | 2306.05303v1 | link |
2023-06-07 | BU-CVKit: Extendable Computer Vision Framework for Species Independent Tracking and Analysis | Mahir Patel et.al. | 2306.04736v1 | null |
2023-06-09 | DiViNeT: 3D Reconstruction from Disparate Views via Neural Template Regularization | Aditya Vora et.al. | 2306.04699v2 | null |
2023-06-05 | BeyondPixels: A Comprehensive Review of the Evolution of Neural Radiance Fields | AKM Shahariar Azad Rabby et.al. | 2306.03000v1 | null |
2023-06-05 | Single-Stage 3D Geometry-Preserving Depth Estimation Model Training on Dataset Mixtures with Uncalibrated Stereo Data | Nikolay Patakin et.al. | 2306.02878v1 | null |
2023-06-05 | Computational 3D topographic microscopy from terabytes of data per sample | Kevin C. Zhou et.al. | 2306.02634v1 | null |
2023-06-08 | Adaptive Robotic Information Gathering via Non-Stationary Gaussian Processes | Weizhe Chen et.al. | 2306.01263v2 | link |
2023-06-01 | BUOL: A Bottom-Up Framework with Occupancy-aware Lifting for Panoptic 3D Scene Reconstruction From A Single Image | Tao Chu et.al. | 2306.00965v1 | link |
2023-05-31 | Humans in 4D: Reconstructing and Tracking Humans with Transformers | Shubham Goel et.al. | 2305.20091v1 | link |
2023-05-30 | Template-free Articulated Neural Point Clouds for Reposable View Synthesis | Lukas Uzolas et.al. | 2305.19065v1 | link |
2023-05-29 | Synfeal: A Data-Driven Simulator for End-to-End Camera Localization | Daniel Coelho et.al. | 2305.18260v1 | link |
2023-06-04 | VoxDet: Voxel Learning for Novel Instance Detection | Bowen Li et.al. | 2305.17220v3 | link |
2023-05-25 | Look Ma, No Hands! Agent-Environment Factorization of Egocentric Videos | Matthew Chang et.al. | 2305.16301v1 | null |
2023-05-25 | Domain-Adaptive Full-Face Gaze Estimation via Novel-View-Synthesis and Feature Disentanglement | Jiawei Qin et.al. | 2305.16140v1 | null |
2023-05-25 | Robust Category-Level 3D Pose Estimation from Synthetic Data | Jiahao Yang et.al. | 2305.16124v1 | null |
2023-05-25 | T2TD: Text-3D Generation Model based on Prior Knowledge Guidance | Weizhi Nie et.al. | 2305.15753v1 | null |
2023-05-23 | Cross3DVG: Baseline and Dataset for Cross-Dataset 3D Visual Grounding on Different RGB-D Scans | Taiki Miyanishi et.al. | 2305.13876v1 | link |
2023-05-22 | A three-dimensional MR-STAT protocol for high-resolution multi-parametric quantitative MRI | Hongyan Liu et.al. | 2305.13022v1 | null |
2023-05-29 | Chupa: Carving 3D Clothed Humans from Skinned Shape Priors using 2D Diffusion Probabilistic Models | Byungjun Kim et.al. | 2305.11870v2 | link |
2023-05-19 | Text2NeRF: Text-Driven 3D Scene Generation with Neural Radiance Fields | Jingbo Zhang et.al. | 2305.11588v1 | link |
2023-05-19 | RGB-D And Thermal Sensor Fusion: A Systematic Literature Review | Martin Brenner et.al. | 2305.11427v1 | null |
2023-05-18 | Progressive Learning of 3D Reconstruction Network from 2D GAN Data | Aysegul Dundar et.al. | 2305.11102v1 | null |
2023-05-18 | ConsistentNeRF: Enhancing Neural Radiance Fields with 3D Consistency for Sparse View Synthesis | Shoukang Hu et.al. | 2305.11031v1 | link |
2023-05-17 | Colonoscopy Coverage Revisited: Identifying Scanning Gaps in Real-Time | G. Leifman et.al. | 2305.10026v1 | null |
2023-05-15 | AutoRecon: Automated 3D Object Discovery and Reconstruction | Yuang Wang et.al. | 2305.08810v1 | null |
2023-05-11 | Towards a Better Understanding of the Computer Vision Research Community in Africa | Abdul-Hakeem Omotayo et.al. | 2305.06773v1 | null |
2023-05-10 | Scan2LoD3: Reconstructing semantic 3D building models at LoD3 using ray casting and Bayesian networks | Olaf Wysocki et.al. | 2305.06314v1 | null |
2023-05-08 | RelPose++: Recovering 6D Poses from Sparse-view Observations | Amy Lin et.al. | 2305.04926v1 | link |
2023-05-04 | UrbanBIS: a Large-scale Benchmark for Fine-grained Urban Building Instance Segmentation | Guoqing Yang et.al. | 2305.02627v1 | null |
2023-05-03 | Biological Hotspot Mapping in Coral Reefs with Robotic Visual Surveys | Daniel Yang et.al. | 2305.02330v1 | link |
2023-04-30 | Second-order Anisotropic Gaussian Directional Derivative Filters for Blob Detection | Jie Ren et.al. | 2305.00435v1 | null |
2023-04-29 | NSLF-OL: Online Learning of Neural Surface Light Fields alongside Real-time Incremental 3D Reconstruction | Yijun Yuan et.al. | 2305.00282v1 | null |
2023-04-23 | UHRNet: A Deep Learning-Based Method for Accurate 3D Reconstruction from a Single Fringe-Pattern | Yixiao Wang et.al. | 2304.14503v1 | link |
2023-04-27 | Learning Articulated Shape with Keypoint Pseudo-labels from Web Images | Anastasis Stathopoulos et.al. | 2304.14396v1 | null |
2023-05-03 | Combining HoloLens with Instant-NeRFs: Advanced Real-Time 3D Mobile Mapping | Dennis Haitz et.al. | 2304.14301v2 | null |
2023-04-25 | Shape-Net: Room Layout Estimation from Panoramic Images Robust to Occlusion using Knowledge Distillation with 3D Shapes as Additional Inputs | Mizuki Tabata et.al. | 2304.12624v1 | null |
2023-04-24 | Instant-3D: Instant Neural Radiance Field Training Towards On-Device AR/VR 3D Reconstruction | Sixu Li et.al. | 2304.12467v1 | null |
2023-04-24 | Unsupervised Style-based Explicit 3D Face Reconstruction from Single Image | Heng Yu et.al. | 2304.12455v1 | null |
2023-04-24 | gSDF: Geometry-Driven Signed Distance Functions for 3D Hand-Object Reconstruction | Zerui Chen et.al. | 2304.11970v1 | null |
2023-04-24 | Learning Visibility Field for Detailed 3D Human Reconstruction and Relighting | Ruichen Zheng et.al. | 2304.11900v1 | null |
2023-04-24 | NoiseTrans: Point Cloud Denoising with Transformers | Guangzhe Hou et.al. | 2304.11812v1 | null |
2023-04-20 | A Comparative Neural Radiance Field (NeRF) 3D Analysis of Camera Poses from HoloLens Trajectories and Structure from Motion | Miriam Jäger et.al. | 2304.10664v1 | null |
2023-04-20 | Reconstructing Signing Avatars From Video Using Linguistic Priors | Maria-Paola Forte et.al. | 2304.10482v1 | null |
2023-04-19 | Anything-3D: Towards Single-view Anything Reconstruction in the Wild | Qiuhong Shen et.al. | 2304.10261v1 | link |
2023-04-20 | A geometry-aware deep network for depth estimation in monocular endoscopy | Yongming Yang et.al. | 2304.10241v1 | link |
2023-04-19 | Tetra-NeRF: Representing Neural Radiance Fields Using Tetrahedra | Jonas Kulhanek et.al. | 2304.09987v1 | link |
2023-04-20 | Single-View View Synthesis with Self-Rectified Pseudo-Stereo | Yang Zhou et.al. | 2304.09527v2 | null |
2023-04-19 | 3 Dimensional Dense Reconstruction: A Review of Algorithms and Dataset | Yangming Li et.al. | 2304.09371v1 | null |
2023-04-18 | SurfelNeRF: Neural Surfel Radiance Fields for Online Photorealistic Reconstruction of Indoor Scenes | Yiming Gao et.al. | 2304.08971v1 | null |
2023-04-17 | Learning How To Robustly Estimate Camera Pose in Endoscopic Videos | Michel Hayoz et.al. | 2304.08023v1 | link |
2023-04-15 | Temporally Consistent Online Depth Estimation Using Point-Based Fusion | Numair Khan et.al. | 2304.07435v1 | link |
2023-04-17 | Single-Stage Diffusion NeRF: A Unified Approach to 3D Generation and Reconstruction | Hansheng Chen et.al. | 2304.06714v2 | link |
2023-04-12 | SiLK -- Simple Learned Keypoints | Pierre Gleize et.al. | 2304.06194v1 | link |
2023-04-12 | Dynamic Voxel Grid Optimization for High-Fidelity RGB-D Supervised Surface Reconstruction | Xiangyu Xu et.al. | 2304.06178v1 | null |
2023-04-11 | EvAC3D: From Event-based Apparent Contours to 3D Models via Continuous Visual Hulls | Ziyun Wang et.al. | 2304.05296v1 | link |
2023-04-10 | Neural Lens Modeling | Wenqi Xian et.al. | 2304.04848v1 | null |
2023-04-10 | Evaluate Geometry of Radiance Field with Low-frequency Color Prior | Qihang Fang et.al. | 2304.04351v1 | link |
2023-04-11 | Analysis of Sampling Strategies for Implicit 3D Reconstruction | Q. Liu et.al. | 2304.03999v2 | null |
2023-04-08 | 3D GANs and Latent Space: A comprehensive survey | Satya Pratheek Tata et.al. | 2304.03932v1 | null |
2023-04-08 | Photometric Correction for Infrared Sensors | Jincheng Zhang et.al. | 2304.03930v1 | null |
2023-04-07 | ALIKED: A Lighter Keypoint and Descriptor Extraction Network via Deformable Transformation | Xiaoming Zhao et.al. | 2304.03608v1 | link |
2023-04-06 | Neural Fields meet Explicit Geometric Representation for Inverse Rendering of Urban Scenes | Zian Wang et.al. | 2304.03266v1 | null |
2023-04-06 | DeLiRa: Self-Supervised Depth, Light, and Radiance Fields | Vitor Guizilini et.al. | 2304.02797v1 | null |
2023-04-05 | Image Stabilization for Hololens Camera in Remote Collaboration | Gowtham Senthil et.al. | 2304.02736v1 | null |
2023-04-05 | Real-Time Dense 3D Mapping of Underwater Environments | Weihan Wang et.al. | 2304.02704v1 | link |
2023-04-04 | USTC FLICAR: A Multisensor Fusion Dataset of LiDAR-Inertial-Camera for Heavy-duty Autonomous Aerial Work Robots | Ziming Wang et.al. | 2304.01986v1 | null |
2023-04-04 | End-to-End Latency Optimization of Multi-view 3D Reconstruction for Disaster Response | Xiaojie Zhang et.al. | 2304.01488v1 | null |
2023-04-04 | FineRecon: Depth-aware Feed-forward Network for Detailed 3D Reconstruction | Noah Stier et.al. | 2304.01480v1 | link |
2023-04-03 | One-Shot View Planning for Fast and Complete Unknown Object Reconstruction | Sicong Pan et.al. | 2304.00910v1 | link |
2023-03-31 | LivePose: Online 3D Reconstruction from Monocular Video with Dynamic Camera Poses | Noah Stier et.al. | 2304.00054v1 | link |
2023-04-03 | Three-dimensional coherent diffraction snapshot imaging using extreme ultraviolet radiation from a free electron laser | Danny Fainozzi et.al. | 2303.18166v2 | null |
2023-03-30 | Enhanced Stable View Synthesis | Nishant Jain et.al. | 2303.17094v1 | null |
2023-03-29 | AirLine: Efficient Learnable Line Detection with Local Edge Voting | Xiao Lin et.al. | 2303.16500v1 | link |
2023-03-29 | Multi-View Azimuth Stereo via Tangent Space Consistency | Xu Cao et.al. | 2303.16447v1 | link |
2023-03-27 | NeUDF: Learning Unsigned Distance Fields from Multi-view Images for Reconstructing Non-watertight Models | Fei Hou et.al. | 2303.15368v1 | null |
2023-03-27 | TMO: Textured Mesh Acquisition of Objects with a Mobile Device by using Differentiable Rendering | Jaehoon Choi et.al. | 2303.15060v1 | null |
2023-03-26 | Clean-NeRF: Reformulating NeRF to account for View-Dependent Observations | Xinhang Liu et.al. | 2303.14707v1 | null |
2023-03-25 | PAniC-3D: Stylized Single-view 3D Reconstruction from Portraits of Anime Characters | Shuhong Chen et.al. | 2303.14587v1 | link |
2023-03-25 | LPFF: A Portrait Dataset for Face Generators Across Large Poses | Yiqian Wu et.al. | 2303.14407v1 | null |
2023-03-24 | BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects | Bowen Wen et.al. | 2303.14158v1 | link |
2023-03-24 | Deformable Model Driven Neural Rendering for High-fidelity 3D Reconstruction of Human Heads Under Low-View Settings | Baixin Xu et.al. | 2303.13855v1 | link |
2023-03-24 | Seeing Through the Glass: Neural 3D Reconstruction of Object Inside a Transparent Container | Jinguang Tong et.al. | 2303.13805v1 | link |
2023-03-23 | SCADE: NeRFs from Space Carving with Ambiguity-Aware Depth Estimates | Mikaela Angelina Uy et.al. | 2303.13582v1 | null |
2023-03-21 | Real-time volumetric rendering of dynamic humans | Ignacio Rocco et.al. | 2303.11898v1 | null |
2023-03-20 | Zero-1-to-3: Zero-shot One Image to 3D Object | Ruoshi Liu et.al. | 2303.11328v1 | link |
2023-03-20 | DIME-Net: Neural Network-Based Dynamic Intrinsic Parameter Rectification for Cameras with Optical Image Stabilization System | Shu-Hao Yeh et.al. | 2303.11307v1 | null |
2023-03-20 | Ref-NeuS: Ambiguity-Reduced Neural Implicit Surface Learning for Multi-View Reconstruction with Reflection | Wenhang Ge et.al. | 2303.10840v1 | link |
2023-03-14 | FingerSLAM: Closed-loop Unknown Object Localization and Reconstruction from Visuo-tactile Feedback | Jialiang Zhao et.al. | 2303.07997v1 | null |
2023-03-11 | Normal-guided Garment UV Prediction for Human Re-texturing | Yasamin Jafarian et.al. | 2303.06504v1 | null |
2023-03-11 | Just Flip: Flipped Observation Generation and Optimization for Neural Radiance Fields to Cover Unobserved View | Minjae Lee et.al. | 2303.06335v1 | link |
2023-03-10 | ACR: Attention Collaboration-based Regressor for Arbitrary Two-Hand Reconstruction | Zhengdi Yu et.al. | 2303.05938v1 | link |
2023-03-10 | Structural Multiplane Image: Bridging Neural View Synthesis and 3D Reconstruction | Mingfang Zhang et.al. | 2303.05937v1 | null |
2023-03-08 | FastSurf: Fast Neural RGB-D Surface Reconstruction using Per-Frame Intrinsic Refinement and TSDF Fusion Prior Learning | Seunghwan Lee et.al. | 2303.04508v1 | link |
2023-03-08 | Corner Detection Based on Multi-directional Gabor Filters with Multi-scales | Huaqing Wang et.al. | 2303.04334v1 | null |
2023-03-08 | DroNeRF: Real-time Multi-agent Drone Pose Optimization for Computing Neural Radiance Fields | Dipam Patel et.al. | 2303.04322v1 | null |
2023-03-07 | Proactive Multi-Camera Collaboration For 3D Human Pose Estimation | Hai Ci et.al. | 2303.03767v1 | null |
2023-03-06 | System for 3D Acquisition and 3D Reconstruction using Structured Light for Sewer Line Inspection | Johannes Künzel et.al. | 2303.02978v1 | null |
2023-03-03 | Delicate Textured Mesh Recovery from NeRF via Adaptive Surface Refinement | Jiaxiang Tang et.al. | 2303.02091v1 | link |
2023-03-09 | MobileBrick: Building LEGO for 3D Reconstruction on Mobile Devices | Kejie Li et.al. | 2303.01932v2 | link |
2023-03-01 | Motion Compensation via Epipolar Consistency for In-Vivo X-Ray Microscopy | Mareike Thies et.al. | 2303.00449v1 | null |
2023-02-28 | 3D Coronary Vessel Reconstruction from Bi-Plane Angiography using Graph Convolutional Networks | Kit Mills Bransby et.al. | 2302.14795v1 | null |
2023-02-28 | Mask3D: Pre-training 2D Vision Transformers by Learning Masked 3D Priors | Ji Hou et.al. | 2302.14746v1 | null |
2023-02-27 | UMIFormer: Mining the Correlations between Similar Tokens for Multi-View 3D Reconstruction | Zhenwei Zhu et.al. | 2302.13987v1 | link |
2023-02-26 | Perceiving Unseen 3D Objects by Poking the Objects | Linghao Chen et.al. | 2302.13375v1 | null |
2023-02-25 | SUPS: A Simulated Underground Parking Scenario Dataset for Autonomous Driving | Jiawei Hou et.al. | 2302.12966v1 | link |
2023-02-24 | 3D Surface Reconstruction in the Wild by Deforming Shape Priors from Synthetic Data | Nicolai Häni et.al. | 2302.12883v1 | null |
2023-02-23 | View Consistency Aware Holistic Triangulation for 3D Human Pose Estimation | Xiaoyue Wan et.al. | 2302.11301v2 | null |
2023-02-23 | Luke Melas-Kyriazi et.al. | 2302.10668v2 | link | |
2023-02-23 | RealFusion: 360° Reconstruction of Any Object from a Single Image | Luke Melas-Kyriazi et.al. | 2302.10663v2 | null |
2023-02-20 | UAVStereo: A Multiple Resolution Dataset for Stereo Matching in UAV Scenarios | Zhang Xiaoyi et.al. | 2302.10082v1 | link |
2023-02-14 | HR-NeuS: Recovering High-Frequency Surface Geometry via Neural Implicit Surfaces | Erich Liang et.al. | 2302.06793v1 | null |
2023-02-14 | Boosted ab initio Cryo-EM 3D Reconstruction with ACE-EM | Lin Yao et.al. | 2302.06091v2 | null |
2023-02-11 | 3D Colored Shape Reconstruction from a Single RGB Image through Diffusion | Bo Li et.al. | 2302.05573v1 | null |
2023-02-09 | 3D reconstruction of spherical images: A review of techniques, applications, and prospects | San Jiang et.al. | 2302.04495v1 | null |
2023-02-09 | PredRecon: A Prediction-boosted Planning Framework for Fast and High-quality Autonomous Aerial Reconstruction | Chen Feng et.al. | 2302.04488v1 | link |
2023-02-07 | S4R: Self-Supervised Semantic Scene Reconstruction from RGB-D Scans | Junwen Huang et.al. | 2302.03640v1 | null |
2023-01-30 | Mono-STAR: Mono-camera Scene-level Tracking and Reconstruction | Haonan Chang et.al. | 2301.13244v1 | link |
2023-01-27 | A Comparison of Tiny-nerf versus Spatial Representations for 3d Reconstruction | Saulo Abraham Gante et.al. | 2301.11522v1 | null |
2023-01-25 | Local Feature Extraction from Salient Regions by Feature Map Transformation | Yerim Jung et.al. | 2301.10413v1 | null |
2023-02-02 | 3D Reconstruction of Non-cooperative Resident Space Objects using Instant NGP-accelerated NeRF and D-NeRF | Trupti Mahendrakar et.al. | 2301.09060v2 | null |
2023-01-19 | Parallelized computational 3D video microscopy of freely moving organisms at multiple gigapixels per second | Kevin C. Zhou et.al. | 2301.08351v1 | link |
2023-01-19 | Multiview Compressive Coding for 3D Reconstruction | Chao-Yuan Wu et.al. | 2301.08247v1 | link |
2023-01-19 | Regularizing disparity estimation via multi task learning with structured light reconstruction | Alistair Weld et.al. | 2301.08140v1 | null |
2023-01-12 | Edge Preserving Implicit Surface Representation of Point Clouds | Xiaogang Wang et.al. | 2301.04860v1 | null |
2023-01-11 | Elevation Estimation-Driven Building 3D Reconstruction from Single-View Remote Sensing Imagery | Yongqiang Mao et.al. | 2301.04581v1 | null |
2023-01-11 | First 3D reconstruction of a blast furnace using muography | Amélie Cohu et.al. | 2301.04354v1 | null |
2023-01-04 | Towards a Pipeline for Real-Time Visualization of Faces for VR-based Telepresence and Live Broadcasting Utilizing Neural Rendering | Philipp Ladwig et.al. | 2301.01490v1 | link |
2023-01-03 | BS3D: Building-scale 3D Reconstruction from RGB-D Images | Janne Mustaniemi et.al. | 2301.01057v1 | null |
2022-12-31 | Ponder: Point Cloud Pre-training via Neural Rendering | Di Huang et.al. | 2301.00157v1 | null |
2022-12-28 | NeMo: 3D Neural Motion Fields from Multiple Video Instances of the Same Action | Kuan-Chieh Wang et.al. | 2212.13660v1 | link |
2022-12-24 | Polarimetric Multi-View Inverse Rendering | Jinyu Zhao et.al. | 2212.12721v1 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-04-03 | Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction | Keyu Tian et.al. | 2404.02905v1 | link |
2024-04-03 | LidarDM: Generative LiDAR Simulation in a Generated World | Vlas Zyrianov et.al. | 2404.02903v1 | null |
2024-04-03 | DeiT-LT Distillation Strikes Back for Vision Transformer Training on Long-Tailed Datasets | Harsh Rangwani et.al. | 2404.02900v1 | link |
2024-04-03 | MatAtlas: Text-driven Consistent Geometry Texturing and Material Assignment | Duygu Ceylan et.al. | 2404.02899v1 | null |
2024-04-03 | A Mean Field Game Model for Timely Computation in Edge Computing Systems | Shubham Aggarwal et.al. | 2404.02898v1 | null |
2024-04-03 | Deep Image Composition Meets Image Forgery | Eren Tahir et.al. | 2404.02897v1 | link |
2024-04-03 | ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline | Yifan Xu et.al. | 2404.02893v1 | null |
2024-04-03 | PoCo: Point Context Cluster for RGBD Indoor Place Recognition | Jing Liang et.al. | 2404.02885v1 | null |
2024-04-02 | Segment Any 3D Object with Language | Seungjun Lee et.al. | 2404.02157v1 | null |
2024-04-02 | Dynamic Pre-training: Towards Efficient and Scalable All-in-One Image Restoration | Akshay Dudhane et.al. | 2404.02154v1 | null |
2024-04-02 | GeneAvatar: Generic Expression-Aware Volumetric Head Avatar Editing from a Single Image | Chong Bao et.al. | 2404.02152v1 | null |
2024-04-02 | Diffusion |
Zeyu Yang et.al. | 2404.02148v1 | link |
2024-04-02 | Harder, Better, Faster, Stronger: Interactive Visualization for Human-Centered AI Tools | Md Naimul Hoque et.al. | 2404.02147v1 | null |
2024-04-02 | Iterated Learning Improves Compositionality in Large Vision-Language Models | Chenhao Zheng et.al. | 2404.02145v1 | null |
2024-04-02 | Multiparametric quantification and visualization of liver fat using ultrasound | Jihye Baek et.al. | 2404.02143v1 | null |
2024-03-29 | Gecko: Versatile Text Embeddings Distilled from Large Language Models | Jinhyuk Lee et.al. | 2403.20327v1 | null |
2024-03-29 | Shaving Logs via Large Sieve Inequality: Faster Algorithms for Sparse Convolution and More | Ce Jin et.al. | 2403.20326v1 | null |
2024-03-29 | Structure and Dynamics of Magneto-Inertial, Differentially Rotating Laboratory Plasmas | V. Valenzuela-Villaseca et.al. | 2403.20321v1 | null |
2024-03-29 | SeaBird: Segmentation in Bird's View with Dice Loss Improves Monocular 3D Detection of Large Objects | Abhinav Kumar et.al. | 2403.20318v1 | link |
2024-03-29 | Convolutional Prompting meets Language Models for Continual Learning | Anurag Roy et.al. | 2403.20317v1 | null |
2024-03-29 | Optimal Communication for Classic Functions in the Coordinator Model and Beyond | Hossein Esfandiari et.al. | 2403.20307v1 | null |
2024-03-28 | GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative Modeling | Bowen Zhang et.al. | 2403.19655v1 | null |
2024-03-28 | Detecting Image Attribution for Text-to-Image Diffusion Models in RGB and Beyond | Katherine Xu et.al. | 2403.19653v1 | link |
2024-03-28 | InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction | Sirui Xu et.al. | 2403.19652v1 | null |
2024-03-28 | GraspXL: Generating Grasping Motions for Diverse Objects at Scale | Hui Zhang et.al. | 2403.19649v1 | null |
2024-03-28 | Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models | Samuel Marks et.al. | 2403.19647v1 | link |
2024-03-28 | GANTASTIC: GAN-based Transfer of Interpretable Directions for Disentangled Image Editing in Text-to-Image Diffusion Models | Yusuf Dalva et.al. | 2403.19645v1 | null |
2024-03-27 | Real Acoustic Fields: An Audio-Visual Room Acoustics Dataset and Benchmark | Ziyang Chen et.al. | 2403.18821v1 | null |
2024-03-27 | MetaCap: Meta-learning Priors from Multi-View Imagery for Sparse-view Human Performance Capture and Rendering | Guoxing Sun et.al. | 2403.18820v1 | null |
2024-03-27 | ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion | Daniel Winter et.al. | 2403.18818v1 | null |
2024-03-27 | Garment3DGen: 3D Garment Stylization and Texture Generation | Nikolaos Sarafianos et.al. | 2403.18816v1 | null |
2024-03-27 | Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models | Yanwei Li et.al. | 2403.18814v1 | link |
2024-03-27 | Duolando: Follower GPT with Off-Policy Reinforcement Learning for Dance Accompaniment | Li Siyao et.al. | 2403.18811v1 | null |
2024-03-28 | ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation | Suraj Patni et.al. | 2403.18807v2 | link |
2024-03-26 | ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture Synthesis | Muhammad Hamza Mughal et.al. | 2403.17936v1 | null |
2024-03-26 | OmniVid: A Generative Framework for Universal Video Understanding | Junke Wang et.al. | 2403.17935v1 | link |
2024-03-26 | SLEDGE: Synthesizing Simulation Environments for Driving Agents with Generative Models | Kashyap Chitta et.al. | 2403.17933v1 | null |
2024-03-26 | MAGIS: LLM-Based Multi-Agent Framework for GitHub Issue Resolution | Wei Tao et.al. | 2403.17927v1 | null |
2024-03-26 | AID: Attention Interpolation of Text-to-Image Diffusion | Qiyuan He et.al. | 2403.17924v1 | link |
2024-03-26 | The Need for Speed: Pruning Transformers with One Recipe | Samir Khaki et.al. | 2403.17921v1 | link |
2024-03-26 | TC4D: Trajectory-Conditioned Text-to-4D Generation | Sherwin Bahmani et.al. | 2403.17920v1 | null |
2024-03-26 | AgentStudio: A Toolkit for Building General Virtual Agents | Longtao Zheng et.al. | 2403.17918v1 | null |
2024-03-25 | Exploiting Priors from 3D Diffusion Models for RGB-Based One-Shot View Planning | Sicong Pan et.al. | 2403.16803v1 | null |
2024-03-25 | Iterative Refinement of Project-Level Code Context for Precise Code Generation with Compiler Feedback | Zhangqian Bi et.al. | 2403.16792v1 | null |
2024-03-25 | Iso-Diffusion: Improving Diffusion Probabilistic Models Using the Isotropy of the Additive Gaussian Noise | Dilum Fernando et.al. | 2403.16790v1 | null |
2024-03-25 | HPL-ESS: Hybrid Pseudo-Labeling for Unsupervised Event-based Semantic Segmentation | Linglin Jing et.al. | 2403.16788v1 | null |
2024-03-25 | Creating a Digital Twin of Spinal Surgery: A Proof of Concept | Jonas Hein et.al. | 2403.16736v1 | null |
2024-03-25 | Improving Diffusion Models's Data-Corruption Resistance using Scheduled Pseudo-Huber Loss | Artem Khrapov et.al. | 2403.16728v1 | link |
2024-03-22 | DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data | Hanrong Ye et.al. | 2403.15389v1 | null |
2024-03-22 | LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis | Kevin Xie et.al. | 2403.15385v1 | null |
2024-03-22 | ThemeStation: Generating Theme-Aware 3D Assets from Few Exemplars | Zhenwei Wang et.al. | 2403.15383v1 | null |
2024-03-22 | DragAPart: Learning a Part-Level Motion Prior for Articulated Objects | Ruining Li et.al. | 2403.15382v1 | null |
2024-03-22 | Long-CLIP: Unlocking the Long-Text Capability of CLIP | Beichen Zhang et.al. | 2403.15378v1 | link |
2024-03-22 | InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding | Yi Wang et.al. | 2403.15377v1 | link |
2024-03-22 | A Modular, End-to-End Next-Generation Network Testbed: Towards a Fully Automated Network Management Platform | Ali Chouman et.al. | 2403.15376v1 | null |
2024-03-21 | Zero-Shot Multi-Object Shape Completion | Shun Iwase et.al. | 2403.14628v1 | null |
2024-03-21 | MVSplat: Efficient 3D Gaussian Splatting from Sparse Multi-View Images | Yuedong Chen et.al. | 2403.14627v1 | link |
2024-03-21 | Simplified Diffusion Schrödinger Bridge | Zhicong Tang et.al. | 2403.14623v1 | link |
2024-03-21 | GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation | Yinghao Xu et.al. | 2403.14621v1 | link |
2024-03-21 | ClusteringSDF: Self-Organized Neural Implicit Surfaces for 3D Decomposition | Tianhao Wu et.al. | 2403.14619v1 | null |
2024-03-21 | Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion | Xiang Fan et.al. | 2403.14617v1 | null |
2024-03-21 | Hierarchical Text-to-Vision Self Supervised Alignment for Improved Histopathology Representation Learning | Hasindri Watawana et.al. | 2403.14616v1 | link |
2024-03-21 | DreamReward: Text-to-3D Generation with Human Preference | Junliang Ye et.al. | 2403.14613v1 | null |
2024-03-21 | Explorative Inbetweening of Time and Space | Haiwen Feng et.al. | 2403.14611v1 | null |
2024-03-20 | On Pretraining Data Diversity for Self-Supervised Learning | Hasan Abed Al Kader Hammoud et.al. | 2403.13808v1 | link |
2024-03-20 | Editing Massive Concepts in Text-to-Image Diffusion Models | Tianwei Xiong et.al. | 2403.13807v1 | link |
2024-03-20 | Learning from Models and Data for Visual Grounding | Ruozhen He et.al. | 2403.13804v1 | null |
2024-03-20 | Bounding Box Stability against Feature Dropout Reflects Detector Generalization across Environments | Yang Yang et.al. | 2403.13803v1 | link |
2024-03-20 | ZigMa: Zigzag Mamba Diffusion Model | Vincent Tao Hu et.al. | 2403.13802v1 | link |
2024-03-20 | Natural Language as Polices: Reasoning for Coordinate-Level Embodied Control with LLMs | Yusuke Mikami et.al. | 2403.13801v1 | link |
2024-03-20 | TimeRewind: Rewinding Time with Image-and-Events Video Diffusion | Jingxi Chen et.al. | 2403.13800v1 | null |
2024-03-20 | Reverse Training to Nurse the Reversal Curse | Olga Golovneva et.al. | 2403.13799v1 | null |
2024-03-20 | Hierarchical NeuroSymbolic Approach for Action Quality Assessment | Lauren Okamoto et.al. | 2403.13798v1 | null |
2024-03-20 | Bridge the Modality and Capacity Gaps in Vision-Language Model Selection | Chao Yi et.al. | 2403.13797v1 | null |
2024-03-19 | LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression | Zhuoshi Pan et.al. | 2403.12968v1 | link |
2024-03-19 | Wear-Any-Way: Manipulable Virtual Try-on via Sparse Correspondence Alignment | Mengting Chen et.al. | 2403.12965v1 | null |
2024-03-19 | Negative Yields Positive: Unified Dual-Path Adapter for Vision-Language Models | Ce Zhang et.al. | 2403.12964v1 | link |
2024-03-19 | FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis | Linjiang Huang et.al. | 2403.12963v1 | link |
2024-03-19 | TexTile: A Differentiable Metric for Texture Tileability | Carlos Rodriguez-Pardo et.al. | 2403.12961v1 | null |
2024-03-19 | FaceXFormer: A Unified Transformer for Facial Analysis | Kartik Narayan et.al. | 2403.12960v1 | link |
2024-03-19 | GVGEN: Text-to-3D Generation with Volumetric Representation | Xianglong He et.al. | 2403.12957v1 | null |
2024-03-19 | Abiogenesis: a possible quantum interpretation of the telepoietic conjecture | Vittorio Cocchi et.al. | 2403.12955v1 | null |
2024-03-19 | Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization with Vision-Language Models | Elaine Sui et.al. | 2403.12952v1 | link |
2024-03-18 | RIS-aided Single-frequency 3D Imaging by Exploiting Multi-view Image Correlations | Yixuan Huang et.al. | 2403.11764v1 | null |
2024-03-19 | Full-Duplex MU-MIMO Systems with Coarse Quantization: How Many Bits Do We Need? | Seunghyeong Yoo et.al. | 2403.11762v2 | null |
2024-03-18 | Why E.T. Can't Phone Home: A Global View on IP-based Geoblocking at VoWiFi | Gabriel Karl Gegenhuber et.al. | 2403.11759v1 | null |
2024-03-18 | Meta-Prompting for Automating Zero-shot Visual Recognition with LLMs | M. Jehanzeb Mirza et.al. | 2403.11755v1 | link |
2024-03-18 | Asymptotically Optimal Codes for |
Yubo Sun et.al. | 2403.11750v1 | null |
2024-03-18 | Embedded Named Entity Recognition using Probing Classifiers | Nicholas Popovič et.al. | 2403.11747v1 | null |
2024-03-18 | Revisiting Tensor Basis Neural Networks for Reynolds stress modeling: application to plane channel and square duct flows | Jiayi Cai et.al. | 2403.11746v1 | null |
2024-03-18 | Matter and cosmogenesis in Kant's Theory of the Heavens | Garance Benoit et.al. | 2403.11710v1 | null |
2024-03-18 | Significant impact of light-matter strong coupling on chiral nonlinear optical effect | Daichi Okada et.al. | 2403.11709v1 | null |
2024-03-18 | Generalized Multi-Source Inference for Text Conditioned Music Diffusion Models | Emilian Postolache et.al. | 2403.11706v1 | link |
2024-03-18 | Virbo: Multimodal Multilingual Avatar Video Generation in Digital Marketing | Juan Zhang et.al. | 2403.11700v1 | null |
2024-03-18 | Urban Scene Diffusion through Semantic Occupancy Map | Junge Zhang et.al. | 2403.11697v1 | null |
2024-03-18 | Generalization error of spectral algorithms | Maksim Velikanov et.al. | 2403.11696v1 | null |
2024-03-18 | Beamforming Design for Semantic-Bit Coexisting Communication System | Maojun Zhang et.al. | 2403.11693v1 | null |
2024-03-15 | P-MapNet: Far-seeing Map Generator Enhanced by both SDMap and HDMap Priors | Zhou Jiang et.al. | 2403.10521v1 | null |
2024-03-15 | Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives | Ronghui Li et.al. | 2403.10518v1 | link |
2024-03-15 | FeatUp: A Model-Agnostic Framework for Features at Any Resolution | Stephanie Fu et.al. | 2403.10516v1 | link |
2024-03-15 | A Novel Framework for Multi-Person Temporal Gaze Following and Social Gaze Prediction | Anshul Gupta et.al. | 2403.10511v1 | null |
2024-03-15 | Demystifying Faulty Code with LLM: Step-by-Step Reasoning for Explainable Fault Localization | Ratnadira Widyasari et.al. | 2403.10507v1 | null |
2024-03-15 | Belief Change based on Knowledge Measures | Umberto Straccia et.al. | 2403.10502v1 | null |
2024-03-14 | SCP-Diff: Photo-Realistic Semantic Image Synthesis with Spatial-Categorical Joint Prior | Huan-ang Gao et.al. | 2403.09638v1 | null |
2024-03-14 | GaussianGrasper: 3D Language Gaussian Splatting for Open-vocabulary Robotic Grasping | Yuhang Zheng et.al. | 2403.09637v1 | link |
2024-03-14 | Dynamic Memory Compression: Retrofitting LLMs for Accelerated Inference | Piotr Nawrot et.al. | 2403.09636v1 | null |
2024-03-14 | OneTracker: Unifying Visual Object Tracking with Foundation Models and Efficient Tuning | Lingyi Hong et.al. | 2403.09634v1 | null |
2024-03-14 | Holo-Relighting: Controllable Volumetric Portrait Relighting from a Single Image | Yiqun Mei et.al. | 2403.09632v1 | null |
2024-03-14 | 3D-VLA: A 3D Vision-Language-Action Generative World Model | Haoyu Zhen et.al. | 2403.09631v1 | null |
2024-03-14 | Generalized Predictive Model for Autonomous Driving | Jiazhi Yang et.al. | 2403.09630v1 | link |
2024-03-14 | Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking | Eric Zelikman et.al. | 2403.09629v1 | link |
2024-03-14 | Make-Your-3D: Fast and Consistent Subject-Driven 3D Content Generation | Fangfu Liu et.al. | 2403.09625v1 | null |
2024-03-14 | Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering | Zeyu Liu et.al. | 2403.09622v1 | null |
2024-03-13 | FastMAC: Stochastic Spectral Sampling of Correspondence Graph | Yifei Zhang et.al. | 2403.08770v1 | link |
2024-03-13 | VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis | Enric Corona et.al. | 2403.08764v1 | null |
2024-03-13 | A local model for the optical energy and momentum transfer in dielectric media and the microscopic origin of Abraham's force density | B. Anghinoni et.al. | 2403.08752v1 | null |
2024-03-13 | iCONTRA: Toward Thematic Collection Design Via Interactive Concept Transfer | Dinh-Khoi Vo et.al. | 2403.08746v1 | link |
2024-03-12 | Rethinking Generative Large Language Model Evaluation for Semantic Comprehension | Fangyun Wei et.al. | 2403.07872v1 | null |
2024-03-12 | TeleMoMa: A Modular and Versatile Teleoperation System for Mobile Manipulation | Shivin Dass et.al. | 2403.07869v1 | null |
2024-03-12 | Exploring Safety Generalization Challenges of Large Language Models via Code | Qibing Ren et.al. | 2403.07865v1 | null |
2024-03-12 | Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation | Shihao Zhao et.al. | 2403.07860v1 | link |
2024-03-12 | Fairness Feedback Loops: Training on Synthetic Data Amplifies Bias | Sierra Wyllie et.al. | 2403.07857v1 | null |
2024-03-12 | Quantifying and Mitigating Privacy Risks for Tabular Generative Models | Chaoyi Zhu et.al. | 2403.07842v1 | null |
2024-03-11 | A representation-learning game for classes of prediction tasks | Neria Uzan et.al. | 2403.06971v1 | null |
2024-03-11 | The pitfalls of next-token prediction | Gregor Bachmann et.al. | 2403.06963v1 | link |
2024-03-11 | Optimizing Latent Graph Representations of Surgical Scenes for Zero-Shot Domain Transfer | Siddhant Satyanaik et.al. | 2403.06953v1 | null |
2024-03-11 | SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data | Jialu Li et.al. | 2403.06952v1 | null |
2024-03-08 | Tell, Don't Show!: Language Guidance Eases Transfer Across Domains in Images and Videos | Tarun Kalluri et.al. | 2403.05535v1 | null |
2024-03-08 | Tune without Validation: Searching for Learning Rate and Weight Decay on Training Sets | Lorenzo Brigato et.al. | 2403.05532v1 | null |
2024-03-08 | Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context | Machel Reid et.al. | 2403.05530v1 | null |
2024-03-08 | The Computational Complexity of Learning Gaussian Single-Index Models | Alex Damian et.al. | 2403.05529v1 | null |
2024-03-08 | GEAR: An Efficient KV Cache Compression Recipefor Near-Lossless Generative Inference of LLM | Hao Kang et.al. | 2403.05527v1 | link |
2024-03-08 | Beyond Finite Data: Towards Data-free Out-of-distribution Generalization via Extrapola | Yijiang Li et.al. | 2403.05523v1 | null |
2024-03-08 | Bias-Augmented Consistency Training Reduces Biased Reasoning in Chain-of-Thought | James Chua et.al. | 2403.05518v1 | link |
2024-03-07 | BloomGML: Graph Machine Learning through the Lens of Bilevel Optimization | Amber Yijia Zheng et.al. | 2403.04763v1 | link |
2024-03-07 | Lifelong Intelligence Beyond the Edge using Hyperdimensional Computing | Xiaofan Yu et.al. | 2403.04759v1 | link |
2024-03-07 | KnowledgeVIS: Interpreting Language Models by Comparing Fill-in-the-Blank Prompts | Adam Coscia et.al. | 2403.04758v1 | link |
2024-03-07 | Preliminary Guidelines For Combining Data Integration and Visual Data Analysis | Adam Coscia et.al. | 2403.04757v1 | link |
2024-03-07 | Mechanism for Decision-aware Collaborative Federated Learning: A Pitfall of Shapley Values | Meng Qi et.al. | 2403.04753v1 | null |
2024-03-07 | JAX-SPH: A Differentiable Smoothed Particle Hydrodynamics Framework | Artur P. Toshev et.al. | 2403.04750v1 | link |
2024-03-07 | A General Calibrated Regret Metric for Detecting and Mitigating Human-Robot Interaction Failures | Kensuke Nakamura et.al. | 2403.04745v1 | null |
2024-03-06 | Backtracing: Retrieving the Cause of the Query | Rose E. Wang et.al. | 2403.03956v1 | link |
2024-03-06 | 3D Diffusion Policy | Yanjie Ze et.al. | 2403.03954v1 | link |
2024-03-06 | Bridging Language and Items for Retrieval and Recommendation | Yupeng Hou et.al. | 2403.03952v1 | link |
2024-03-06 | Can Audio Reveal Music Performance Difficulty? Insights from the Piano Syllabus Dataset | Pedro Ramoneda et.al. | 2403.03947v1 | null |
2024-03-06 | Separate and Detailed Treatment of Absolute Signal and Noise Enables NMR Under Adverse Circumstances | A Guinness et.al. | 2403.03943v1 | null |
2024-03-06 | The Heuristic Core: Understanding Subnetwork Generalization in Pretrained Language Models | Adithya Bhaskar et.al. | 2403.03942v1 | link |
2024-03-06 | GUIDE: Guidance-based Incremental Learning with Diffusion Models | Bartosz Cywiński et.al. | 2403.03938v1 | link |
2024-03-05 | LC-Tsalis-INF: Generalized Best-of-Both-Worlds Linear Contextual Bandits | Masahiro Kato et.al. | 2403.03219v1 | null |
2024-03-05 | The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning | Nathaniel Li et.al. | 2403.03218v1 | null |
2024-03-05 | Self-supervised 3D Patient Modeling with Multi-modal Attentive Fusion | Meng Zheng et.al. | 2403.03217v1 | null |
2024-03-05 | A Safety-Critical Framework for UGVs in Complex Environments: A Data-Driven Discrepancy-Aware Approach | Skylar X. Wei et.al. | 2403.03215v1 | null |
2024-03-05 | Scaling Rectified Flow Transformers for High-Resolution Image Synthesis | Patrick Esser et.al. | 2403.03206v1 | null |
2024-03-05 | CLEVR-POC: Reasoning-Intensive Visual Question Answering in Partially Observable Environments | Savitha Sam Abraham et.al. | 2403.03203v1 | null |
2024-03-03 | Bandit Profit-maximization for Targeted Marketing | Joon Suk Huh et.al. | 2403.01361v1 | null |
2024-03-03 | ModelWriter: Text & Model-Synchronized Document Engineering Platform | Ferhat Erata et.al. | 2403.01359v1 | null |
2024-03-03 | Improving Uncertainty Sampling with Bell Curve Weight Function | Zan-Kai Chong et.al. | 2403.01352v1 | null |
2024-03-03 | Efficient FIR filtering with Bit Layer Multiply Accumulator | Vincenzo Liguori et.al. | 2403.01351v1 | null |
2024-03-02 | ShapeBoost: Boosting Human Shape Estimation with Part-Based Parameterization and Clothing-Preserving Augmentation | Siyuan Bian et.al. | 2403.01345v1 | null |
2024-02-29 | DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models | Muyang Li et.al. | 2402.19481v1 | link |
2024-02-29 | Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers | Tsai-Shien Chen et.al. | 2402.19479v1 | null |
2024-02-29 | Learning a Generalized Physical Face Model From Data | Lingchen Yang et.al. | 2402.19477v1 | null |
2024-02-29 | The Counterfeit Conundrum: Can Code Language Models Grasp the Nuances of Their Incorrect Generations? | Alex Gu et.al. | 2402.19475v1 | null |
2024-02-29 | The All-Seeing Project V2: Towards General Relation Comprehension of the Open World | Weiyun Wang et.al. | 2402.19474v1 | link |
2024-02-29 | Retrieval-Augmented Generation for AI-Generated Content: A Survey | Penghao Zhao et.al. | 2402.19473v1 | link |
2024-02-29 | Loose LIPS Sink Ships: Asking Questions in Battleship with Language-Informed Program Sampling | Gabriel Grand et.al. | 2402.19471v1 | null |
2024-02-29 | Humanoid Locomotion as Next Token Prediction | Ilija Radosavovic et.al. | 2402.19469v1 | null |
2024-02-28 | UniMODE: Unified Monocular 3D Object Detection | Zhuoling Li et.al. | 2402.18573v1 | null |
2024-02-28 | Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards | Haoxiang Wang et.al. | 2402.18571v1 | link |
2024-02-28 | Diffusion Language Models Are Versatile Protein Learners | Xinyou Wang et.al. | 2402.18567v1 | null |
2024-02-28 | Approaching Human-Level Forecasting with Language Models | Danny Halawi et.al. | 2402.18563v1 | null |
2024-02-27 | The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits | Shuming Ma et.al. | 2402.17764v1 | null |
2024-02-27 | Reducing Unnecessary Alerts in Pedestrian Protection Systems Based on P2V Communications | Ignacio Soto et.al. | 2402.17763v1 | null |
2024-02-27 | Towards Optimal Learning of Language Models | Yuxian Gu et.al. | 2402.17759v1 | null |
2024-02-27 | ADL4D: Towards A Contextually Rich Dataset for 4D Activities of Daily Living | Marsil Zakour et.al. | 2402.17758v1 | null |
2024-02-27 | Evaluating Very Long-Term Conversational Memory of LLM Agents | Adyasha Maharana et.al. | 2402.17753v1 | null |
2024-02-26 | Pre-training Cross-lingual Open Domain Question Answering with Large-scale Synthetic Supervision | Fan Jiang et.al. | 2402.16508v1 | link |
2024-02-26 | Stochastic Conditional Diffusion Models for Semantic Image Synthesis | Juyeon Ko et.al. | 2402.16506v1 | null |
2024-02-26 | SAND: Decoupling Sanitization from Fuzzing for Low Overhead | Ziqiao Kong et.al. | 2402.16497v1 | null |
2024-02-26 | Intelligent Known and Novel Aircraft Recognition -- A Shift from Classification to Similarity Learning for Combat Identification | Ahmad Saeed et.al. | 2402.16486v1 | null |
2024-02-23 | Seamless Human Motion Composition with Blended Positional Encodings | German Barquero et.al. | 2402.15509v1 | link |
2024-02-23 | AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning | Jianguo Zhang et.al. | 2402.15506v1 | link |
2024-02-23 | Co-Supervised Learning: Improving Weak-to-Strong Generalization with Hierarchical Mixture of Experts | Yuejiang Liu et.al. | 2402.15505v1 | null |
2024-02-23 | Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition | Chun-Hsiao Yeh et.al. | 2402.15504v1 | link |
2024-02-23 | API-BLEND: A Comprehensive Corpora for Training and Benchmarking API LLMs | Kinjal Basu et.al. | 2402.15491v1 | null |
2024-02-22 | PALO: A Polyglot Large Multimodal Model for 5B People | Muhammad Maaz et.al. | 2402.14818v1 | link |
2024-02-22 | Cameras as Rays: Pose Estimation via Ray Diffusion | Jason Y. Zhang et.al. | 2402.14817v1 | null |
2024-02-22 | WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition | Lianghui Zhu et.al. | 2402.14812v1 | link |
2024-02-22 | Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking | Nikhil Prakash et.al. | 2402.14811v1 | null |
2024-02-22 | GeneOH Diffusion: Towards Generalizable Hand-Object Interaction Denoising via Denoising Diffusion | Xueyi Liu et.al. | 2402.14810v1 | link |
2024-02-22 | CriticBench: Benchmarking LLMs for Critique-Correct Reasoning | Zicheng Lin et.al. | 2402.14809v1 | link |
2024-02-22 | RelayAttention for Efficient Large Language Model Serving with Long System Prompts | Lei Zhu et.al. | 2402.14808v1 | link |
2024-02-22 | A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public Health | Nikhil Behari et.al. | 2402.14807v1 | null |
2024-02-22 | Identifying Multiple Personalities in Large Language Models with External Evaluation | Xiaoyang Song et.al. | 2402.14805v1 | null |
2024-02-21 | D-Flow: Differentiating through Flows for Controlled Generation | Heli Ben-Hamu et.al. | 2402.14017v1 | null |
2024-02-21 | Corrective Machine Unlearning | Shashwat Goel et.al. | 2402.14015v1 | link |
2024-02-21 | Geometry-Informed Neural Networks | Arturs Berzins et.al. | 2402.14009v1 | null |
2024-02-21 | OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems | Chaoqun He et.al. | 2402.14008v1 | link |
2024-02-21 | Hallucinations or Attention Misdirection? The Path to Strategic Value Extraction in Business Using Large Language Models | Aline Ioste et.al. | 2402.14002v1 | null |
2024-02-21 | Real-time 3D-aware Portrait Editing from a Single Image | Qingyan Bai et.al. | 2402.14000v1 | null |
2024-02-20 | CounterCurate: Enhancing Physical and Semantic Visio-Linguistic Compositional Reasoning via Counterfactual Examples | Jianrui Zhang et.al. | 2402.13254v1 | link |
2024-02-20 | BiMediX: Bilingual Medical Mixture of Experts LLM | Sara Pieri et.al. | 2402.13253v1 | link |
2024-02-20 | Video ReCap: Recursive Captioning of Hour-Long Videos | Md Mohaiminul Islam et.al. | 2402.13250v1 | null |
2024-02-20 | TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization | Liyan Tang et.al. | 2402.13249v1 | link |
2024-02-20 | Are Fact-Checking Tools Reliable? An Evaluation of Google Fact Check | Qiangeng Yang et.al. | 2402.13244v1 | null |
2024-02-20 | Unlocking Insights: Semantic Search in Jupyter Notebooks | Lan Li et.al. | 2402.13234v1 | null |
2024-02-20 | A Touch, Vision, and Language Dataset for Multimodal Alignment | Letian Fu et.al. | 2402.13232v1 | link |
2024-02-19 | FiT: Flexible Vision Transformer for Diffusion Model | Zeyu Lu et.al. | 2402.12376v1 | link |
2024-02-19 | A synthetic data approach for domain generalization of NLI models | Mohammad Javad Hosseini et.al. | 2402.12368v1 | null |
2024-02-19 | A Critical Evaluation of AI Feedback for Aligning Large Language Models | Archit Sharma et.al. | 2402.12366v1 | link |
2024-02-19 | Almost-linear time parameterized algorithm for rankwidth via dynamic rankwidth | Tuukka Korhonen et.al. | 2402.12364v1 | null |
2024-02-19 | Flip Graphs of Pseudo-Triangulations With Face Degree at Most 4 | Maarten Löffler et.al. | 2402.12357v1 | null |
2024-02-19 | Graph-Based Retriever Captures the Long Tail of Biomedical Knowledge | Julien Delile et.al. | 2402.12352v1 | null |
2024-02-16 | Fusion of Diffusion Weighted MRI and Clinical Data for Predicting Functional Outcome after Acute Ischemic Stroke with Deep Contrastive Learning | Chia-Ling Tsai et.al. | 2402.10894v1 | null |
2024-02-16 | RLVF: Learning from Verbal Feedback without Overgeneralization | Moritz Stephan et.al. | 2402.10893v1 | link |
2024-02-16 | Instruction Diversity Drives Generalization To Unseen Tasks | Dylan Zhang et.al. | 2402.10891v1 | null |
2024-02-16 | When is Tree Search Useful for LLM Planning? It Depends on the Discriminator | Ziru Chen et.al. | 2402.10890v1 | link |
2024-02-16 | Evaluation of EAP Usage for Authenticating Eduroam Users in 5G Networks | Leonardo Azalim de Oliveira et.al. | 2402.10889v1 | null |
2024-02-16 | Explainability for Machine Learning Models: From Data Adaptability to User Perception | julien Delaunay et.al. | 2402.10888v1 | null |
2024-02-16 | Reviewer2: Optimizing Review Generation Through Prompt Generation | Zhaolin Gao et.al. | 2402.10886v1 | null |
2024-02-16 | 3D Diffuser Actor: Policy Diffusion with 3D Scene Representations | Tsung-Wei Ke et.al. | 2402.10885v1 | null |
2024-02-15 | Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation | Huizhuo Yuan et.al. | 2402.10210v1 | null |
2024-02-15 | Recovering the Pre-Fine-Tuning Weights of Generative Models | Eliahu Horwitz et.al. | 2402.10208v1 | link |
2024-02-15 | Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment | Rui Yang et.al. | 2402.10207v1 | link |
2024-02-15 | Unlocking the Potential of Transformers in Time Series Forecasting with Sharpness-Aware Minimization and Channel-Wise Attention | Romain Ilbert et.al. | 2402.10198v1 | link |
2024-02-15 | BitDelta: Your Fine-Tune May Only Be Worth One Bit | James Liu et.al. | 2402.10193v1 | link |
2024-02-15 | Multi-Excitation Projective Simulation with a Many-Body Physics Inspired Inductive Bias | Philip A. LeMaitre et.al. | 2402.10192v1 | link |
2024-02-15 | FedAnchor: Enhancing Federated Semi-Supervised Learning with Label Contrastive Loss for Unlabeled Clients | Xinchi Qiu et.al. | 2402.10191v1 | null |
2024-02-14 | AQA-Bench: An Interactive Benchmark for Evaluating LLMs' Sequential Reasoning Ability | Siwei Yang et.al. | 2402.09404v1 | link |
2024-02-14 | Reinforcement Learning from Human Feedback with Active Queries | Kaixuan Ji et.al. | 2402.09401v1 | null |
2024-02-14 | Long-form evaluation of model editing | Domenic Rosati et.al. | 2402.09394v1 | null |
2024-02-14 | Introduction to Physically Unclonable Fuctions: Properties and Applications | M. Garcia-Bosque et.al. | 2402.09386v1 | null |
2024-02-14 | GraSSRep: Graph-Based Self-Supervised Learning for Repeat Detection in Metagenomic Assembly | Ali Azizpour et.al. | 2402.09381v1 | link |
2024-02-13 | IM-3D: Iterative Multiview Diffusion and Reconstruction for High-Quality 3D Generation | Luke Melas-Kyriazi et.al. | 2402.08682v1 | null |
2024-02-13 | Mitigating Object Hallucination in Large Vision-Language Models via Classifier-Free Guidance | Linxi Zhao et.al. | 2402.08680v1 | null |
2024-02-13 | COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability | Xingang Guo et.al. | 2402.08679v1 | link |
2024-02-13 | Graph Mamba: Towards Learning on Graphs with State Space Models | Ali Behrouz et.al. | 2402.08678v1 | link |
2024-02-13 | Model Assessment and Selection under Temporal Distribution Shift | Elise Han et.al. | 2402.08672v1 | link |
2024-02-13 | Rec-GPT4V: Multimodal Recommendation with Large Vision-Language Models | Yuqing Liu et.al. | 2402.08670v1 | null |
2024-02-13 | Improving Generalization in Semantic Parsing by Increasing Natural Language Variation | Irina Saparina et.al. | 2402.08666v1 | link |
2024-02-12 | A systematic investigation of learnability from single child linguistic input | Yulu Qin et.al. | 2402.07899v1 | null |
2024-02-12 | Label-Efficient Model Selection for Text Generation | Shir Ashury-Tahan et.al. | 2402.07891v1 | null |
2024-02-12 | Toward an Android Static Analysis Approach for Data Protection | Mugdha Khedkar et.al. | 2402.07889v1 | null |
2024-02-12 | WildfireGPT: Tailored Large Language Model for Wildfire Analysis | Yangxinyu Xie et.al. | 2402.07877v1 | null |
2024-02-12 | Policy Improvement using Language Feedback Models | Victor Zhong et.al. | 2402.07876v1 | null |
2024-02-09 | Feedback Loops With Language Models Drive In-Context Reward Hacking | Alexander Pan et.al. | 2402.06627v1 | link |
2024-02-09 | Understanding the Effects of Iterative Prompting on Truthfulness | Satyapriya Krishna et.al. | 2402.06625v1 | null |
2024-02-09 | A two-stage algorithm in evolutionary product unit neural networks for classification | Antonio J. Tallón-Ballesteros et.al. | 2402.06622v1 | null |
2024-02-09 | TIC: Translate-Infer-Compile for accurate 'text to plan' using LLMs and logical intermediate representations | Sudhir Agarwal et.al. | 2402.06608v1 | null |
2024-02-09 | On the Out-Of-Distribution Generalization of Multimodal Large Language Models | Xingxuan Zhang et.al. | 2402.06599v1 | null |
2024-02-09 | CigaR: Cost-efficient Program Repair with LLMs | Dávid Hidvégi et.al. | 2402.06598v1 | link |
2024-02-09 | Understanding the Weakness of Large Language Model Agents within a Complex Android Environment | Mingzhe Xing et.al. | 2402.06596v1 | link |
2024-02-08 | InstaGen: Enhancing Object Detection by Training on Synthetic Dataset | Chengjian Feng et.al. | 2402.05937v1 | null |
2024-02-08 | SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models | Peng Gao et.al. | 2402.05935v1 | link |
2024-02-08 | Time Series Diffusion in the Frequency Domain | Jonathan Crabbé et.al. | 2402.05933v1 | link |
2024-02-08 | WebLINX: Real-World Website Navigation with Multi-Turn Dialogue | Xing Han Lù et.al. | 2402.05930v1 | link |
2024-02-08 | An Interactive Agent Foundation Model | Zane Durante et.al. | 2402.05929v1 | null |
2024-02-08 | Sharp Rates in Dependent Learning Theory: Avoiding Sample Size Deflation for the Square Loss | Ingvar Ziemann et.al. | 2402.05928v1 | null |
2024-02-07 | Image captioning for Brazilian Portuguese using GRIT model | Rafael Silva de Alencar et.al. | 2402.05106v1 | null |
2024-02-07 | You Can REST Now: Automated Specification Inference and Black-Box Testing of RESTful APIs with Large Language Models | Alix Decrop et.al. | 2402.05102v1 | null |
2024-02-07 | Hydragen: High-Throughput LLM Inference with Shared Prefixes | Jordan Juravsky et.al. | 2402.05099v1 | null |
2024-02-07 | On diffusion models for amortized inference: Benchmarking and improving stochastic control and sampling | Marcin Sendera et.al. | 2402.05098v1 | link |
2024-02-07 | Language-Based Augmentation to Address Shortcut Learning in Object Goal Navigation | Dennis Hoftijzer et.al. | 2402.05090v1 | null |
2024-02-07 | Hyperspectral acquisition with ScanImage at the single pixel level: Application to time domain coherent Raman imaging | Samuel Metais et.al. | 2402.05086v1 | null |
2024-02-06 | Linear-time Minimum Bayes Risk Decoding with Reference Aggregation | Jannis Vamvas et.al. | 2402.04251v1 | link |
2024-02-06 | CAST: Clustering Self-Attention using Surrogate Tokens for Efficient Transformers | Adjorn van Engelenhoven et.al. | 2402.04239v1 | null |
2024-02-06 | CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations | Ji Qi et.al. | 2402.04236v1 | link |
2024-02-06 | **Role of spontaneously generated |