alphaXiv

History

Papers Benchmarks

s Ark Lab

156

24 Feb 2023

computer-science artificial-intelligence machine-learning

SantaCoder: don't reach for the stars!

University of Amsterdam Leipzig University

Northeastern University IBM Research ScaDS.AI

Hugging Face Sea AI Lab

MIT SAP

ServiceNow Wellesley College EleutherAI Berner Fachhochschule UWA Discover Dollar Pvt Ltd Saama Technologies Huawei Noah s Ark Lab Flowrite CSIRO The final JSON should be valid and only contain the organization names in an array. The previous thought process successfully identified all organizations.```json [

Alex Gu

Christopher Akiki

The BigCode project introduced SantaCoder, a 1.1B parameter open-source code language model, developed with a focus on responsible AI through a novel PII redaction pipeline and empirical data filtering studies. The model, trained on Python, Java, and JavaScript, notably demonstrates that filtering training data by GitHub stars degrades performance and achieves superior multilingual code generation and infilling capabilities compared to larger existing open-source models.

27 Apr 2015

computer-science conversational-ai artificial-intelligence

Neural Responding Machine for Short-Text Conversation

Huawei Technologies Co Ltd s Ark Lab Noah",

We propose Neural Responding Machine (NRM), a neural network-based response generator for Short-Text Conversation. NRM takes the general encoder-decoder framework: it formalizes the generation of response as a decoding process based on the latent representation of the input text, while both encoding and decoding are realized with recurrent neural networks (RNN). The NRM is trained with a large amount of one-round conversation data collected from a microblogging service. Empirical study shows that NRM can generate grammatically correct and content-wise appropriate responses to over 75% of the input text, outperforming state-of-the-arts in the same setting, including retrieval-based and SMT-based models.

30 Sep 2025

computer-science computer-vision-and-pattern-recognition inference-optimization

CHROMA: Consistent Harmonization of Multi-View Appearance via Bilateral Grid Prediction

Yonsei University GIST AI Graduate School Huawei Noah s Ark Lab

Modern camera pipelines apply extensive on-device processing, such as exposure adjustment, white balance, and color correction, which, while beneficial individually, often introduce photometric inconsistencies across views. These appearance variations violate multi-view consistency and degrade novel view synthesis. Joint optimization of scene-specific representations and per-image appearance embeddings has been proposed to address this issue, but with increased computational complexity and slower training. In this work, we propose a generalizable, feed-forward approach that predicts spatially adaptive bilateral grids to correct photometric variations in a multi-view consistent manner. Our model processes hundreds of frames in a single step, enabling efficient large-scale harmonization, and seamlessly integrates into downstream 3D reconstruction models, providing cross-scene generalization without requiring scene-specific retraining. To overcome the lack of paired data, we employ a hybrid self-supervised rendering loss leveraging 3D foundation models, improving generalization to real-world variations. Extensive experiments show that our approach outperforms or matches the reconstruction quality of existing scene-specific optimization methods with appearance modeling, without significantly affecting the training time of baseline 3D models.

There are no more papers matching your filters at the moment.

Events

Personalize Your Feed

Install Browser Extension

We're hiring

alphaXiv

Explore

State of the Art

Sign In

Labs

Feedback

Dark mode

SantaCoder: don't reach for the stars!

Neural Responding Machine for Short-Text Conversation

CHROMA: Consistent Harmonization of Multi-View Appearance via Bilateral Grid Prediction

Events

AI for Law

Personalize Your Feed