code generation - Salesforce AI

CodeGen2.5: Small, but mighty

Equal contribution between Erik Nijkamp and Hiroaki Hayashi. Paper Code Tweet Abstract The family of Salesforce CodeGen models is growing with CodeGen2.5 – a small, but mighty model! While there has been a recent trend of large language models (LLM) of increasing size, we show that a small model can

06 Jul 2023 • Erik Nijkamp • #CodeGen

Long Sequence Modeling with XGen: A 7B LLM Trained on 8K Input Sequence Length

TLDR We trained a series of 7B LLMs named XGen-7B with standard dense attention on up to 8K sequence length for up to 1.5T tokens. We also fine tune the models on public-domain instructional data. The main take-aways are: * On standard NLP benchmarks, XGen achieves comparable or better results

28 Jun 2023 • Erik Nijkamp • #llm

CodeT5+: Open Code Large Language Models

TL;DR: CodeT5+ is a new family of open code large language models (LLMs) with improved model architectures and training techniques. CodeT5+ achieves the state-of-the-art performance among the open-source LLMs on many challenging code intelligence tasks, including zero-shot evaluation on the code generation benchmark HumanEval. Background: Code LLMs Large language

20 May 2023 • Yue Wang • #codet5+

AI Coding with CodeRL: Toward Mastering Program Synthesis with Deep Reinforcement Learning

TL;DR: CodeRL is a new framework for program synthesis through holistic integration of pretrained language models and deep reinforcement learning. By utilizing unit test feedback as part of model training and inference, and integrating with an improved CodeT5 model, CodeRL achieves state-of-the-art results on competition-level programming tasks. The following

19 Jul 2022 • Henry Hung Le • #reinforcement-learning

Conversational AI Programming with CodeGen: Let AI Write Code For You

Links: Research Paper [https://arxiv.org/abs/2203.13474], Github [https://github.com/salesforce/CodeGen] -------------------------------------------------------------------------------- Can you imagine a machine writing an app for you, just by telling it what you want? As futuristic as this scenario sounds, it’s actually here today. Salesforce AI Research outlines conversational AI

29 Mar 2022 • Erik Nijkamp • #conversational AI

Blog

CodeGen2.5: Small, but mighty

Long Sequence Modeling with XGen: A 7B LLM Trained on 8K Input Sequence Length

CodeT5+: Open Code Large Language Models

AI Coding with CodeRL: Toward Mastering Program Synthesis with Deep Reinforcement Learning

Conversational AI Programming with CodeGen: Let AI Write Code For You