bellvei.cat

RedPajama replicates LLaMA dataset to build open source, state-of-the-art LLMs

4.6 (509) · $ 27.00 · In stock

RedPajama, which creates fully open-source large language models, has released a 1.2 trillion token dataset following the LLaMA recipe.

LLaMA clone: RedPajama – first open-source decentralized AI with open dataset

Dolma, OLMo, and the Future of Open-Source LLMs

The data that trains AI is under the spotlight — and even I'm weirded out

Report: The Openness of AI A Contrary Research Deep Dive

2311.17035] Scalable Extraction of Training Data from (Production) Language Models

RedPajama-INCITE-3B, an LLM for everyone

The data that trains AI is under the spotlight — and even I'm weirded out

Top 10 List of Large Language Models in Open-Source

The Latest Open Source LLMs and Datasets

Intrinsic Labs

RedPajama Project: An Open-Source Initiative to Democratizing LLMs - KDnuggets