Build, learn and collaborate on open-source AI

Blueprints are workflows and resources to help you build AI applications

Podcast personalities

Try these prompts to create a fun podcast in the style of…

Practical use cases to create podcasts using this Blueprint
Don’t Miss These Events!

Tech demos, talks and events on AI

This Week in Open Source

Get the latest information on the open source community.

Other Open Source Alternatives

Here are other existing solutions to create your own podcasts:

Blueprint of the week

Turn your documents into engaging podcasts quickly and easily. While NotebookLM is taking the world by storm, you can explore and customize our blueprint to transform documents into dynamic audio podcasts using open tools and models.

Join our community!

Join our vibrant community of creators and builders!

Data Provenance Explorer

A dataset and toolkit that allow developers to filter for the finetuning datasets that best fit their requirements.
Modality: Multimodal
Task purpose: Multitask

Common Voice

A multilingual, crowdsourced collection of voice recordings.
Modality: Audio
Task purpose: Speech technologies

FineWeb

A comprehensive dataset of high-quality English web data.
Modality: Text
Task purpose: LLM training

The Pile

An English text dataset designed for training large scale language models.
Modality: Text
Task purpose: LLM training
NotebookLlama

Primary function: Converts PDFs to podcasts using a structured pipeline
Core components: Llama models (1B, 8B, 70B), parler-tts, bark TTS tools
Output features: Dramatic podcast-style transcript and audio
Open source license: MIT License

Podcastfy

Primary function: Converts multi-modal content to multi-lingual podcasts
Core components: OpenAI, Anthropic, Google LLMs; ElevenLabs, Google TTS
Output features: Short and long-form podcasts in multiple languages
Open source license: Apache 2.0 License

Open NotebookLM

Primary function: Converts PDFs to podcast episodes with natural dialogues
Core components: Llama 3.1 405B, MeloTTS, Bark, Jina Reader
Output features: Natural-sounding dialogues in podcast format
Open source license: Apache 2.0 License

Stay Updated with Events

Stay informed about upcoming events and webinars hosted by the Developer Hub. These events provide valuable opportunities to learn from experts, network with peers, and stay updated on the latest trends in AI development.

Showcase Your AI Projects

At the Developer Hub, we encourage you to showcase your AI projects. Share your innovations, receive feedback, and inspire others in the community. This is a great opportunity to gain visibility and recognition for your hard work.

Tools for Effective Collaboration

Utilize our collaboration tools to work effectively with your team. From version control systems to project management software, we provide the resources you need to streamline your workflow and achieve your project goals.

Contribute to Open Source

Join us in contributing to open-source AI projects. Your contributions can make a significant impact on the community and help advance technology. Learn how to get started and find projects that align with your interests.

Expand Your Knowledge

Explore a variety of learning resources available at the Developer Hub. Whether you are a beginner or an experienced developer, you will find tutorials, articles, and videos that cater to your learning needs and help you grow your skills.

Join the Discussion

Participate in our community forums to discuss ideas, ask questions, and share knowledge. The Developer Hub is a place where you can connect with other developers and gain insights from their experiences.

Inspiring Success Stories

Discover inspiring success stories from developers who have made significant contributions to the AI community. Learn about their journeys, challenges, and how they overcame obstacles to achieve their goals.

Access Our Resource Library

Access a vast library of resources, including eBooks, research papers, and case studies related to AI development. Our resource library is designed to support your learning and provide valuable insights into the field.

Find a Mentor

Our mentorship program connects experienced developers with those looking to learn and grow. Whether you need guidance on a project or career advice, our mentors are here to help you succeed in your journey.

AI Development Best Practices

Familiarize yourself with best practices in AI development. Our resources cover essential topics such as coding standards, testing methodologies, and deployment strategies to help you create high-quality applications.

Expand Your Network

Participate in networking events to connect with other developers, industry leaders, and potential collaborators. Building a strong network can open doors to new opportunities and partnerships in the AI space.

Comprehensive API Docs

Access comprehensive API documentation to help you integrate and utilize various AI services. Our documentation is designed to be user-friendly, providing clear examples and detailed explanations to assist you in your development process.

Comprehensive Tutorials

Explore a wide range of tutorials and guides tailored for developers at all levels. Our resources cover various topics, from beginner to advanced, ensuring you have the knowledge needed to succeed in AI development.

Discuss AI Ethics

Join discussions on AI ethics and its implications in society. This is an important topic that requires thoughtful consideration, and your insights can contribute to a better understanding of ethical AI development.

Participate in Challenges

Engage in community challenges to test your skills and creativity. These challenges are designed to encourage collaboration and innovation, providing a fun way to learn and grow as a developer.

We Value Your Feedback

Your feedback is important to us. Share your suggestions and ideas to help improve the Developer Hub. We are committed to creating a supportive environment that meets the needs of our community.

Showcase Your Projects

Utilize our project showcase platform to display your work. This is an excellent way to gain recognition and feedback from the community, as well as to inspire others with your innovative ideas and solutions.

Get Your Code Reviewed

Take advantage of our code review services to improve your coding skills. Our experienced developers will provide constructive feedback on your code, helping you to enhance your programming abilities and project outcomes.

Welcome to the Developer Hub

The Developer Hub is a vibrant community designed for developers to build, learn, and collaborate on open-source AI projects. Here, you can find resources, tutorials, and forums to enhance your skills and connect with like-minded individuals.

SF Demo Night - Dec 5th, 2 PM

Join us as we gather together the entire Bay Area tech community and its most brilliant creators for a night of mind-blowing demos that push the boundaries of what's possible!

Biased Data - Dec 10th, 4 PM

Linda Dounia Rebeiz is a Senegalese artist who was disappointed when searches for her home city of Dakar on text-to-image generative AI tools showed low-rise, decaying buildings that were not at all representative of the vibrant architecture surrounding her. This is part of the reason why Linda uses her own datasets to carefully train AI that will more accurately reflect her reality.

Web Applets - Dec 10th, 7:30 PM

Learn about the Web Applets open standard and SDK, and how to use them to create rich, graphical client-side apps that can be used by both agents and humans.

Theia IDE - Dec 11th, 4 PM

Get ready for a hands-on demo of the AI-powered Theia IDE, a truly open, transparent, and flexible AI-driven development environment. A recording of the event is available.

Turn your newsletters into an audio briefing

Instead of scanning dozens of emails, create audio briefings for your commute. Try feeding in our AI newsletter!

Turn your study notes into a smarter learning program

Turn your study materials into structured learning with spaced repetition.

Turn your museum guide into a free audio tour

Upload your museum content to craft your own guide. Ask it for quick facts or summaries about any artifact.

Emma Chamberlain

This episode is for listeners who enjoy a bit of attitude with their insights. Hosts are encouraged to use plenty of sarcasm and cultural references.

Suggested phrases:

“Here’s the tea”, “You know what? Let me just say it… no, wait, I have to say it”, “Hold up, did you just say that?”
Ezra Klein

This episode invites listeners into a thoughtful exploration. No need to oversimplify or rush the discussion—let the ideas breathe and evolve organically.

Suggested phrases:

"This is where it gets interesting…", "The question that keeps coming up for me is…"
Local RAG - Dec 17th, 7 PM

Learn how to create an ultra-low dependency RAG application using only sqlite-vec, llamafile, and bare-bones Python — no other dependencies or "pip install"s required! Watch the event recording here!

Pleias - Training LLMs required copyrighted data until it did not

Pleias released Pleias 1.0 models, a family of fully open small language models that features two specialized models for knowledge retrieval with unprecedented performance for their size on multilingual RAG. These represent the first ever models trained exclusively on open data (i.e., non-copyrighted or published under a permissible license), being the first fully EU AI Act compliant models, and setting a new standard for safety and openness.

Intro to Blueprints Hub - Jan 22nd, 1:30 PM EST (Discord)

Learn about Blueprints - open source building blocks for AI applications - and how to build with the blueprint 'Document to Podcast', which transforms text into podcast-style conversations using entirely open source tools and models.

NovaSky open sources Sky-T1, the first truly open source reasoning model

UC Berkeley's NovaSky team has released Sky-T1-32B-Preview, a 32-billion-parameter open-source model that matches OpenAI’s o1-preview on key benchmarks. The model offers advanced reasoning in math and coding, achieving high performance with a training cost below $450.

Common Corpus: A 2+ trillion token dataset fully open

Pleias has released Common Corpus, the largest public domain dataset for training LLMs, containing 500 billion words across multiple languages. The project demonstrates that LLMs can be trained on fully open, copyright-cleared content.

Everything Open - Jan 20th-22nd, Adelaide

Everything Open is a conference centered on open technologies like Linux, open source software, hardware, and data, along with their communities. It offers technical deep-dives and insights from industry leaders on a wide range of related topics.

Codestral: Hello, World!

Mistral AI has released Codestral, an open-weight generative AI model explicitly designed for code generation tasks. Codestral is trained on a diverse dataset of 80+ programming languages, and it sets a new standard on the performance/latency space for code generation compared to previous models used for coding. As it masters code and English, it can be used to design advanced AI applications for software developers.

Public Domain 12M: a Highly Aesthetic Image-Text Dataset

PD12M is a large dataset of 12.4 million public domain images with synthetic captions, aimed at addressing copyright concerns in AI training data. With the dataset, Spawning introduced a novel governance framework that enables public auditing, statistical stability, and community feedback through their Source.Plus platform.

FOSDEM 2025 - Feb 1st-2nd, Brussels

FOSDEM is a community-driven event that brings together open source developers and communities. It offers a platform to connect, learn about the latest open source developments, and promote the benefits of free software and open solutions.

LangChain introduces ambient agents

LangChain implemented an email assistant showcasing ambient agent patterns. Unlike traditional chatbots that require the users to send a message every time they want the agent to do work, ambient agents proactively assist users by responding to contextual signals rather than waiting for direct prompts. They can manage multiple tasks simultaneously and incorporate human-in-the-loop elements for notifications, queries, and action reviews, ensuring user control and trust.

Mistral has entered the chat

Mistral AI has updated its free platform "le Chat" with several new features, including web search with citations, a Canvas interface for content creation and editing, and advanced document analysis powered by their new Pixtral Large multimodal model.

State of Open Conference - Feb 4th-5th, London

SOOCon is the UK's open technology conference on open-source software, open hardware, open data, open standards, and AI openness.

Researchers at MBZUAI released LlamaV-o1 for multimodal reasoning

The authors present a new benchmark, metric, and curriculum learning-based model to improve multimodal reasoning. The model achieves state-of-the-art performance with enhanced generalization, efficiency, and robustness, highlighting the importance of step-by-step reasoning for complex tasks.
