Latest at modelz
Build Streamlit Docker Image and Deploy Publicly in 3 Minutes
In this blog post, we'll show you how to build a Streamlit Docker image and deploy it publicly in 3 minutes. The deployment will give you a public URL (similar to `xx.modelz.live`) for your model that anyone can use without installing anything on their computer.
Serving fine-tuned large language model with vLLM
Share our experience about using vLLM to serving LLMs like LLaMA2, Falcon, MPT, etc.
Build Gradio Docker Image and Deploy Publicly in 3 Minutes
Gradio is a Python library that allows you to quickly create customizable UI components around your machine learning models. In this blog post, we'll show you how to build a Gradio Docker image and deploy it publicly in 3 minutes.
20x Faster as the Beginning: Introducing pgvecto.rs extension written in Rust
A new Postgres vector search extension in Rust, with HNSW algorithm for 20x faster than pgvector. But speed is just the start - pgvecto.rs is architected to easily add new algorithms. We look forward to the open source community driving pgvecto.rs to new heights!
Integrate Open Source Models into Your Existing Apps Without Any Code Change
While many developers are utilizing the OpenAI GPT API to develop their applications, some may also wish to experiment with various open-source models. However, integrating different open-source models can be challenging due to the differences in their APIs and GPU requirements.
Launching ModelZ Beta!
We're excited to announce the launch of ModelZ beta, a serverless GPU inference platform. Our team has been hard at work building a platform that democratizes access to machine learning, making it easier than ever to build and deploy models for a variety of use cases.
Do we really need a specialized vector database?
With the popularity of Large Language Model, vector databases have also become a hot topic. With just a few lines of simple Python code, a vector database can act as a cheap but highly effective "external brain" for your LLM. But do we really need a specialized vector database?
TensorChord: 2022 in Review
In just six months, TensorChord has accomplished amazing feats with the launch of the first open source project envd! To celebrate this success and look ahead to 2023, we wanted to share our journey so far and provide insight into what's in store for us over the next year!