| | Deploying custom ComfyUI workflows as APIs (baseten.co) |
|
1 point by AnhTho_FR 22 days ago | past
|
| | How to build function calling and JSON mode for open-source and fine-tuned LLMs (baseten.co) |
|
1 point by philipkiely 3 months ago | past
|
| | How to double tokens per second for Llama 3 with Medusa (baseten.co) |
|
2 points by philipkiely 3 months ago | past
|
| | Show HN: Automatically Build Nvidia TRT-LLM Engines (baseten.co) |
|
2 points by mikejulietbravo 4 months ago | past
|
| | Show HN: 60% higher tokens per second for 70B custom LLMs (baseten.co) |
|
1 point by mikejulietbravo 4 months ago | past
|
| | Show HN: Baseten Chains – Framework and SDK for Multi-Model AI Products (baseten.co) |
|
9 points by mikejulietbravo 5 months ago | past | 5 comments
|
| | Open Source Inference Engine Baseten Raises $40M from IVP, Spark and Greylock (baseten.co) |
|
2 points by mikejulietbravo 9 months ago | past | 1 comment
|
| | FP8: Efficient model inference with 8-bit floating point numbers (baseten.co) |
|
2 points by philipkiely 9 months ago | past
|
| | Introduction to quantizing machine learning models (baseten.co) |
|
1 point by tuhins 10 months ago | past
|
| | Faster Mixtral inference with TensorRT-LLM and quantization (baseten.co) |
|
2 points by tikkun 11 months ago | past | 1 comment
|
| | A guide to open-source LLM inference and performance (baseten.co) |
|
113 points by varunshenoy on Nov 20, 2023 | past | 14 comments
|
| | How we got Stable Diffusion XL inference to under 2 seconds (baseten.co) |
|
51 points by varunshenoy on Aug 31, 2023 | past | 5 comments
|
| | SDXL inference in under 2 seconds (baseten.co) |
|
3 points by tuhins on Aug 31, 2023 | past | 1 comment
|
| | Three techniques to adapt LLMs for any use case (baseten.co) |
|
1 point by philipkiely on June 15, 2023 | past
|
| | Show HN: ChatLLaMA – A ChatGPT style chatbot for Facebook's LLaMA (baseten.co) |
|
402 points by aaronrelph on March 22, 2023 | past | 215 comments
|
| | Show HN: Fine-tune generative models in 1 line of code (baseten.co) |
|
16 points by aqader on March 1, 2023 | past
|
| | Serving four million Riffusion requests in two days (baseten.co) |
|
5 points by philipkiely on Dec 21, 2022 | past
|
| | Accelerating model deployment: 100X faster dev loops with draft models (baseten.co) |
|
1 point by tuhins on Dec 9, 2022 | past
|
| | Show HN: Free Stable Diffusion 2.0 hosted interface (baseten.co) |
|
25 points by philipkiely on Nov 24, 2022 | past | 2 comments
|
| | Try it yourself: Speech to text with Whisper (baseten.co) |
|
5 points by philipkiely on Oct 1, 2022 | past
|
| | Deploying Stable Diffusion in Production Using Truss (baseten.co) |
|
3 points by philipkiely on Sept 1, 2022 | past
|
| | Hosted Stable Diffusion Demo (baseten.co) |
|
7 points by philipkiely on Aug 24, 2022 | past
|
| | Code generation interactive demo (Salesforce Codegen mono 2B) (baseten.co) |
|
2 points by philipkiely on July 1, 2022 | past
|
| | DALL-E Mini – Generate images from a text prompt (baseten.co) |
|
52 points by tuhins on June 10, 2022 | past | 22 comments
|
| | Demo – Text generation with EleutherAI's GPT-J-6B model (baseten.co) |
|
1 point by tuhins on April 29, 2022 | past
|
| | Show HN: Baseten – Build ML-powered applications (baseten.co) |
|
112 points by philipkiely on April 26, 2022 | past | 11 comments
|
| | How BaseTen is using “docs as code” (baseten.co) |
|
5 points by philipkiely on March 9, 2022 | past
|
| | GFP-GAN – Photo Restoration App (baseten.co) |
|
1 point by tuhins on Dec 31, 2021 | past
|
| | Transcribing large audio files with wav2vec (baseten.co) |
|
1 point by tuhins on Dec 15, 2021 | past
|
| | Hotdog or Not Hotdog (Cutedog) (baseten.co) |
|
1 point by tuhins on Dec 3, 2021 | past
|
|
|
More |