Hacker Newsnew | past | comments | ask | show | jobs | submit | sovok's commentslogin

An LLM step also works pretty well for diarization. You get a transcript with speaker-segmentation (with whisper and pyannote for example), SPEAKER_01 says at some point „Hi I’m Bob. And here’s Alice“, SPEAKER_02 says „Hi Bob“ and now the LLM can infer that SPEAKER_01 = Bob and SPEAKER_02 = Alice.


Yep, my agent i built years ago worked very well with this approach, using a whisper-pyannote combo. The fun part is knowning when to end transcription in noisy environments like a coffee shop.


Seems to be a common thing. I did something similar two days ago. Pretty helpful so far: https://github.com/combatwombat/cmdline-ai-helper


Very neat. I like the channel idea. There’s also https://random-video.com which shows one of 4.5 Billion YouTube videos randomly (or with a filter for view count, language, year).

Pretty fun to discover what’s out there, without being influenced by YouTubes algorithm. It’s based on the YT Archive project and some other sources, since YT sadly has no randomize function.


There seem to be a few fancy boat projects that turn out to just be renderings and NFT scams.

Like https://www.pangeosyacht.com/projects, an $8 Billion „Terayacht“ maybe launching in 2033. Until then they sell NFTs for a place in the virtual Unreal Engine version of this (also still to be built).

The design studio behind that (https://www.lazzarinidesignstudio.com) has many renderings of vehicle concepts, but nothing realized yet.

Except for one thing, the Songball https://www.lazzarinidesignstudio.com/the-songball. A little plastic speaker that plays back one song you can record yourself, starting at 20€ (https://www.songball.com/?lang=en).


This was fun. I like the little bounce of playing field when the block hits the bottom. And it loads pretty fast for a Godot web game.


Thank you! I like it and people I have shown it to say it feels ”addictive”, but something is missing as I am the only person who keeps playing after some levels :D so maybe it’s for me only.


https://font2png.com to browse font-icons and export them as PNG, with background/foreground color. Usually you want SVG, but sometimes a PNG is better. I use it mostly to generate quick favicons. It was also fun to make it work completely client side with canvas.

https://github.com/combatwombat/rmdb imports the IMDb database (at least the limited .tsv files they provide) into MySQL so you can query it. List the highest rated horror movies of the 90s, genre distribution by year etc. I made that mostly to c̶h̶e̶a̶t̶ ̶o̶n̶ help with https://www.reddit.com/r/GuessTheMovie, with limited success. Still fun though to SQL-query over all movies ever made.


Ooh, I have a use for the IMDB one, I hadn't realised they had data export. Thanks.


Nice. I gather one could generate a lot of listicles with that. "The 10 most * movies of *", "5 early but great movies by famous actors", ...

Too bad IMDb doesn't provide more data in the export. Budget, box office or country of origin would be nice.


It was the db used in one of the cs50 psets this year!


The linked project website is interesting as well and quite a deep rabbit hole: https://www.kermitproject.org/kermit.html

> Although terminal emulation has been largely supplanted by the Web for online access, Kermit software continues to play a role in other applications such as remote sensing and data collection, management and troubleshooting of networking and telecommunications equipment, back office work, cargo and inventory management, medical insurance claim submission, electronic funds transfer, and online filing of income tax returns. Kermit software is embedded in network routers and switches, in cell-phone towers, in medical diagnostic and monitoring equipment, even in cardiac pacemakers, not to mention the cash registers of quite a few big-name "big box" retailers. In 2002 Kermit flew on the International Space Station, and Kermit software is the communication method used by EM APEX ocean floats (left) supplying realtime data to hurricane researchers and trackers to this day (the hurricane project entered a new expanded phase in 2010 based on a new version of Embedded Kermit).

It contains its own Perl-like scripting language that predates Perl (https://www.kermitproject.org/ckscripts.html#tut), which supports „Prolog-like declarative logic programming“ (https://www.kermitproject.org/ftp/kermit/scripts/socrates), some OOP (https://www.kermitproject.org/ftp/kermit/scripts/oop) and S-expressions (https://www.kermitproject.org/ftp/kermit/scripts/shortest_pa...).

There is a text-to-HTML converter (https://www.kermitproject.org/ftp/kermit/scripts/html). Someone should build a static site generator with that, running on a calculator or embedded system on some buoy.


https://font2png.com to create PNGs from icon fonts like Font Awesome. SVGs are usually the way to go, but sometimes you need a PNG. A previous tool that did the same went away, so I built my own.

Also https://frt.rip, a whoopie cushion with spatial audio, because I had the domain lying around.

Both don't make money, but they also have no users besides myself so they can run on my cheap server for free basically.


> Also https://frt.rip, a whoopie cushion with spatial audio

This may be the most magnificent phrase I have ever read on the internet.

But when I visit in Firefox (mostly vanilla, on Pop!OS), a scary warning pops up that the site is trying to do something to my machine, with no context of what. I presume that "spatial audio" requires some support beyond just whatever my browser already knows how to do, but.... what?

Hey, it works with no warnings on my phone! Oh, and that audio is glorious. Juicy. Extravagant.

This is going on my smallest most concealable Bluetooth speaker...


Thanks. The spatial audio is just a feature of the excellent https://howlerjs.com/.

But I added MIDI keyboard support to the secret fart piano recently and thus Firefox throws up a scary warning. It's now disabled unless you access https://frt.rip#midi. But you can also use your computer keyboard to play/fart a little tune, see Web Inspector for instructions :)


> https://font2png.com to create PNGs from icon fonts like Font Awesome.

Would you consider an extra slider for the corner radius? That would allow to download icons with a background which is not completely square, but also a squircle or circle (maximum corder radius).


Alright, the background can now have rounded corners with smoothing / squircles like in iOS, using Figmas algorithm. Thanks for the suggestion! → https://font2png.com


Nice! Thank you for taking the request and for the fast turnaround.


Good idea. I never really use the background color, but since it's there, a corner radius makes sense. I'll play around with that soon.


That is a nicely polished site. Love the chord cheat sheet.


Looks interesting. Do you read the Figma file format and re-implement all the rendering?

Although I get an 500 error right now: https://sc.robsite.net/files/1667596588-pikmaker-500-2.png


And yes, we are basically reading figma files and rendering ourselves. The idea is to also support psd and illustrator files in the future.


Figma has an API which will render the artboard and return an image. Why not use that?


Because you can't change the values :/


Sorry for that. Can you share with me the Figma link in Discord? So I can take a look on what is happening.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: