Hacker News
pabs3
on April 26, 2025
| on:
Petition to the Open Source Initiative: Publish th...
It is definitely doable to get openly licensed data; you just have to do it through voluntary participation in crowdsourced data acquisition programs. For example, the RNNoise model was retrained on such crowdsourced data.
tedivm
on April 26, 2025
IBM did it with their Granite models.
pabs3
on April 26, 2025
The data used for training Granite doesn't sound like it would be under FOSS licenses.
https://en.wikipedia.org/wiki/IBM_Granite