Hacker News
cscurmudgeon's favorites
1. Do Large Language Models learn world models or just surface statistics? (2023) (thegradient.pub)
   37 points by fragmede 6 hours ago | 53 comments
2. OK, I can partly explain the LLM chess weirdness now (dynomight.net)
   369 points by dmazin 1 day ago | 328 comments
3. Bayesian Neural Networks (toronto.edu)
   214 points by reqo 4 days ago | 37 comments
4. Show HN: Llama 3.2 Interpretability with Sparse Autoencoders (github.com/paulpauls)
   497 points by PaulPauls 22 hours ago | 71 comments
5. Nickel Plating Handbook [pdf] (nickelinstitute.org)
   59 points by nativeit 2 days ago | 19 comments
6. New textbook teaches students about matrix methods and their real world apps (umich.edu)
   77 points by teleforce 10 days ago | 33 comments
7. LLaVA-O1: Let Vision Language Models Reason Step-by-Step (arxiv.org)
   176 points by lnyan 4 days ago | 32 comments
8. AlphaProof's Greatest Hits (rishimehta.xyz)
   249 points by rishicomplex 5 days ago | 132 comments
9. Biological Miracle – Wood Frog (nps.gov)
   503 points by thunderbong 7 days ago | 139 comments
10. Something weird is happening with LLMs and chess (dynomight.substack.com)
    695 points by crescit_eundo 8 days ago | 474 comments
11. BERTs Are Generative In-Context Learners (arxiv.org)
    141 points by fzliu 8 days ago | 45 comments
12. TinyTroupe, a new LLM-powered multiagent persona simulation Python library (github.com/microsoft)
    143 points by paulosalem 11 days ago | 48 comments
13. LoRA vs. Full Fine-Tuning: An Illusion of Equivalence (arxiv.org)
    236 points by timbilt 14 days ago | 53 comments
14. Evaluating the world model implicit in a generative model (arxiv.org)
    159 points by dsubburam 15 days ago | 45 comments
15. D2: Declarative Diagramming – A modern language that turns text to diagrams (d2lang.com)
    50 points by thunderbong 19 days ago | 7 comments
16. TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters (arxiv.org)
    174 points by og_kalu 21 days ago | 33 comments
17. The National EUV Accelerator comes to Albany (ibm.com)
    43 points by sandwichsphinx 21 days ago | 45 comments
18. AI Flame Graphs (brendangregg.com)
    316 points by JNRowe 24 days ago | 89 comments
19. Chain-of-thought can hurt performance on tasks where thinking makes humans worse (arxiv.org)
    371 points by benocodes 22 days ago | 250 comments
20. Don't implement unification by recursion (philipzucker.com)
    83 points by mathgenius 25 days ago | 62 comments
21. Numerical Precision Affects Mathematical Reasoning Capabilities of LLMs (arxiv.org)
    66 points by belter 35 days ago | 49 comments
22. The AI Investment Boom (apricitas.io)
    278 points by m-hodges 33 days ago | 376 comments
23. Phenomenal consciousness is alien to us: SETI and the Fermi paradox (sciencedirect.com)
    51 points by rbanffy 33 days ago | 69 comments
24. Why do random forests work? They are self-regularizing adaptive smoothers (arxiv.org)
    295 points by sebg 35 days ago | 41 comments
25. Use Prolog to improve LLM's reasoning (shchegrikovich.substack.com)
    379 points by shchegrikovich 39 days ago | 155 comments
26. DeepSeek: Advancing theorem proving in LLMs through large-scale synthetic data (arxiv.org)
    186 points by hhs 39 days ago | 54 comments
27. Machine learning and information theory concepts towards an AI Mathematician (arxiv.org)
    109 points by marojejian 41 days ago | 19 comments
28. Understanding the Limitations of Mathematical Reasoning in LLMs (arxiv.org)
    282 points by hnhn34 42 days ago | 266 comments
29. Google’s AI thinks I left a Gatorade bottle on the moon (edwardbenson.com)
    367 points by gwintrob 46 days ago | 194 comments
30. The Data Visualisation Catalogue: find the right method for your data (datavizcatalogue.com)
    295 points by sea-gold 48 days ago | 34 comments