More

dplavery92 · 2024-11-05T21:43:57 1730843037

The title of Tencent's paper [0] as well as their homepage for the model [1] each use the term "Open-Source" in the title, so I think they are making the claim.

[0] https://arxiv.org/pdf/2411.02265 [1] https://llm.hunyuan.tencent.com/

dplavery92 · 2024-10-17T23:56:53 1729209413

Eight is a nice power of two.

donatj · 2024-10-18T00:01:11 1729209671

Can you explain how that's helpful? I'm not being obtuse, I just don't follow

spongebobstoes · 2024-10-18T00:06:45 1729210005

One thought is that it's always a whole number of bits (3) to bit-address within a byte. It's 3.5 bits to bit address a 10 bit byte. Sorta just works out nicer in general to have powers of 2 when working on base 2.

cogman10 · 2024-10-18T01:37:15 1729215435

This is basically the reason.

Another part of it is the fact that it's a lot easier to represent stuff with hex if the bytes line up.

I can represent "255" with "0xFF" which fits nice and neat in 1 byte. However, now if a byte is 10bits that hex no longer really works. You have 1024 values to represent. The max value would be 0x3FF which just looks funky.

Coming up with an alphanumeric system to represent 2^10 cleanly just ends up weird and unintuitive.

Spivak · 2024-10-18T03:24:50 1729221890

We probably wouldn't have chosen hex in a theoretical world where bytes were 10 bits, right? It would probably be two groups of 5 like 02:21 == 85 (like an ip address) or five groups of two 0x01111 == 85. It just has to be one of its divisors.

shultays · 2024-10-18T12:48:49 1729255729

  02:21

or instead of digits from 0 to F, the letters would go up V. 85 would be 0x2k I think (2 * 32 + 21)

inkyoto · 2024-10-18T00:45:11 1729212311

Because modern computing has settled on the Boolean (binary) logic (0/1 or true/false) in the chip design, which has given us 8 bit bytes (a power of two). It is the easiest and most reliable to design and implement in the hardware.

On the other hand, if computing settled on a three-valued logic (e.g. 0/1/«something» where «something» has been proposed as -1, «undefined»/«unknown»/«undecided» or a «shade of grey»), we would have had 9 bit bytes (a power of three).

10 was tried numerous times at the dawn of computing and… it was found too unwieldy in the circuit design.

davemp · 2024-10-18T01:22:30 1729214550

> On the other hand, if computing settled on a three-valued logic (e.g. 0/1/«something» where «something» has been proposed as -1, «undefined»/«unknown/undecided» or a «shade of grey»), we would have had 9 bit bytes (a power of three).

Is this true? 4 ternary bits give you really convenient base 12 which has a lot of desirable properties for things like multiplication and fixed point. Though I have no idea what ternary building blocks would look like so it’s hard to visualize potential hardware.

inkyoto · 2024-10-18T01:56:34 1729216594

It is hard to say whether it would have been 9 or 12, now that people have stopped experimenting with alternative hardware designs. 9-bit byte designs certainly did exist (and maybe even the 12-bit designs), too, although they were still based on the Boolean logic.

I have certainly heard an argument that ternary logic would have been a better choice, if it won over, but it is history now, and we are left with the vestiges of the ternary logic in SQL (NULL values which are semantically «no value» / «undefined» values).

davemp · 2024-10-18T01:07:24 1729213644

Many circuits have ceil(log_2(N_bits)) scaling wrt to propagation delay/other dimensions so you’re just leaving efficiency on the table if you aren’t using a power of 2 for your bit size.

bonzini · 2024-10-18T00:06:44 1729210004

It's easier to go from a bit number to (byte, bit) if you don't have to divide by 10.

dplavery92 · 2024-02-12T21:02:29 1707771749

You can also construct multiple hypothesis trackers from multiple Kalman Filters, but there is a little more machinery. For example, Interacting Multiple Models (IMM) trackers may use Kalman Filters or Particle Filters, and a lot of the foundational work by Bar-Shalom and others focuses on Kalman Filters.

dplavery92 · 2024-02-12T20:55:53 1707771353

The Kalman filter has a family of generalizations in the Extended Kalman Filter (EKF) and Unscented Kalman Filter (UKF.)

Also common in robotics applications is the Particle Filter, which uses a Monte Carlo approximation of the uncertainty in the state, rather than enforcing a (Gaussian) distribution, as in the traditional Kalman filter. This can be useful when the mechanics are highly nonlinear and/or your measurement uncertainties are, well, very non-Gaussian. Sebastian Thrun (a CMU robotics professor in the DARPA "Grand Challenge" days of self-driving cars) made an early Udacity course on Particle Filters.

dplavery92 · on Dec 19, 2023

I was encountering the same problem on my Intel MBP, and per another one of the comments here, find that switching from Chrome to Safari to view the page allows me to view the whole page, view it smoothly, and without my CPU utilization spiking or my fans spinning up.

tonyarkles · on Dec 19, 2023

Yeah, I just checked to see what this machine is... Mid 2015 15" Retina MBP with an AMD Radeon R9 and the page is buttery smooth in Safari.

dplavery92 · on Nov 17, 2023

I don't think anyone in this thread knows what happened, but since we're in a thread speculating why the CEO of the leading AI company was suddenly sacked, the possibility of an unacceptable interpersonal scandal isn't any more outlandish than others' suggestions of fraud, legal trouble for OpenAI, or foundering financials. The suggestion here is simply that Altman having done something "big and dangerous" is not a foregone conclusion.

In the words of Brandt, "well, Dude, we just don't know."

dplavery92 · on Nov 6, 2023

Correct, the article places UHZ1 at 13.2 billion light-years away, so roughly ~500 Gy into our 13.7-billion-year-old universe.

wolverine876 · on Nov 6, 2023

dplavery92 · on Oct 27, 2023

https://en.wikipedia.org/wiki/Earthrise

xcv123 · on Oct 27, 2023

That's only from orbit. The Moon is tidally locked to Earth, so Earth remains in roughly the same spot in the sky.

adastra22 · on Oct 27, 2023

It has a wobble. Near the limb or poles you would see an earthrise. This is shown in the beginning of the second season of For All Mankind btw.

hinkley · on Oct 27, 2023

Kim Stanly Robinson also hits on this in Red Moon. Someone is emotionally invested in having a particular plot of moon where the earth peeks out through a gap in a crater wall during the wobble.

dplavery92 · on Oct 9, 2023

From the captioned art in the article: "Siege, from the Peterborough Psalter, early 14th century, via the KBR Museum, Belgium. Yes, those defenders are all women."

dplavery92 · on Aug 2, 2023

I think a great place to start is https://www.bzarg.com/p/how-a-kalman-filter-works-in-picture...

Unlike the OP article, it does make use of the math formalism for Kalman filters, but it is a relatively gentle introduction that does a very good job visualizing and explaining the intuition of each term. I have gotten positive feedback (no pun intended!) from interns or junior hires using this resource to familiarize themselves with the topic.

If you are making a deeper study and are ready to dive into a textbook that more thoroughly explores theory and application, there is a book by Gibbs[1] that I have used in the past and is well-regarded in some segments of industry that rely on these techniques for state estimation and GNC.

[1] https://onlinelibrary.wiley.com/doi/book/10.1002/97804708900...