It is not monkey work. But A/B testing can often be more a trapping of scientifi...

It is not monkey work. But A/B testing can often be more a trapping of scientific than the real thing. Google UXR is hit or miss. They have done a number of poorly run studies on what shade of blue to color buttons without thinking about multiple comparison corrections, seasonality or evolution in display technologies. I am increasingly unconvinced that these efficiencies are meaningful for UX. Compilers are different and those optimizations are real and matter for both the bottom line and UX.