Both the Internet Archive and Common Crawl have tools that reveal actual crawl dates. Search engines are not really intended to be archives, so it's no surprise that they aren't very good archives.
Is it, though? I think you have to define what your search engine is searching to make a claim like that. Internet Archive and Common Crawl (which I will say has its own incentives discouraging the discoverability of old sites through its methodology and limitations of its web crawling) are search engines in their own right.
What are you doing when you use their services? Searching.
Not really, because Nightshade should have made image labeling more difficult, but if you try it, you'll see that it doesn't do anything; multimodal models are too powerful nowadays to be fooled by small adversarial noise generated using CLIP LPIPS (small enough not to be too noticeable to us).
And Glaze does not try to interfere with labeling.
I live in downtown Palo Alto, and it has a lot of mid-rise buildings and is walkable. Expanding the area that this is true for is not something completely different.
I hear it is 2x to 3x that much depending on time of day in Santa Clara. Also, people built solar when there were different rules. The new rules make it much less advantageous.
That's right, batteries! They enable us to capitalize on energy price fluctuations.
You can of course charge them however you like, but I have a feeling Santa Clara's purported 3x price fluctuations are not due entirely to natural gas.
No. If you get the time of use plan in Santa Clara it's 4 cents more on-peak vs. off-peak [1]. The situation with PG&E's rates is completely asinine and is a political failure more than anything else.
You’re probably hearing that from people who are in Santa Clara but somehow served by PG&E rather than Silicon Valley Power (the municipal utility). Maybe they are right on the city boundary, or they live in some other city in Santa Clara County (SVP only serves the city of Santa Clara). PG&E off-peak rates for Silicon Valley are 40-60 cents/kWh.
No there’s not -JWST is largest mirror in orbit and it is a few meters smaller, despite being many smaller mirrors. The mirror size is largely due to a tunnel size on the way to the summit in Chile, and applies to the other nearby telescopes.
It’s also really hard to ship mirrors much larger than this on a boat.
I'm the CTO at the Common Crawl Foundation, which has a 17 year old, 8 petabyte crawl & archive of the web. Our open dataset has been cited in nearly 10,000 research papers, and is the most-used dataset in the AWS Open Data program. Our organization is also very active in the open source community.
We are expanding our engineering team. We're looking for someone who is:
* Excited about our non-profit, open data mission
* Proficient with Python, and hopefully also some Java
* Proficient at cloud systems such as Spark/PySpark
* Willing to learn the rest: crawling parsing indexing etc.
Many of the government archives are not public for copyright reasons.
reply