Internet Archive to ignore robots.txt directives

Robots (or spiders, or crawlers) are little computer programs that search engines use to scan and index websites. Robots.txt is a little file placed on webservers to tell search engines what they should and shouldn’t index. The Internet Archive isn’t a search engine, but it has historically obeyed exclusion requests from robots.txt files. Now it’s changing its mind, because robots.txt is almost always crafted with search engines in mind and rarely reflects the intentions of domain owners when it comes to archiving.

From the Internet Archive: Over time we have observed that the robots.txt files that are geared toward search engine crawlers do not necessarily serve our archival purposes. Internet Archive’s goal is to create complete “snapshots” of web pages, including the duplicate content and the large versions of files. We have also seen an upsurge of the use of robots.txt files to remove entire domains from search engines when they transition from a live web site into a parked domain, which has historically also removed the entire domain from view in the Wayback Machine. In other words, a site goes out of business and then the parked domain is “blocked” from search engines and no one can look at the history of that site in the Wayback Machine anymore. We receive inquiries and complaints on these “disappeared” sites almost daily. A few months ago we stopped referring to robots.txt files on U.S. government and military web sites for both crawling and displaying web pages (though we respond to removal requests sent to info@archive.org). As we have moved towards broader access it has not caused problems, which we take as a good sign. We are now looking to do this more broadly.

An excellent decision. To be clear, they’re ignoring robots.txt even if you explicitly identify and disallow the Internet Archive. It’s a splendid reminder that nothing published on the web is ever meaningfully private, and that everything will always go on your permanent record.
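
For readers who have never looked inside one, here is a minimal sketch of how a well-behaved crawler consults robots.txt, using Python's standard `urllib.robotparser`. The file contents, the `example.com` URLs, the `SomeSearchBot` name and the `is_allowed` helper are all made up for illustration, and `ia_archiver` is assumed here to be the user-agent name that site owners have typically used to single out the Internet Archive.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: shuts out one crawler by name (assumed here to be
# the Internet Archive's) and keeps every other crawler out of /private/.
ROBOTS_TXT = """\
User-agent: ia_archiver
Disallow: /

User-agent: *
Disallow: /private/
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for agent, url in [
        ("ia_archiver", "https://example.com/index.html"),
        ("SomeSearchBot", "https://example.com/index.html"),
        ("SomeSearchBot", "https://example.com/private/page.html"),
    ]:
        print(agent, url, is_allowed(ROBOTS_TXT, agent, url))
```

Nothing enforces any of this: the check only matters if the crawler chooses to run it. A crawler that ignores robots.txt simply skips the lookup and fetches the page anyway.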

Mathematical conjecture generates beautiful lifelike form

The deceptively simple Collatz Conjecture is one of mathematics’ most difficult puzzles. Alex Bellos shows off a cool rendering by Edmund Harris that looks like a beautiful life form from the sea.
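
To see why the conjecture counts as "deceptively simple", here is a short sketch of the rule behind it: halve an even number, triple an odd number and add one, and repeat. The conjecture says every positive starting number eventually reaches 1. The function name `collatz_sequence` is just an illustrative choice, and this code only generates the number sequences; it makes no attempt to reproduce Harris's rendering.

```python
def collatz_sequence(n: int) -> list[int]:
    """Return the Collatz sequence starting at n and ending at 1."""
    if n < 1:
        raise ValueError("n must be a positive integer")
    sequence = [n]
    while n != 1:
        # Even numbers are halved; odd numbers are tripled and incremented.
        n = n // 2 if n % 2 == 0 else 3 * n + 1
        sequence.append(n)
    return sequence


if __name__ == "__main__":
    print(collatz_sequence(6))  # [6, 3, 10, 5, 16, 8, 4, 2, 1]
```

The loop is only guaranteed to finish if the conjecture holds for the chosen starting value. Nobody has proved that it holds for every number, which is the whole puzzle.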

Blue horseshoe crab blood sells for up to $14,000 per quart

Unfortunately for horseshoe crabs, their blue blood is so good at detecting harmful bacteria that the hapless critters are being scooped up by the hundreds to be attached to industrial horseshoe crab blood milking stations. Now the International Union for Conservation of Nature has categorized the American horseshoe crab as “vulnerable” to extinction.

From Popular Mechanics: Their distinctive blue blood is used to detect dangerous Gram-negative bacteria such as E. coli in injectable drugs such as insulin, implantable medical devices such as knee replacements, and hospital instruments such as scalpels and IVs. Components of this crab blood have a unique and invaluable talent for finding infection, and that has driven up an insatiable demand. Every year the medical testing industry catches a half-million horseshoe crabs to sample their blood. But that demand cannot climb forever. There’s a growing concern among scientists that the biomedical industry’s bleeding of these crabs may be endangering a creature that’s been around since dinosaur days. There are currently no quotas on how many crabs one can bleed because biomedical laboratories drain only a third of the crab’s blood, then put them back into the water, alive. But no one really knows what happens to the crabs once they’re slipped back into the sea. Do they survive? Are they ever the same?

New materials allow 2.8l/day of solar-powered desert water-vapor extraction

Researchers from MIT, UC Berkeley, Lawrence Berkeley, and King Abdulaziz City for Science and Technology published a paper in Science describing a solar-powered device that uses a new type of metal-organic framework (MOF) to extract up to three litres of water per day from even the most arid desert air.

Prison inmates built working PCs out of ewaste, networked them, and hid them in a closet ceiling

Inmates in Ohio’s Marion Correctional Institution smuggled computer parts out of an ewaste recycling workshop and built two working computers out of them, hiding them in the ceiling of a training room closet and covertly patching them into the prison’s network.

Hackers hijacked a bank’s DNS and spent 5 hours raiding its customers’ accounts

Kaspersky Lab reports that an unnamed large Brazilian financial institution with $27B in assets was compromised by hackers who took over its DNS by hijacking its NIC.br account. For 5 hours they were able to impersonate the bank to all its online customers (and possibly to control its ATMs) in order to plunder their accounts and steal their credit card details.

Scuttlebutt: an "off-grid" P2P social network that runs without servers and can fall back to sneakernet

Dominic Tarr is a developer who lives on a self-steering sailboat in New Zealand; he created Scuttlebutt, a secure messaging system that can run without servers, even without ISPs.

Inuit cartography: maps carved in driftwood

The Inuit carve portable, waterproof, floating maps out of driftwood for use in navigating the littoral. These three wooden maps show the journey from Sermiligaaq to Kangertittivatsiaq, on Greenland’s East Coast. The map to the right shows the islands along the coast, while the map in the middle shows the mainland and is read from one side of the block around to the other. The map to the left shows the peninsula between the Sermiligaaq and Kangertivartikajik fjords. From The Decolonial Atlas, an antidote to all the other ones: Kurdistan in Kurdish, Lakota Territory, Agricultural Maps.
