Hacker News new | past | comments | ask | show | jobs | submit | mariapraetzel's comments login

Internet Archive, archive.org | Senior Web Developer | San Francisco, CA or Remote - Full Time

The Senior Web Developer will be responsible for maintaining and building new functionality for our web archiving services.

Maintenance and development of backend and API systems written in Django/Python, Maintenance of an application frontend written in Javascript/AngularJS (1.x) Migration of a large Java codebase and legacy deployment systems to Python and Ansible Configuration and monitoring of complex distributed applications Contribute to development of tools for automated deployment and monitoring of production systems. Demonstrated experience delivering complex development projects, managing multiple deadlines and projects simultaneously, and working in a collaborative team of engineers and project/product managers.

Skills & Requirements

3-4 years of experience in Python and Unix/Linux shell 3-4 years of experience in frontend/Javascript coding Solid experience in Internet protocols (HTTP is must.) Strong knowledge of HTML, JavaScript and Web technologies in general Ability to work in, and enjoy, a loosely structured work environment

To apply please email cover letter, salary expectations, and resume to jkafader[at]archive[.]org. Full job description: https://archive.org/about/jobs.


Internet Archive | Web Crawl Engineer, Archive-It - San Francisco, CA or remote - Full Time

Running large-scale web harvests on global and national domain levels and focused and specialized crawls using Heritrix, our open-source crawler, as well as other open-source technologies developed internally, including Umbra, Brozzler, warcprox and others. Configuration, monitoring, and improvement of large-scale web crawls to ensure their quality and timely completion. Processing, analysis and quality assurance of archived web content to ensure it is complete and of the highest quality. Contribute to development of tools for automated analysis and reporting of crawl material, and to development projects focused on crawling, processing, and access. Manage both large ingests and exports of web data, derivatives, logs, and reports. Demonstrated experience of delivering on commitments with deadlines and project timelines and working in a collaborative team of engineers and project/product managers.

Skills & Requirements

Experience in Unix shell scripting and Python coding required Experience with web crawlers or scrapers, especially Heritrix Solid experience in Internet protocols (HTTP is must.) Strong knowledge of HTML, JavaScript and Web technologies in general Ability to work in, and enjoy, a loosely structured work environment

To Apply: To apply please email cover letter, salary expectations, and résumé to jobs+crawlengineer@archive.org with the subject line "Web Crawl Engineer."


Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: