About this role
In this role, you'll leverage your technical expertise to develop and maintain web scraping solutions that contribute to the training of next-generation AI systems. Your efforts will directly impact how models learn and perform by providing high-quality, real-world data inputs. This position is ideal for individuals with domain knowledge who are looking to apply their skills in a meaningful way, without the need for prior AI experience.
Key Responsibilities:
- Design, develop, and maintain robust web scraping scripts and applications.
- Extract data from multiple websites, managing both static and dynamic content.
- Clean, validate, and structure large volumes of scraped data for analysis.
- Monitor scraping pipelines for data quality and troubleshoot any failures.
- Implement solutions to bypass anti-scraping mechanisms to ensure effective data collection.
- Collaborate with team members to define data requirements and deliverables.
- Document scraping processes and maintain code for scalability and reusability.
Qualifications:
- Proven experience in building and maintaining web scrapers using tools such as Python, Scrapy, Selenium, or similar technologies.
- Strong understanding of HTML, CSS, JavaScript, and web protocols.
- Expertise in handling APIs, RESTful interfaces, and parsing JSON/XML data.
- Excellent written and verbal communication skills to convey technical information clearly.
- Proficiency in data cleaning, manipulation, and storage using relational or NoSQL databases.
- Knowledge of web security concepts and best practices for ethical scraping.
- Ability to troubleshoot and resolve issues autonomously.
- Experience with cloud-based scraping and deployment (AWS, Azure, or GCP).
- Familiarity with version control systems like Git.
- Background in large-scale data extraction projects.
Part-time contractor position, fully remote.
Compensation: $20 - $50 per hour.
Eligibility: Open to candidates with relevant skills and experience.