NodeJS Web Data Engineer (Data Scraping)

TYPE OF WORK

Full Time

WAGE / SALARY

₱136,992/mo

HOURS PER WEEK

40

DATE UPDATED

Aug 10, 2025

JOB OVERVIEW

## NodeJS Web Data Engineer (Data Scraping)

30–40 hrs/week (remote)

### Overview

Help build and maintain NodeJS-based web data scraping pipelines. You’ll work on extracting data from websites and APIs, cleaning and validating it, and delivering structured outputs. We value clear, reliable code, strong testing habits, and excellent communication.

### Responsibilities

- Build and maintain web scrapers and data ingestion scripts in Node.js/TypeScript.
- Use browser automation (Puppeteer/Playwright) and HTTP APIs to collect data.
- Parse HTML/JSON, extract fields, and implement data validation.
- Improve reliability: retries with backoff, rate limiting, error handling, and logging.
- Write unit/integration tests (Vitest) with fixtures and mocks.
- Collaborate via ----------; provide clear async updates and participate in code reviews.

### Skills

- TypeScript and Node.js
- Browser automation: Puppeteer or Playwright
- HTTP clients (fetch/`undici`), cookies/headers, and request debugging
- HTML parsing (DOM selectors, `cheerio` or similar) and regex-based extraction
- Data validation with Zod
- Concurrency control (`p-limit`) and retry/backoff (`p-retry`)
- Testing with Vitest; fixtures and network/DOM mocks
- Git, `pnpm`, and monorepo workflows
- Bonus: Next.js/React, Express/GraphQL, Docker, logs/metrics, `puppeteer-extra` stealth

### Requirements

- 2+ years with Node.js and TypeScript; solid async/Promise control flow.
- Hands-on scraping experience (selectors, anti-bot tactics, HTTP debugging).
- Strong testing habits and attention to reliability.
- Comfortable with data parsing and validation (regex, Zod).
- Clear written and spoken English.

### Nice to Have

- Playwright; `puppeteer-extra` + stealth plugin.
- Next.js/React; Express/GraphQL exposure.
- Observability (structured logs, basic metrics); Docker; cloud CI.
- Experience with idempotent upserts and change detection.

### Evaluation & Hiring Process

- 1-week paid trial (USD $500): Join our central ----------, connect 1–2 sources end-to-end, include tests and a brief README. We review code quality, reliability, and throughput.
- Top performers advance to full-time: USD $10/hour, 30–40 hrs/week. Overtime paid. Growth potential with raises and bonuses for strong performance.

### Schedule & Perks

- Typical hours: at least 4 hours of overlap with Eastern Time (9am–1pm ET). We meet in the evening (ET) to assign and clarify tasks; you work and submit, and we review the following morning (ET).
- Lots of available work; advancement opportunities and performance bonuses.

### How to Apply

Make sure to follow all the steps below or I will delete your application!

1. Change the subject to: "Full Stack NodeJS Applicant! [insert the name of the creator of Node.js here]"
2. Write 2–3 sentences about why you are a good fit for this position. Do not write more than 3 sentences.
3. Human check — answer ONE of the following random questions (pick any; keep it short):
   - Who created Git?
   - In what year was Node.js first released?
   - Who co-founded Apple with Steve Jobs (other than Ronald Wayne)?
4. Include numbered items:
   a. Years of experience with Node.js + TypeScript
   b. Years of experience with browser automation/scraping (Puppeteer/Playwright)
   c. Years of total software development experience
   d. Link to a sample scraper or backend (repo) and a working sample/output
   e. Link to a test suite (Vitest) showing mocks/fixtures for HTTP or DOM parsing
   f. Optional: link to a small React/Next.js internal tool/dashboard you built
   g. Your weekly availability, time zone, and earliest start date
   h. Confirmation you’re comfortable with the trial (USD $500) and rate (USD $10/hr)
