Data Crawler Staff

Toàn thời gian 121 lượt xem
  • TP. Hồ Chí Minh Cấp bậc:   Nhân viên
  • Mức lương:   12.000.000 - 18.000.000
  • Ngày đăng: 2025-02-13 00:44:52 Ngày hết hạn: 2025-03-15 00:44:52
  • Ngành nghề : Công nghệ số - CNTT - IT - Phần mềm - Thiết kế đồ hoạ web -phần cứng/mạng

Mô tả công việc

1. Professional Scraping System Development

Technical Requirements:

        System Architecture:

  • Design cross-platform Python crawling scripts

  • Build scalable systems

  • Develop parallel crawling solutions

  • Manage large, multi-threaded data streams

Technologies:

  • Scrapy, BeautifulSoup

  • Selenium

  • Asyncio, Multiprocessing

  • Proxy management

  • IP rotation techniques

2. Data Processing and Normalization

Processing Methods:

  • Develop API data cleaning processes

  • Data transformation algorithms

  • Integrity checks

  • Remove noisy data

Tools:

  • Pandas

  • Data validation techniques

  • Machine Learning preprocessing

3. Database Management

Specialized Skills:

    Advanced SQL:

  • Complex queries

  • Performance optimization

4. Monitoring & Optimization

Strategy:

  • Manage scraping system operations.

  • Track scraping performance

  • Challenge handling:

  • IP blocking

  • Speed ​​limiting

  • CAPTCHA

Quảng cáo / Dành cho nhà tài trợ
Liên hệ: E-mail: [email protected]
Điện thoại: (028) 38971633

Yêu cầu

 PROFESSIONAL REQUIREMENTS

Education

  • Bachelor's degree (GPA > 3.0)

  • Major:

  • Data science

  • Computer engineering

  • Data related fields

  • English: TOEIC > 700 of  IELTS >5.5

Technical Skills

Python Ecosystem

  • Asyncio, Multiprocessing

  •  Data cleaning techniques

  • Machine Learning preprocessing

  • Advanced error handling

Database & Big Data

  • SQL (Intermediate to Advanced)

  • NoSQL database management

  • PySpark

  • Data warehousing

In-depth Experience

  • Minimum 1-2 years

  • Project implementation:

  • Web scraping

  • Automatic data processing

  • Big data crawling

SOFT SKILLS

System analysis

Problem solving

Independent & team working

Time management

Logical thinking

NICE TO HAVE EXPERIENCES

Big Data experience

Data pipeline design

Working with diverse APIs

Professional certifications

Creativity and initiative in proposing ideas

Phúc lợi

-Enjoy full social insurance, health insurance, labor contracts, vacation days and other benefits according to state regulations.

-Parking allowance

-Regular annual salary increase

-Training and capacity development to meet job requirements and promotion path

-Participate in courses when necessary

-Weekly/monthly/quarterly/yearly bonuses and project bonuses

-Holiday/Tet bonuses

-Young, friendly and dynamic working environment.

-Travel: 1 time/year

  • Working hours: HC 07 hours/day (Morning from 08:00 - 11:30, Afternoon from 13:00 - 16:30), from Monday to Friday, off on Saturday & Sunday. 

  • Working equipment: provided