let's build containerized crawlee - work repository for test.

Overview

Crawlee-ing on container project!

  • NodeJS 기반 컨테이너 + Crawlee 오픈소스 테스트용 레포지토리입니다.

Crawlee는 크롤링 및 스크래이핑을 빠르게 구축하는 데 필요한 npm package입니다.

Features

  • Docker-compose 기반으로 Crawlee 오픈소스 테스트를 수행합니다.

  • Crawlee Tutorial을 따라 on-boarding

  • Crawlee Docs를 따라 지원 기능 구현

  • .env 파일로 Crawlee 크롤링 구성 도전

  • postgresql으로 크롤링 결과 데이터 저장

목표는 진행 과정에서 유동적으로 추가됩니다.


Prerequisite

�프로젝트를 시작하기 전에 필요한 환경입니다.

Environment Tested Version
docker Engine v20.10.11
Git 2.35.3
NodeJs (Optional) >= 16v

Getting Start

  1. Docker Container로 모든 작업을 처리합니다.
  2. 3가지의 Docker Image를 사용합니다.
    1. dev: v16+의 nodejs 이미지입니다. (presentj94/nodejs)
    2. job: puppetear-crawlee 이미지입니다. (presentj94/crawlee-job)
    3. play: playwright-crawlee 이미지입니다. (presentj94/crawlee-play)
  3. 주요 스크립트는 세 가지로 운영됩니다.
    1. ./crawling.sh
      1. 크롤링 코드(main.mjs)를 실행하는 docker runner를 생성합니다.
      2. runner, directory, index 순서로 입력합니다.
      3. example) ./crawling.sh job tutorial 02
    2. ./log-last.sh
      1. 가장 최근 특정 runner에 의해 실행된 directory/index의 컨테이너 로그를 보여줍니다.
      2. runner, directory, index 순서로 입력합니다.
      3. example) ./log-last.sh job tutorial 02
    3. ./clean.sh
      1. 가장 최근 특정 runner에 의해 실행된 directory/index의 컨테이너를 삭제하고, 그 컨테이너의 결과 디렉토리도 함께 삭제합니다.
      2. runner, directory, index 순서로 입력합니다.
      3. example) ./clean.sh job tutorial 02
  4. 자세한 사용 방법은 USAGES.md 를 확인해주세요.

With NodeJS

npm run init
npm run build
./crawling.sh job tutorial 02

Git Strategy & commit convention

Git Branching Strategy: git workflow

main branch

최종 제품 코드. develop branch로부터 PR을 통해 전달함.

develop branch

실제 코드 관련 개발. main branch 이전 모든 브랜치 코드를 병합함.

document branch

readme, docs 등 문서를 작성함.

Commit convention: Udacity Commit Message Convention


License

Apache 2.0 License

You might also like...

This repository contains a basic example on how to set up and run test automation jobs with CircleCI and report results to Testmo.

CircleCI test automation example This repository contains a basic example on how to set up and run test automation jobs with CircleCI and report resul

Dec 23, 2021

Example-browserstack-reporting - This repository contains an example of running Selenium tests and reporting BrowserStack test results, including full CI pipeline integration.

BrowserStack reporting and Selenium test result example This repository contains an example of running Selenium tests and reporting BrowserStack test

Jan 1, 2022

A Zotero add-on that scans your Markdown reading notes, tags the associated Zotero items, and lets you open notes for the Zotero items in Obsidian.

A Zotero add-on that scans your Markdown reading notes, tags the associated Zotero items, and lets you open notes for the Zotero items in Obsidian.

Zotero Obsidian Citations Adds colored tags to Zotero items that have associated Markdown notes stored in an external folder. Open an associated Markd

Jan 4, 2023

Codism is a Codepen inspired coding playgound that lets you add and modify HTML, CSS and JavaScript

Codism Codism is a Codepen inspired coding playgound that lets you add and modify HTML, CSS and JavaScript to create in order to create cool stuff! Yo

Nov 24, 2022

Lets you add a character to Hacker News links to add social media and OpenGraph previews for sharing on things like Slack or Twitter.

Lets you add a character to Hacker News links to add social media and OpenGraph previews for sharing on things like Slack or Twitter.

news.ycombinator1.com Lets you add a character to Hacker News links to add social media and OpenGraph previews for sharing on things like Slack or Dis

Sep 18, 2022

A type speed checking website which lets you check your typing speed and shows the real-tme leaderboards with mongodb as DB and express as backend

This is a Next.js project bootstrapped with create-next-app. Getting Started First, run the development server: npm run dev # or yarn dev Open http://

Mar 27, 2022

zkPoB is a mobile compatible tool that lets anyone prove they own a Bufficorn (or any NFT) without revealing which Buffi they own or the address they are verifying themselves with

zkPoB is a mobile compatible tool that lets anyone prove they own a Bufficorn (or any NFT) without revealing which Buffi they own or the address they are verifying themselves with

zkPoB is a mobile compatible tool that lets anyone prove they own a Bufficorn (or any NFT) without revealing which Buffi they own or the address they are verifying themselves with

Aug 25, 2022

Inside-out promise; lets you call resolve and reject from outside the Promise constructor function.

Inside-out promise; lets you call resolve and reject from outside the Promise constructor function.

Feb 28, 2022

Plugin that lets you create diagrams from textual representation (aka 'Diagrams as Code') within Logseq

Plugin that lets you create diagrams from textual representation (aka 'Diagrams as Code') within Logseq

Logseq - Diagrams as Code Plugin that lets you create diagrams (and other visualizations) from textual representation (aka 'Diagrams as Code') within

Dec 21, 2022
Owner
Present_Jay.Dev
Hi, I'm Jay, love writing, coding, and communicating. and recently interested in Keycloak, Kubernetes (and Rancher).
Present_Jay.Dev
Hemsida för personer i Sverige som kan och vill erbjuda boende till människor på flykt

Getting Started with Create React App This project was bootstrapped with Create React App. Available Scripts In the project directory, you can run: np

null 4 May 3, 2022
Kurs-repo för kursen Webbserver och Databaser

Webbserver och databaser This repository is meant for CME students to access exercises and codealongs that happen throughout the course. I hope you wi

null 14 Jan 3, 2023
An ultra-lightweight self-hosted CI solution with a dashboard and containerized runners

An extremely simple containerized CI server. Ecosystem The Candor ecosystem is straightforward, and entirely containerized. Docker runs on the host ma

Paul Huebner 8 Nov 20, 2022
An example T3 app containerized with Docker, Docker Compose, and deployed on Railway

Create T3 App This is an app bootstrapped according to the init.tips stack, also known as the T3-Stack. What's next? How do I make an app with this? W

Anthony Campolo 15 Dec 19, 2022
This repository contains an Advanced Zoom Apps Sample. It should serve as a starting point for you to build and test your own Zoom App in development.

Advanced Zoom Apps Sample Advanced Sample covers most complex scenarios that you might be needed in apps. App has reference implementation for: Authen

Zoom 11 Dec 17, 2022
A JavaScript Library To Make Your Work Work Easier/Faster

Functionalty.js (beta) About ✍️ This Is A JavaScript Library To Make Your Work Easier/Faster, You Can See Functionalty.js Website From Here Project Cr

Ali-Eldeba 16 Aug 30, 2022
A JavaScript Library To Make Your Work Work Easier/Faster

Functionality.js (beta) About ✍️ This Is A JavaScript Library To Make Your Work Easier/Faster, You Can See Functionalty.js Website From Here Project C

Ali-Eldeba 9 May 25, 2022
A JavaScript Library To Make Your Work Work Easier/Faster

Functionality.js About ✍️ This Is A JavaScript Library To Make Your Work Easier/Faster, You Can See Functionalty.js Website From Here Project Created

functionality 16 Jun 23, 2022
Jester is a test-generation tool to create integration test code.

Code Generator for Integration Tests Introduction Welcome to Jester: An easy-to-use web application that helps you create and implement integration te

OSLabs Beta 54 Dec 12, 2022
The co-work repository of HIWMS project group

_ooOoo_ o8888888o 88" . "88 (| -_- |) O\ = /O ____/'---'\____ .' \\|

null 3 Jun 30, 2022