Extract data-like things from a website on the fly.

Overview

There-should-be-an-API

Oh, I think this website needs an API.

Extract data-like things from a website on the fly.

Demo

The demo API is hosted on a 256MB RAM server with limited resources, so don't expect it to be fast.

https://tsbaa-jacoblincool.koyeb.app/?url=

You can check X-Parse-Time header in the response to see how long it took to parse the target page.

Also, there are some other options:

  • &hash: replace selectors with their hashes, default is true. I believe the selectors of the target page are not important in most cases.
  • &mode: simple or full, default is simple.
  • &min: threshold of the minimum acceptable size of group, default is 5.
  • &max: threshold of the maximum acceptable size of group, default is Infinity.
  • &content_min: threshold of the minimum acceptable size of text content, default is 5.
  • &content_max: threshold of the maximum acceptable size of text content, default is Infinity.

Examples

https://tsbaa-jacoblincool.koyeb.app/?url=https://ani.gamer.com.tw/animeList.php
https://tsbaa-jacoblincool.koyeb.app/?url=https://news.google.com/topstories
https://tsbaa-jacoblincool.koyeb.app/?url=https://github.com/trending
You might also like...

The frontend of a full stack application of a personal wallet made with React, Node and MongoDB that allows you to add inputs, outputs and see all your extract.

The frontend of a full stack application of a personal wallet made with React, Node and MongoDB that allows you to add inputs, outputs and see all your extract.

The frontend of a full stack application of a personal wallet made with React, Node and MongoDB that allows you to add inputs, outputs and see all your extract.

Jun 2, 2022

The backend of a full stack application of a personal wallet made with React, Node and MongoDB that allows you to add inputs, outputs and see all your extract.

The backend of a full stack application of a personal wallet made with React, Node and MongoDB that allows you to add inputs, outputs and see all your extract.

My first full stack application with the concept of a personal wallet that allows you to create a personal account to keep track of your entire statement by adding incoming and outgoing transactions, as well as calculating the total balance and being able to edit and delete old transactions.

Jun 23, 2022

Converts your IPv4 address to a 4x4 2-bit PNG which you can extract the IP from.

Converts your IPv4 address to a 4x4 2-bit PNG which you can extract the IP from.

IP-to-PNG Converts your IPv4 address to a 4x4 2-bit PNG which you can extract the IP from. https://www.npmjs.com/package/ip2png Run npm install ip2png

Nov 30, 2022

The ICPverse service to extract the metadata from an existing NFT project on the Internet Computer.

Infinity_Rank The ICPverse service to extract the metadata from an existing NFT project on the Internet Computer. Requirements for Use: npm installed

Nov 5, 2022

A logseq plugin to extract tweets. Optionally tag the tweet's author as well.

A logseq plugin to extract tweets. Optionally tag the tweet's author as well.

Logseq Twitter Extractor Plugin If this plugin helps you, I'd really appreciate your support. You can buy me a coffee here. A quick utility to parse a

Nov 17, 2022

Obsidian Extract PDF Annotations Plugin

Obsidian Extract PDF Annotations Plugin

Obsidian Extract PDF Annotations Plugin This is a plugin for Obsidian. It extracts Annotations from PDF Files. Usage This Plugin visits all PDF files

Dec 22, 2022

Compare 👥 and extract 🔬 differences between JSON values

json-changes Easily compare changes in JSON-like values. This can be especially useful when you want to compare an incoming payload with a counterpart

Aug 23, 2022

Tiny and fast utility to extract all possible values for a given enum.

Tiny (208B) and fast utility to extract all possible values for a given enum.

Apr 18, 2022
Comments
  • Configure Renovate

    Configure Renovate

    Mend Renovate

    Welcome to Renovate! This is an onboarding PR to help you understand and configure settings before regular Pull Requests begin.

    🚦 To activate Renovate, merge this Pull Request. To disable Renovate, simply close this Pull Request unmerged.


    Detected Package Files

    • Dockerfile (dockerfile)
    • package.json (npm)

    Configuration

    🔡 Renovate has detected a custom config for this PR. Feel free to ask for help if you have any doubts and would like it reviewed.

    Important: Now that this branch is edited, Renovate can't rebase it from the base branch any more. If you make changes to the base branch that could impact this onboarding PR, please merge them manually.

    What to Expect

    With your current configuration, Renovate will create 5 Pull Requests:

    fix(deps): update dependency esno to ^0.16.0
    • Schedule: ["at any time"]
    • Branch name: renovate/esno-0.x
    • Merge into: main
    • Upgrade esno to ^0.16.0
    chore(deps): update dependency @​types/node to v18
    • Schedule: ["at any time"]
    • Branch name: renovate/node-18.x
    • Merge into: main
    • Upgrade @types/node to ^18.0.0
    chore(deps): update jest monorepo to v29 (major)
    • Schedule: ["at any time"]
    • Branch name: renovate/major-jest-monorepo
    • Merge into: main
    • Upgrade @types/jest to ^29.0.0
    • Upgrade jest to ^29.0.0
    • Upgrade ts-jest to ^29.0.0
    fix(deps): update dependency jsdom to v20
    • Schedule: ["at any time"]
    • Branch name: renovate/jsdom-20.x
    • Merge into: main
    • Upgrade jsdom to ^20.0.0
    • Upgrade @types/jsdom to ^20.0.0
    fix(deps): update dependency node-fetch to v3
    • Schedule: ["at any time"]
    • Branch name: renovate/node-fetch-3.x
    • Merge into: main
    • Upgrade node-fetch to 3.3.0

    🚸 Branch creation will be limited to maximum 2 per hour, so it doesn't swamp any CI resources or spam the project. See docs for prhourlylimit for details.


    ❓ Got questions? Check out Renovate's Docs, particularly the Getting Started section. If you need any further assistance then you can also request help here.


    This PR has been generated by Mend Renovate. View repository job log here.

    opened by renovate[bot] 0
Owner
JacobLinCool
Freshman @ NTNU CSIE
JacobLinCool
This project is used to extract media from various posting platfroms like Twitter, Reddit, Pixiv, Youtube and many other

Social-Picker-API This project is used to extract media from various posting platfroms like Twitter, Reddit, Pixiv, Youtube and many others. It's writ

Serge 11 Nov 29, 2022
LiveTabs is a Javascript library that allows you to create and manage tabs on the fly. This library gives the ability to your application to act like browser tabs, making dynamic tabs.

LiveTabs Table of content Description Goals Technologies Setup Description LiveTabs is a Javascript library that allows you to create and manage tabs

Hossein Khalili 3 May 3, 2022
Lets you add a character to Hacker News links to add social media and OpenGraph previews for sharing on things like Slack or Twitter.

news.ycombinator1.com Lets you add a character to Hacker News links to add social media and OpenGraph previews for sharing on things like Slack or Dis

Ian Langworth ☠ 38 Sep 18, 2022
A browser extension to simplify web pages and hide distracting things like hide cookie banners, auto-playing videos, sidebars, etc

Unclutter Browser Extension A browser extension to simplify web pages and hide distracting things like hide cookie banners, auto-playing videos, sideb

null 849 Jan 9, 2023
A work-in-progress HTML sanitizer that strives for: performance like window.Sanitizer, readiness like DOMPurify, and ability to run in a WebWorker like neither of those.

Amuchina A work-in-progress HTML sanitizer that strives for: performance like window.Sanitizer, readiness like DOMPurify, and ability to run in a WebW

Fabio Spampinato 9 Sep 17, 2022
Chris Siku 13 Aug 22, 2022
To-do-List website is a page that contains information about listed shores and things to do .

Project Name To-do-List Description the project. To-do-List website is a page that contains information about listed shores and things to do . Built W

PIESIE YEBOAH 8 Aug 8, 2022
A tampermonkey script that adds functionality to the midjourney.com website to make it easier to do things.

MidJourneyTools A tampermonkey script that adds functionality to the midjourney.com website to make it easier to do things. Setup Instructions Make su

Nikolas 42 Dec 24, 2022
To-do-List website is a page that contains information that you already listed of shores and things to do.

Project Name To-do-List Description the project. To-do-List website is a page that contains information that you already listed of shores and things t

Laasri Said 9 Oct 14, 2022
Cindy Dorantes 12 Oct 18, 2022