Latest commit: e9cb85c695 by Simon Vieille (release v1.3.0)



This project is a basic tool for scraping data from a website using a CSS selector.

For example, to retrieve the number of releases of a project hosted on GitHub:

With CLI

node src/cli.js \
  --url \
  --selector '.repository-content .numbers-summary li:nth-child(4) a' \
  --tags \
  --breaks \
  --spaces

...will show XXX releases.

For more help, run node src/cli.js --help.

With code

const scraper = require('deblan-scraper')

const options = {
  url: '',
  acceptAllStatus: false, // Optional, default is `false`
  method: 'GET', // Optional, default is `GET`
}

const isMultiple = false // `false` returns the first result, `true` returns an array of results

const selector = '.repository-content .numbers-summary li:nth-child(4) a'

const filters = {
  tags: null, // Removes tags. You can specify the tags to remove (separated by comma)
  breaks: null, // Removes breaks (\n, \r)
  spaces: null, // Replaces 2 successive spaces by 1, except breaks
  trim: null, // Strips whitespace from the beginning and end of the value
}

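// ... (the call that runs the scraper is elided here; only its two callbacks remain)
//   function(value) { ... } // success callback
//   function(error) { ... } // error callback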
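To make the four filters described in the comments above more concrete, here is a minimal sketch of how each one could behave on a string. It is an illustration only, not the package's actual implementation: the helper names (`removeTags`, `removeBreaks`, `collapseSpaces`, `trimValue`, `applyFilters`) are hypothetical, and it assumes that a key being present in `filters` enables the filter, with `null` meaning "no argument".

```javascript
// Hypothetical helpers: illustrative only, not deblan-scraper's real code.

// Remove HTML tags. Without a list, every tag is stripped; with a
// comma-separated list such as 'b,i', only those tags are stripped
// (the text inside them is kept).
function removeTags(value, tags) {
  if (!tags) {
    return value.replace(/<\/?[a-z][^>]*>/gi, '')
  }
  const names = tags.split(',').map((t) => t.trim()).filter(Boolean)
  const pattern = new RegExp(`</?(?:${names.join('|')})\\b[^>]*>`, 'gi')
  return value.replace(pattern, '')
}

// Remove line breaks (\n, \r).
function removeBreaks(value) {
  return value.replace(/[\r\n]+/g, '')
}

// Collapse two or more successive whitespace characters into one space,
// leaving line breaks untouched.
function collapseSpaces(value) {
  return value.replace(/[^\S\r\n]{2,}/g, ' ')
}

// Strip whitespace from the beginning and end of the value.
function trimValue(value) {
  return value.trim()
}

// Assumption: a key present in `filters` enables that filter.
function applyFilters(value, filters) {
  let result = value
  if ('tags' in filters) result = removeTags(result, filters.tags)
  if ('breaks' in filters) result = removeBreaks(result)
  if ('spaces' in filters) result = collapseSpaces(result)
  if ('trim' in filters) result = trimValue(result)
  return result
}

const raw = '  <a href="#">\n  12   releases</a>\n'
const all = { tags: null, breaks: null, spaces: null, trim: null }
console.log(applyFilters(raw, all)) // prints "12 releases"
```

Applying the filters in a fixed order (tags, breaks, spaces, trim) is a design choice of this sketch: stripping tags and breaks first can leave runs of spaces behind, which the later steps then clean up.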



  • node >= 10
  • yarn
$ git clone
$ cd scraper
$ yarn