Zyte Client

A Ruby gem for operating the Zyte.com API.

Outline:

  1. Installation
  2. Getting Started
  3. Zyte Options
  4. Parsing Zyte Response

1. Installation

gem install zyte-client

You may also need to install the jq package.

sudo apt install jq

2. Getting Started

require 'zyte-client'
url = 'https://www.google.com/search?q=Hola+Mundo'
client = ZyteClient.new(key: '<your Zyte API key here>')
html = client.extract(url: url)
File.open("getting-started.html", 'w') { |file| file.write(html) }
puts html

3. Zyte Options

You can pass additional Zyte API options on top of the default request. The default request is just this: {"url": url}.

require 'zyte-client'
url = 'https://www.google.com/search?q=Hola+Mundo'
client = ZyteClient.new(key: '<your Zyte API key here>')
html = client.extract(
    url: url,
    options: {
        "sessionContext": [
            {
                "name": "id",
                "value": "2"
            }
        ],
        "sessionContextParameters": {
            "actions": [
                {
                    "action": "waitForTimeout",
                    "timeout": 5,
                    "onError": "return"
                }
            ]
        },
        "httpResponseBody": true
    }
)
File.open("data/options.html", 'w') { |file| file.write(html) }
puts html
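
As a rough illustration of how extra options might sit on top of the default request, the sketch below merges an options hash into {"url": url}. The helper build_request_body is hypothetical and is not part of zyte-client; the gem may build its request differently. One detail that is certain: in Ruby, a hash literal written as "key": value produces symbol keys, so the sketch converts keys to strings before merging.

```ruby
require 'json'

# Hypothetical helper, NOT part of zyte-client: merges extra Zyte options
# into the default request body {"url": url}.
def build_request_body(url:, options: {})
  # Convert keys to strings so :url and "url" cannot both end up in the body.
  { 'url' => url }.merge(options.transform_keys(&:to_s))
end

body = build_request_body(
  url: 'https://www.google.com/search?q=Hola+Mundo',
  options: { "httpResponseBody": true }
)

# Print the JSON that would be sent as the request body under this assumption.
puts JSON.generate(body)
```

Calling build_request_body with no options returns only the default request, which matches the behavior described above.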

4. Parsing Zyte Response

By default, the JSON response from Zyte is parsed and decoded using the shell commands jq and base64, and extract returns the decoded value of the httpResponseBody key.

If you want the entire JSON response instead, set the json_parsing parameter to false.

require 'zyte-client'
url = 'https://www.google.com/search?q=Hola+Mundo'
client = ZyteClient.new(key: '<your Zyte API key here>')
ret = client.extract(
    url: url,
    json_parsing: false,
    options: {
        "sessionContext": [
            {
                "name": "id",
                "value": "2"
            }
        ],
        "sessionContextParameters": {
            "actions": [
                {
                    "action": "waitForTimeout",
                    "timeout": 5,
                    "onError": "return"
                }
            ]
        },
        "httpResponseBody": true
    }
)
File.open("data/parsing.json", 'w') { |file| file.write(ret.to_json) }
puts ret.to_json
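
If you work with the full JSON response, you can decode httpResponseBody yourself in plain Ruby rather than through jq and base64. The sketch below assumes you hold the response as raw JSON text; decode_http_response_body is a hypothetical helper, not part of zyte-client, and the sample response is built locally for illustration.

```ruby
require 'json'

# Hypothetical helper, NOT part of zyte-client: extracts and decodes the
# Base64-encoded httpResponseBody field from a raw JSON response string.
def decode_http_response_body(raw_json)
  encoded = JSON.parse(raw_json).fetch('httpResponseBody')
  # String#unpack1('m') is Ruby's built-in Base64 decoder.
  encoded.unpack1('m')
end

# Stand-in for a real response, constructed locally (not actual Zyte output).
sample = JSON.generate({
  'url' => 'https://example.com',
  'httpResponseBody' => ['<html>Hola Mundo</html>'].pack('m0')
})

puts decode_http_response_body(sample)
```

Using fetch rather than [] raises a KeyError when httpResponseBody is absent, which makes a request sent without "httpResponseBody": true fail loudly instead of returning nil.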