Spider
SpiderTools lets an agent use Spider, an open-source web scraper and crawler that returns LLM-ready data. To start using Spider, you need an API key from the Spider dashboard.
Prerequisites
The following example requires the spider-client library.
pip install -U spider-client
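Because Spider needs an API key, it can help to fail early with a clear message when the key is missing. The sketch below is a minimal check, assuming the key is supplied through an environment variable named SPIDER_API_KEY; that variable name and the helper require_spider_api_key are assumptions for illustration, so confirm the exact name against the Spider client documentation.

```python
import os

# Assumption: the key is read from an environment variable with this name.
API_KEY_ENV_VAR = "SPIDER_API_KEY"


def require_spider_api_key(env=os.environ):
    """Return the Spider API key, or raise a clear error if it is missing."""
    key = env.get(API_KEY_ENV_VAR, "").strip()
    if not key:
        raise RuntimeError(
            f"{API_KEY_ENV_VAR} is not set; create a key in the Spider "
            "dashboard and export it before running the agent."
        )
    return key
```

Calling require_spider_api_key() at the top of a script surfaces a missing key before the agent makes its first tool call.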
Example
The following agent runs a search query for the latest news in the USA and scrapes the first search result. The agent returns the scraped data in Markdown format.
cookbook/tools/spider_tools.py
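from bitca.agent import Agent
from bitca.tools.spider import SpiderTools

# Give the agent access to the Spider search, scrape, and crawl functions.
agent = Agent(tools=[SpiderTools()])
# Ask for a search followed by a scrape, and render the answer as Markdown.
agent.print_response('Can you scrape the first search result from a search on "news in USA"?', markdown=True)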
Toolkit Params
| Parameter | Type | Default | Description |
|---|---|---|---|
| max_results | int | - | The maximum number of search results to return |
| url | str | - | The URL to be scraped or crawled |
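As a sketch of how a toolkit parameter might be set, the helper below caps the number of search results. It assumes the parameters listed above are accepted as keyword arguments by the SpiderTools constructor, which this page does not state explicitly; build_spider_agent is a hypothetical helper name.

```python
def build_spider_agent(max_results=3):
    """Create an agent whose Spider searches return at most max_results hits."""
    if max_results < 1:
        raise ValueError("max_results must be at least 1")

    # Imported inside the function so the check above works even where
    # bitca is not installed.
    from bitca.agent import Agent
    from bitca.tools.spider import SpiderTools

    # Assumption: max_results is a constructor keyword argument.
    return Agent(tools=[SpiderTools(max_results=max_results)])
```

A smaller max_results keeps the tool output short, which may matter when the scraped pages are passed back to the model.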
Toolkit Functions
| Function | Description |
|---|---|
| search | Searches the web for the given query. |
| scrape | Scrapes the given URL. |
| crawl | Crawls the given URL. |
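One way to try each function is to send the agent a prompt aimed at it. The sketch below is illustrative only: the prompts, the example.com URL, and the run_examples helper are assumptions, and which function the agent actually calls depends on the underlying model.

```python
# Hypothetical prompts, one per toolkit function.
EXAMPLE_PROMPTS = {
    "search": 'Search the web for "news in USA" and list the top results.',
    "scrape": "Scrape https://example.com and summarize the page.",
    "crawl": "Crawl https://example.com and list the pages you found.",
}


def run_examples(agent, prompts=EXAMPLE_PROMPTS):
    """Send each prompt to the agent and print its Markdown response."""
    for name, prompt in prompts.items():
        print(f"--- {name} ---")
        agent.print_response(prompt, markdown=True)
```

Passing the agent from the example above, as in run_examples(agent), issues the three prompts in order.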