Crawl4AI

Crawl4aiTools enable an Agent to perform web crawling and scraping tasks using the Crawl4ai library.

Prerequisites

The following example requires the crawl4ai library.

pip install -U crawl4ai

Example

The following agent will scrape the content from the https://docs.projectbit.ca/ webpage:

cookbook/tools/crawl4ai_tools.py

from bitca.agent import Agent
from bitca.tools.crawl4ai_tools import Crawl4aiTools

agent = Agent(tools=[Crawl4aiTools(max_length=None)], show_tool_calls=True)
agent.print_response("Tell me about https://github.com/bitca/bitca.")

Toolkit Params

Parameter
Type
Default
Description

max_length

int

1000

Specifies the maximum length of the text from the webpage to be returned.

Toolkit Functions

Function
Description

web_crawler

Crawls a website using crawl4ai’s WebCrawler. Parameters include ‘url’ for the URL to crawl and an optional ‘max_length’ to limit the length of extracted content. The default value for ‘max_length’ is 1000.

Last updated