Loaders

Adapted from the Griptape AI Framework documentation.

Overview

Loaders are used to load data from sources and parse it into Artifacts. Each loader can be used to load a single "source" with load() or multiple sources with load_collection().

File

The following Loaders load a file using a FileManagerDriver and loads the resulting data into an Artifact for the respective file type.

As a convenience, File Loaders also have a save() method that can save an Artifact to a destination.

from griptape.drivers.file_manager.local import LocalFileManagerDriver
from griptape.loaders import TextLoader

loader = TextLoader(file_manager_driver=LocalFileManagerDriver())

data = loader.load("tests/resources/test.txt")

data.value = data.value.replace("foo", "bar")

loader.save("tests/resources/test.txt", data)

Text

Loads text files into TextArtifacts:

import urllib.request
from pathlib import Path

from griptape.loaders import TextLoader

TextLoader().load("tests/resources/test.txt")

urllib.request.urlretrieve("https://example-files.online-convert.com/document/txt/example.txt", "example.txt")

TextLoader().load(Path("example.txt"))

TextLoader().load_collection(["tests/resources/test.txt", Path("example.txt")])

PDF

Info

This driver requires the loaders-pdf extra.

Loads PDF files into ListArtifacts, where each element is a TextArtifact containing a page of the PDF:

import urllib.request
from pathlib import Path

from griptape.loaders import PdfLoader

urllib.request.urlretrieve("https://arxiv.org/pdf/1706.03762.pdf", "attention.pdf")

# Load a single PDF file
PdfLoader().load("attention.pdf")
# You can also pass a Path object
PdfLoader().load(Path("attention.pdf"))

urllib.request.urlretrieve("https://arxiv.org/pdf/1706.03762.pdf", "CoT.pdf")

# Load multiple PDF files
PdfLoader().load_collection([Path("attention.pdf"), Path("CoT.pdf")])

CSV

Loads CSV files into ListArtifacts, where each element is a TextArtifact containing a row of the CSV:

from pathlib import Path

from griptape.loaders import CsvLoader

# Load a single CSV file
CsvLoader().load("tests/resources/cities.csv")
# You can also pass a Path object
CsvLoader().load(Path("tests/resources/cities.csv"))

# Load multiple CSV files
CsvLoader().load_collection([Path("tests/resources/cities.csv"), "tests/resources/addresses.csv"])

Image

Info

This driver requires the loaders-image extra.

Loads images into ImageArtifacts:

from pathlib import Path

from griptape.loaders import ImageLoader

# Load an image from disk
ImageLoader().load("tests/resources/mountain.png")

# You can also pass a Path object
ImageLoader().load(Path("tests/resources/mountain.png"))

By default, the Image Loader will load images in their native format, but not all models work on all formats. To normalize the format of Artifacts returned by the Loader, set the format field.

from pathlib import Path

from griptape.loaders import ImageLoader

# Load a single image in BMP format
ImageLoader(format="bmp").load("tests/resources/mountain.png")
# You can also pass a Path object
ImageLoader(format="bmp").load(Path("tests/resources/mountain.png"))

# Load multiple images in BMP format
ImageLoader().load_collection([Path("tests/resources/mountain.png"), "tests/resources/cow.png"])

Audio

Loads audio files into AudioArtifacts:

The Loader will load audio in its native format and populates the resulting Artifact's format field by making a best-effort guess of the underlying audio format using the filetype package.

from pathlib import Path

from griptape.loaders import AudioLoader

# Load an image from disk
AudioLoader().load("tests/resources/sentences.wav")

# You can also pass a Path object
AudioLoader().load(Path("tests/resources/sentences.wav"))

JSON

Loads JSON files into JsonArtifacts:

from pathlib import Path

from griptape.loaders import JsonLoader

# Load an image from disk
JsonLoader().load("tests/resources/test.json")

# You can also pass a Path object
JsonLoader().load(Path("tests/resources/test.json"))

Web

Info

This driver requires the loaders-web extra.

Scrapes web pages using a WebScraperDriver and loads the resulting text into TextArtifacts.

from griptape.loaders import WebLoader

WebLoader().load("https://www.griptape.ai")

WebLoader().load_collection(["https://www.griptape.ai", "https://docs.griptape.ai"])

SQL

Loads data from a SQL database using a SQLDriver and loads the resulting data into ListArtifacts, where each element is a TextArtifact containing a row of the SQL query.

from griptape.drivers.sql.sql_driver import SqlDriver
from griptape.loaders import SqlLoader

SqlLoader(sql_driver=SqlDriver(engine_url="sqlite:///:memory:")).load("SELECT 'foo', 'bar'")

SqlLoader(sql_driver=SqlDriver(engine_url="sqlite:///:memory:")).load_collection(
    ["SELECT 'foo', 'bar';", "SELECT 'fizz', 'buzz';"]
)

Email

Info

This driver requires the loaders-email extra.

Loads data from an imap email server into a ListArtifacts, where each element is a TextArtifact containing an email.

from griptape.loaders import EmailLoader

loader = EmailLoader(imap_url="an.email.server.hostname", username="username", password="password")

loader.load(EmailLoader.EmailQuery(label="INBOX"))

loader.load_collection([EmailLoader.EmailQuery(label="INBOX"), EmailLoader.EmailQuery(label="SENT")])

On this page
Overview
File
Web
SQL
Email

← Prev

Artifacts

↑ Up

Data abstractions

Chunkers

Could this page be better? Report a problem or suggest an addition!

Loaders

Overview

File

Text

PDF

Info

CSV

Image

Info

Audio

JSON

Web

Info

SQL

Email

Info

← Prev

↑ Up

Next →