Scraping
Scraping is the ability to read the contents of a web page. Toolhouse adds this functionality to your worker automatically when it detects that the worker needs it. You can also add it manually.
How scraping works
Your worker has access to a scraper integration whenever it needs one. The scraper makes a best effort to read the contents of a web page.
Scraping only reads websites. It is not a technique for performing actions such as filling out forms or logging in to websites.
By default, your worker sees an AI-friendly version of the website. This means it sees basic styling, but not the actual code of the page. If you need the worker to see the entire page code, edit your agent in Agent Editor and tell the editor: "I want my worker to scrape the page in HTML format".
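To make the difference between the two views concrete, here is a minimal Python sketch of reducing a page to its readable text. It is only an illustration of the idea: Toolhouse's actual conversion is not documented here, and the names `TextOnlyParser` and `simplified_view` are hypothetical.

```python
from html.parser import HTMLParser


class TextOnlyParser(HTMLParser):
    """Collects visible text and drops tags, scripts, and styles.

    Hypothetical helper for illustration; not part of Toolhouse.
    """

    SKIPPED_TAGS = {"script", "style"}

    def __init__(self):
        super().__init__()
        self.chunks = []
        self._skip_depth = 0  # greater than 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if text and self._skip_depth == 0:
            self.chunks.append(text)


def simplified_view(html: str) -> str:
    """Return the readable text of a page, one text fragment per line."""
    parser = TextOnlyParser()
    parser.feed(html)
    parser.close()
    return "\n".join(parser.chunks)


raw_html = (
    "<html><head><style>p{color:red}</style></head>"
    "<body><h1>Pricing</h1><p>Plans start at <b>$10</b>.</p></body></html>"
)
print(simplified_view(raw_html))
```

The raw HTML string is what a worker would work with in HTML format, while the printed text approximates the kind of simplified content it works with by default.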
Accessing private pages in an automated way often violates the terms of service of the platform you're trying to access. If you want to extract data from platforms that require authorization, such as LinkedIn or Instagram, you can use our Integrations.
Supported countries
Sometimes the contents of a page change when you connect from a different country. You can instruct your worker to connect from a specific location among the ones we support. To enable this behavior, give your builder a prompt like this:
Make my worker select the right country for the content. I always want to retrieve content from Italy.
This is useful when you need to retrieve content coming from a specific country or in a specific language.
The scraper works on a best-effort basis. If your worker requests a country that is not supported, the scraper still returns a result from the closest available region.
The list of supported countries may change at any time without prior notice. It is provided solely for reference.
Afghanistan
af
Albania
al
Algeria
dz
American Samoa
as
Andorra
ad
Angola
ao
Anguilla
ai
Antarctica
aq
Antigua & Barbuda
ag
Argentina
ar
Armenia
am
Aruba
aw
Australia
au
Austria
at
Azerbaijan
az
Bahamas
bs
Bahrain
bh
Bangladesh
bd
Barbados
bb
Belarus
by
Belgium
be
Belize
bz
Benin
bj
Bermuda
bm
Bhutan
bt
Bolivia
bo
Bosnia and Herzegovina
ba
Botswana
bw
Bouvet Island
bv
Brazil
br
British Indian Ocean Territory
io
British Virgin Islands
vg
Brunei Darussalam
bn
Bulgaria
bg
Burkina Faso
bf
Burma (no longer exists)
bu
Burundi
bi
Cambodia
kh
Cameroon
cm
Canada
ca
Cape Verde
cv
Cayman Islands
ky
Central African Republic
cf
Chad
td
Chile
cl
China
cn
Christmas Island
cx
Cocos (Keeling) Islands
cc
Colombia
co
Comoros
km
Congo
cg
Cook Islands
ck
Costa Rica
cr
Croatia
hr
Cuba
cu
Cyprus
cy
Czech Republic
cz
Czechoslovakia (no longer exists)
cs
Côte d'Ivoire (Ivory Coast)
ci
Democratic Yemen (no longer exists)
yd
Denmark
dk
Djibouti
dj
Dominica
dm
Dominican Republic
do
East Timor
tp
Ecuador
ec
Egypt
eg
El Salvador
sv
Equatorial Guinea
gq
Eritrea
er
Estonia
ee
Ethiopia
et
Falkland Islands (Malvinas)
fk
Faroe Islands
fo
Fiji
fj
Finland
fi
France
fr
French Guiana
gf
French Polynesia
pf
French Southern Territories
tf
Gabon
ga
Gambia
gm
Georgia
ge
German Democratic Republic (no longer exists)
dd
Germany
de
Ghana
gh
Gibraltar
gi
Greece
gr
Greenland
gl
Grenada
gd
Guadeloupe
gp
Guam
gu
Guatemala
gt
Guinea
gn
Guinea-Bissau
gw
Guyana
gy
Haiti
ht
Heard & McDonald Islands
hm
Honduras
hn
Hong Kong
hk
Hungary
hu
Iceland
is
India
in
Indonesia
id
Iraq
iq
Ireland
ie
Islamic Republic of Iran
ir
Israel
il
Italy
it
Jamaica
jm
Japan
jp
Jordan
jo
Kazakhstan
kz
Kenya
ke
Kiribati
ki
Korea, Democratic People's Republic of
kp
Korea, Republic of
kr
Kuwait
kw
Kyrgyzstan
kg
Lao People's Democratic Republic
la
Latvia
lv
Lebanon
lb
Lesotho
ls
Liberia
lr
Libyan Arab Jamahiriya
ly
Liechtenstein
li
Lithuania
lt
Luxembourg
lu
Macau
mo
Madagascar
mg
Malawi
mw
Malaysia
my
Maldives
mv
Mali
ml
Malta
mt
Marshall Islands
mh
Martinique
mq
Mauritania
mr
Mauritius
mu
Mayotte
yt
Mexico
mx
Micronesia
fm
Moldova, Republic of
md
Monaco
mc
Mongolia
mn
Montserrat
ms
Morocco
ma
Mozambique
mz
Myanmar
mm
Namibia
na
Nauru
nr
Nepal
np
Netherlands Antilles
an
Netherlands
nl
Neutral Zone (no longer exists)
nt
New Caledonia
nc
New Zealand
nz
Nicaragua
ni
Niger
ne
Nigeria
ng
Niue
nu
Norfolk Island
nf
Northern Mariana Islands
mp
Norway
no
Oman
om
Pakistan
pk
Palau
pw
Panama
pa
Papua New Guinea
pg
Paraguay
py
Peru
pe
Philippines
ph
Pitcairn
pn
Poland
pl
Portugal
pt
Puerto Rico
pr
Qatar
qa
Romania
ro
Russian Federation
ru
Rwanda
rw
Réunion
re
Saint Lucia
lc
Samoa
ws
San Marino
sm
Sao Tome & Principe
st
Saudi Arabia
sa
Senegal
sn
Seychelles
sc
Sierra Leone
sl
Singapore
sg
Slovakia
sk
Slovenia
si
Solomon Islands
sb
Somalia
so
South Africa
za
South Georgia and the South Sandwich Islands
gs
Spain
es
Sri Lanka
lk
St. Helena
sh
St. Kitts and Nevis
kn
St. Pierre & Miquelon
pm
St. Vincent & the Grenadines
vc
Sudan
sd
Suriname
sr
Svalbard & Jan Mayen Islands
sj
Swaziland
sz
Sweden
se
Switzerland
ch
Syrian Arab Republic
sy
Taiwan, Province of China
tw
Tajikistan
tj
Tanzania, United Republic of
tz
Thailand
th
Togo
tg
Tokelau
tk
Tonga
to
Trinidad & Tobago
tt
Tunisia
tn
Turkey
tr
Turkmenistan
tm
Turks & Caicos Islands
tc
Tuvalu
tv
Uganda
ug
Ukraine
ua
Union of Soviet Socialist Republics (no longer exists)
su
United Arab Emirates
ae
United Kingdom (Great Britain)
gb
United States Minor Outlying Islands
um
United States Virgin Islands
vi
United States
us
Uruguay
uy
Uzbekistan
uz
Vanuatu
vu
Vatican City State (Holy See)
va
Venezuela
ve
Viet Nam
vn
Wallis & Futuna Islands
wf
Western Sahara
eh
Yemen
ye
Yugoslavia
yu
Zaire
zr
Zambia
zm
Zimbabwe
zw
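If you generate builder prompts programmatically, a small lookup over the list above can catch misspelled or unlisted countries before you send the prompt. The following Python sketch uses a short excerpt of the list. The names `SUPPORTED_COUNTRIES`, `country_code`, and `location_prompt` are hypothetical, and adding the two-letter code to the prompt is an assumption rather than a documented requirement.

```python
from typing import Optional

# Excerpt of the list above; extend it with the rows you need.
SUPPORTED_COUNTRIES = {
    "Italy": "it",
    "France": "fr",
    "Germany": "de",
    "Japan": "jp",
    "United States": "us",
}


def country_code(name: str) -> Optional[str]:
    """Return the two-letter code for a country name, or None if it is not in the excerpt."""
    wanted = name.strip().casefold()
    for country, code in SUPPORTED_COUNTRIES.items():
        if country.casefold() == wanted:
            return code
    return None


def location_prompt(name: str) -> str:
    """Build a builder prompt for a listed country (hypothetical wording)."""
    code = country_code(name)
    if code is None:
        raise ValueError(f"{name!r} is not in the supported list excerpt")
    return f"I always want to retrieve content from {name.strip()} ({code})."


print(location_prompt("Italy"))
```

Because the supported list may change without notice, treat a lookup like this as a convenience check rather than a guarantee that the scraper will connect from the requested country.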
Adding scraping manually
Go to Agents in your Toolhouse
Click on your worker to edit it
Select Integrations, then click Add Integration
Choose Metascraper
Click Save changes