Image scraping support added!
Update for the API today, as from now on, images will also be able to be scraped using the API. Supported image extensions are: .jpg, .jpeg, .png, .gif, .bmp, .jpe, .webp You can pass the image URL directly in the API call and the response will contain the raw image content. Give it a try …. Read More
New features added to HeadlessBrowserAPI!
Today, an update landed for HeadlessBrowserAPI, bringing 2 new features to the Puppeteer and Tor endpoints of the API: Automatically solve CAPTCHAs in scraped pages Automatically block Ads in scraped pages Please check the documentation page of the API for more info on how to use these new features!
Recommend HeadlessBrowserAPI & Earn Recurring Passive Income!
Great news today! Join our Affiliate program and earn up to a 50% recurring commission on every user you refer! HeadlessBrowserAPI Affiliate Program We are so glad you’re here and want to learn more about the HeadlessBrowserAPI Affiliate Program! For those of you who don’t know, HeadlessBrowserAPI is a service that handles proxies, headless browsers so …. Read More
Rotating proxy support added by default to API calls – never get blocked again!
The latest update of the API added rotating proxy support to it. It is available by default to all request made by the API. You can still add your own proxies, using the proxy_url API parameter, available in all API nodes (excepting Tor node, which uses Onion/Tor proxies). You can also disable the usage of …. Read More
New endpoint added to the API: create full height screenshots of web sites
A new endpoint was just added to the HeadlessBrowserAPI, it is now able to create full height screenshots of web pages. Check the documentation page for details. The output format will be a jpg file (in case of successful API call). This endpoint will be helpful if you need to generate page previews, archive screenshots, …. Read More
Hacking real estate to find the best off-market deals
Winning in real estate is about better information, with data sources like Redfin and MLS as table stakes. The most successful real estate players win by having an information edge. HeadlessBrowserAPI is a smart web-scraper – letting innovative real estate players like Mr. Tomko find leads that others can’t. Here’s how Mr. Tomko identifies high value …. Read More
Generate high quality potential candidate leads
Good hiring is one of (if not the most) important drivers of success for any company, as great people will find creative solutions to hard problems, form a strong culture and propel a company forward. Great talent is scarce in any environment and to find the best talent you need to start with a wide …. Read More
Transforming one web page into $1 bn of market-moving insights
We all know the web contains valuable information, but the untapped potential of information that is out there, but not structured or used correctly, is enormous. Any doubts about that statement were put to rest recently when a web-data firm, Selerity, uncovered Twitter’s Q1 earnings results about an hour before the scheduled release. Twitter’s stock …. Read More
Scraping Competitor’s Websites
Your competitors may have a team of dedicated content writers creating relevant content which is helping them make more sales, get more leads or offer better information to their customers, this is especially true in eCommerce environments. eCommerce Scraping Scraping your competitor’s website’s for eCommerce product information is a key part to staying up to date …. Read More
What Is Website Scraping?
An Introduction To Website Scraping Web scraping (also called web harvesting or web data extraction) is a computer software technique of extracting information from websites. Usually, such software programs simulate human exploration of the World Wide Web by either implementing low-level Hypertext Transfer Protocol(HTTP), or embedding a fully-fledged web browser, such as Internet Explorer or Mozilla Firefox. Web scraping is closely related to web indexing, which indexes information on …. Read More