{ "cells": [ { "cell_type": "markdown", "id": "ad553e51", "metadata": {}, "source": [ "# Async Chromium\n", "\n", "Chromium is one of the browsers supported by Playwright, a library used to control browser automation. \n", "\n", "By running `p.chromium.launch(headless=True)`, we are launching a headless instance of Chromium. \n", "\n", "Headless mode means that the browser is running without a graphical user interface.\n", "\n", "`AsyncChromiumLoader` load the page, and then we use `Html2TextTransformer` to trasnform to text." ] }, { "cell_type": "code", "execution_count": null, "id": "1c3a4c19", "metadata": {}, "outputs": [], "source": [ "! pip install -q playwright beautifulsoup4\n", "! playwright install" ] }, { "cell_type": "code", "execution_count": 2, "id": "dd2cdea7", "metadata": {}, "outputs": [ { "data": { "text/plain": [ "'