The part of my code that's trying to download the pdf currently looks like this (commented lines being download attempts that didn't work): const newPagePromise = new Promise(x =>īrowser.once("targetcreated", target => x(target.page())) Specifically, I want to download the pdf from a page like this. You should find Puppeteer executes successfully, provided proper Chrome flags are used.I'm trying to do a bit of web scraping using Puppeteer, but I'm not sure how to actually download the documents I find. Chrome will write into /tmp instead.Īdd your JavaScript to your container with a COPY instruction. disable-dev-shm-usage – This flag is necessary to avoid running into issues with Docker’s default low shared memory space of 64MB.If you’re uncomfortable with this, you’ll need to manually configure working Chrome sandboxing, which is a more involved process. It’s vital you ensure your Docker containers are strongly isolated from your host. Using these flags could allow malicious web content to escape the browser process and compromise the host. no-sandbox and disable-setuid-sandbox – These disable Chrome’s sandboxing, a step which is required when running as the root user (the default in a Docker container).Setting this flag explicitly instructs Chrome not to try and use GPU-based rendering. disable-gpu – The GPU isn’t usually available inside a Docker container, unless you’ve specially configured the host.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |