diff --git a/content/posts/scrape_1.md b/content/posts/scrape_1.md
index 826c107..9ea22f5 100644
--- a/content/posts/scrape_1.md
+++ b/content/posts/scrape_1.md
@@ -1,6 +1,7 @@
 ---
 title: "Web Scraping 101: E-Reader app"
 date: 2019-05-13T10:46:34-04:00
+tags: ["scraping"]
 draft: false
 ---
 
@@ -20,13 +21,14 @@ So, individual pdf pages are being read from this `getpdfpage` endpoint, rendere
 every time you flip a page in the app.
 
 This is what is sent to the endpoint:
-```
+
+{{}}
 globalbookid: ""
 pdfpage: ".pdf"
 iscover: "N"
 authkey: ""
 hsid: ""
-```
+{{}}
 
 Obviously, the `globalbookid` is the unique ID of the book I am looking at. `pdfpage` is the ID of the page, there is probably a way
 to get a list of those with another endpoint. `iscover` and `authkey` are self explanatory. So what exactly is
@@ -82,8 +84,8 @@ with open("book.json") as f:
 
 To stitch the pages together, I used `pdfunite`:
 
-```bash
+{{}}
 pdfunite $(ls -v) output.pdf
-```
+{{}}
 
 Now even if you wanted, you *couldn't even buy* a digital version of that book of that quality.
diff --git a/layouts/partials/css/custom.css b/layouts/partials/css/custom.css
index 1ddf123..01c7e2f 100644
--- a/layouts/partials/css/custom.css
+++ b/layouts/partials/css/custom.css
@@ -13,10 +13,16 @@ a {
     text-decoration: none;
 }
 
-code, kbd, pre, samp {
-    background-color: rgba(27,31,35,.05);
+code, kbd, samp {
+    background-color: rgba(255,255,255,.08);
     border-radius: 3px;
     font-size: 90%;
+    padding: 0.1em 0.4em;
+}
+
+pre code {
+    background-color: inherit;
+    padding: 0;
 }
 
 .nav-menu {