cancel
Highlighted
Bianca491 Student
Student
1 0 0 0
Message 1 of 1
27
Flag Post
HP Recommended

How to Mechanize to scrape an HP Printer Status page?

HP Pavilion Notebook 15-bc407TX 2018
Microsoft Windows 10 (64-bit)

The status page looks like this:

 

http://h20000.www2.hp.com/bc/docs/support/SupportDocument/c00002742/c00004781.gif

 

You see the text underneath the Device Status title? That's what I want to scrape.

 

When navigated to, the status page is updated. I've pulled this from the page source:

<form id="deviceStatusPage"   method="post" action="this.LCDispatcher?nav=hp.DeviceStatus">

I can't seem to understand what it's actually DOING so it's hard to work out a good scraping strategy. I'm fairly sure the solution will be trivial but I can't seem to get started at all.

 

Should have said I've been playing with Mechanize and Beautiful Soup. The former seems like it'd achieve what I'd want, but I'm not sure how.

† The opinions expressed above are the personal opinions of the authors, not of HP. By using this site, you accept the Terms of Use and Rules of Participation