HTPCGB Wrote: And here's the scraping class...
So, I've taken the liberty of using your class in the little prototype I'm building for "Interactive Dynamic Source" apps (for want of a better term). Basically, it's a DynSource app with a form for capturing links from Google, YouTube, etc. It works like a regular DynSource app except that the form pops up when you press the import button. You can then copy video clip page URLs from your browser and paste them into the form, where they get resolved into the actual target URL and added to a list. When you are done, the list gets turned into a PLX file, which is then imported in the normal way.
I'd already built one for YouTube where I capture the link for playing the Flash video in full screen mode on the YouTube site. To play it I built a "player" from a simple web browser control in a form with no border. Then, when I saw your GooglePage class, I built another one that uses your technique (i.e. capture the actual URL of the file and then use VLC to play it). Here's what I found so far:
Your avi links work really well. At least they do in VLC 0.8.4a (my test system), but VLC 0.8.5 (my GBPVR system) complains that the avi format is broken and then stops (EDIT: but it works fine in the 0.8.6-test2a beta - I'm watching one now).
The flv links work (eventually) but VLC seems to have to download the entire flv file (which takes forever) before it starts playing it. I haven't figured out if you can persuade it to start playing while the file is still being downloaded (like it does with most other formats). The flv files also seem really huge compared with other formats. (EDIT: Just noticed that you already told me about this problem a few days ago. Duhh! I really must get some sleep.)
Using my YouTube technique, where I'm letting their site host the player and just rendering the resultant window, works kinda OK (Flash video is very slow and unreliable at the best of times, IMHO). The gotcha in my case is that something seems to change in the target URL they use every so often, so the link I capture works for a while and then no longer works. The video id is the same but something else seems to change. Still trying to figure out what.
I'll post the source (and, with a bit of luck, working binaries) when I've got it slightly more sorted out. Very much a "what if" project, though. Just one approach to solving the problem.
EDIT: Here's an example of the problem I mentioned above (I've highlighted the bit that changed):
Code:
http://www.youtube.com/watch_fullscreen?video_id=BL5kHYtL430&l=189&t=OEgsToPDskK[B]FQs1JFiiH_z2Dl1vOi9Rl[/B]&fs=1&title="Dance of the Firefly"
and 20 minutes later I get:
http://www.youtube.com/watch_fullscreen?video_id=BL5kHYtL430&l=189&t=OEgsToPDskK[B]TE6ki-TuKlF_u6JWqVIPk[/B]&fs=1&title="Dance of the Firefly"
Any ideas? A timestamp, maybe?
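For anyone who wants to check this on their own captures, here is a rough Python sketch that compares the query strings of two captured links and reports which parameters differ. The two URLs are the ones above with the [B] tags stripped out; the function names are just ones I made up for illustration, not part of the GooglePage class or any plugin.

```python
from urllib.parse import urlsplit, parse_qs


def changed_params(url_a, url_b):
    """Return {name: (values_a, values_b)} for query parameters that differ."""
    qa = parse_qs(urlsplit(url_a).query, keep_blank_values=True)
    qb = parse_qs(urlsplit(url_b).query, keep_blank_values=True)
    names = sorted(set(qa) | set(qb))
    return {n: (qa.get(n), qb.get(n)) for n in names if qa.get(n) != qb.get(n)}


def common_prefix_len(a, b):
    """Count how many leading characters two strings share."""
    count = 0
    for x, y in zip(a, b):
        if x != y:
            break
        count += 1
    return count


# The two captures from this post, with the [B]...[/B] highlighting removed.
first = ('http://www.youtube.com/watch_fullscreen?video_id=BL5kHYtL430&l=189'
         '&t=OEgsToPDskKFQs1JFiiH_z2Dl1vOi9Rl&fs=1&title="Dance of the Firefly"')
later = ('http://www.youtube.com/watch_fullscreen?video_id=BL5kHYtL430&l=189'
         '&t=OEgsToPDskKTE6ki-TuKlF_u6JWqVIPk&fs=1&title="Dance of the Firefly"')

if __name__ == "__main__":
    for name, (old, new) in changed_params(first, later).items():
        shared = common_prefix_len(old[0], new[0])
        print(f"{name}: {old[0]} -> {new[0]} (first {shared} chars unchanged)")
```

On these two links only the t parameter differs, and only after its first few characters. The script can't say what t actually encodes, but it does suggest that a stored link may need to be re-captured from the video page shortly before playing rather than saved for later.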
GBPVR v1.3.11 HVR-1250, ES7300, 4GB, GeForce 9300, LianLi, Vista.
GBPVR v1.0.08 PVR-150, P4 2.26GHz, 1GB, GeForce 6200, Coupden, XP
Author: UbuStream plugin, UbuRadio plugin, EPGExtra utility.