MaggotInfested

joined 2 months ago

Yeah I was using it before I realized I might need a scraper.

[–] MaggotInfested@lemmy.dbzer0.com 6 points 19 hours ago (8 children)

What is the USTR blacklist? how do we preserve this data before its lost?

[–] MaggotInfested@lemmy.dbzer0.com 2 points 1 day ago (2 children)

Very little. I know basic html + css but I am trying to work with python

I test with IDLE for python + use selenium for driver directory (geckodrive)

[–] MaggotInfested@lemmy.dbzer0.com 1 points 3 days ago (1 children)

I could send it to you privately if you let me know ur discord or something

I don't like to touch js so ive being going python only. (besides basic html & Css) but I found puppeteer and didn't really get it.

The discord thing is a no-go since I don't really know how to make my issue palatable. That's why I used lemmy. Thanks again!

I am having to frankinscript because resources don't really give out the code for my needs. I am using command prompt from win powershell and testing with python IDLE

[–] MaggotInfested@lemmy.dbzer0.com 1 points 3 days ago (4 children)

I'm attempting to make a webscraper that can grab online books that are stored within the site or stored with a direct link to the storage site. I don't want to reinvent this but finding one that I can work and/or build off of is hard due to my lack of experience and vague resources.

[–] MaggotInfested@lemmy.dbzer0.com 1 points 3 days ago (2 children)

This was the original plan but it doesn't work as well for this on 'dynamic' websites

 

I have been trying for hours to figure this out. From a building tutorial to just trying to find prebuilt ones, I can't seem to make it click.

For context I am trying to scrape books myself that I can't seem to find elsewhere so I can use and post them for others.

The scraper tutorial

Hackernoon tutorial by Ethan Jarell

I initially tried to follow this but I kept having a "couldn't find module" error. Since I have never touched python prior to this, I am unaware how to fix this and the help links are not exactly helpful. If there's someone who could guide me through this tutorial that would be great.

Selenium

Selenium Homepage

I don't really get what this is but I think its some sort of python pack and it tells me to download using the pip command but that doesn't seem to work (syntax error). I don't know how to manually add it in because, again, I have little idea of what I'm doing.

Scrapy

Scrapy Homepage

This one seemed like it'd be an out-of-box deal but not only does it need the pip command to download but it has like 5 other dependencies it needs to function which complicates it more for me.

I am not criticizing these wares, I am just asking for help and if someone could help with the simplification of it all or maybe even point me to an easier method that would be amazing!


Updates

  • Figured out that I am supposed to run the command for pip in the command prompt thing on my computer, not the python runner. py -m followed by the pip request

  • Got the Ethan Jarrell tutorial to work and managed to add in selenium, which made me realize that selenium isn't really helpful with the project. rip xP

  • Spent a bunch of time trying to workshop the basic scraper to work with dynamic sites, unsuccessful

  • Online self-help doesn't go in as much as I would like, probably due to the legal grey area


 

I was watching on it this morning but I just tried to go on it and this came up. The megathread needs to be updated.

https://goodbye.braflix.is/

41
What is oalinst.exe? (lemmy.dbzer0.com)
submitted 3 weeks ago* (last edited 3 weeks ago) by MaggotInfested@lemmy.dbzer0.com to c/piracy@lemmy.dbzer0.com
 

I see it sometimes when I download games and I usually avoid it but I want to play the game online so I want to know if its a genuine concern. Couldn't find anyone else talking about it on here :P

EDIT: Thanks for the info everyone! Makes much more sense now.

 

I keep seeing in forums and sites like these that say it's frowned upon to not seed torrents that you use/used. I saw a post on here or Reddit (I don't remember) with a guy ecstatic that someone started seeding his download he had been trying to get done for months. I know seeding lets someone download something using your computer but how is it helpful if someone doesn't have a site and/or isn't "in-range" ?

If you can't tell, I don't know much about how torrenting works other than how to download something using one. I hope that you all can just explain or point me in the right direction because I would like to support the community.

view more: next ›