Clueless python learner
Posted by Radio_Pluto@reddit | learnprogramming | View on Reddit | 9 comments
I recently learned python from a book, since then i want to start web scraping and play with APIs but i see no starting point for it. I am trying to learn Requests module but i am not aware from terms like JSON, paraphrase, encode and stuffs. I also downloaded a book to learn web scraping but it was missing structure, the writer was jumping from html library to beautiful soup without explaining much. Is there a path to learn everything in a systematic way??
Free-Cheek-9440@reddit
If Requests feels confusing, start even simpler: just hit a public API and print raw output.
Don’t worry about parsing at first just observe how data looks.
Then slowly introduce JSON parsing (
json.loads) and only then move to scraping HTML pages.This step-by-step layering is what makes it stick.
Eastern_Ad_9018@reddit
I am currently using Python for web scraping; if you're interested, you can discuss it with me.
Radio_Pluto@reddit (OP)
yesss i wanted know how do i start what prerequisites do i need to learn first ??
Eastern_Ad_9018@reddit
You can first learn about the concepts and functions of web crawlers. (If you have the ability, you can also learn about some website development functions). Of course, it's okay if you don't understand these things. All you need to know is that you can obtain the specified data by simulating a browser through the program.
The two common types of requests used to obtain website data through programs are `GET` and `POST`.
After you obtain the data, there are many data that are not what you need, so there is a need for data cleaning and parsing.
After the data cleaning is completed, it is necessary to save the data locally or in the database.
This is the simple logic of a crawler. The next steps are how to correctly obtain the response content, how to improve the request speed, and the speed of data entry.
jameyiguess@reddit
Automate The Boring Stuff is a great book. But you will have to do side research no matter what as you learn anything in tech. Learn what json is, why is had to be encoded and decoded, etc.
Radio_Pluto@reddit (OP)
yeah i just saw that book it has many good beginner projects should i go in sequence or jump to projects which i like to do first?
desrtfx@reddit
IMO, the projects gradually build, so the best approach is to do them in sequence.
Yet, you can definitely do them in the order of your interests. Might need to do a bit more research, but it is definitely doable.
Latter-Particular440@reddit
yeah that book really covers web scraping basics well but you'll need to google around json and http stuff as you go, it's just how programming works unfortunately
Key_Use_8361@reddit
Completely normal phase what helped me was building tiny automation scripts instead of tutorials even a simple script running daily taught me more than courses later I used runable to schedule them automatically so I stayed consistent