How to Scrape Website Usage/User Data for a Website that Does Not Have an API

Question


Nat

September 30, 2021, 16:55

I am an aspiring Data Analyst and I would like to know how to scrape website usage/user data such as:

total number of people who visited the website,
total visitors per page,
how long each visitor stayed on each page, etc.

The website I am interested in scraping does not have an API, in case that affects your answer. I have searched online, watched many YouTube tutorials, etc., but none of it pertains to the data I need.

I don't know the terminology I should use to search for the usage data above. I also don't know where this sort of data lives on a website, or whether it is even accessible to the general public.

I have used the code below in Python with BeautifulSoup and urllib.request, but I don't know if this is the right way to get usage/user data in particular, because it's not the kind of data I have typically scraped, such as:

print('title, nav, href, etc...')

Below is the code I use when I am scraping data for things like: title, nav, href, etc.

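import bs4 as bs
import urllib.request

# Fetch the raw HTML of the page (replace the placeholder with the real URL)
source = urllib.request.urlopen("the website you are scraping").read()

# Parse the HTML with the lxml parser (requires the lxml package)
soup = bs.BeautifulSoup(source, "lxml")

# Print the entire parsed document
print(soup)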
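As a concrete illustration of the kind of scraping described above (title, nav, href), here is a minimal sketch. The inline SAMPLE_HTML page and the helper name extract_basics are made up for illustration, and it uses Python's built-in "html.parser" rather than lxml so that it runs without an extra parser install. It only reads what is present in the page markup.

```python
import bs4 as bs

# Hypothetical sample page, inlined so the sketch runs without a network call.
SAMPLE_HTML = """
<html>
  <head><title>Example Page</title></head>
  <body>
    <nav><a href="/home">Home</a> <a href="/about">About</a></nav>
    <p>See the <a href="https://example.com/docs">docs</a>.</p>
  </body>
</html>
"""


def extract_basics(html):
    """Return the page title, the nav link texts, and every href on the page."""
    soup = bs.BeautifulSoup(html, "html.parser")

    # <title> may be missing on some pages, so guard against None.
    title = soup.title.string if soup.title else None

    # Text of each link inside the first <nav> element, if there is one.
    nav = soup.find("nav")
    nav_links = [a.get_text(strip=True) for a in nav.find_all("a")] if nav else []

    # Every href attribute on the page, in document order.
    hrefs = [a["href"] for a in soup.find_all("a", href=True)]

    return {"title": title, "nav": nav_links, "hrefs": hrefs}


result = extract_basics(SAMPLE_HTML)
print(result)
```

To run this against a live page, the bytes returned by urllib.request.urlopen(...).read() could be passed to extract_basics in place of SAMPLE_HTML.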

Any and all help would be greatly appreciated.

Thank you.

Topic web-scraping data-science-model data python

Category Data Science

