We use requests and BeautifulSoup to perform the following steps:

  1. Fetch the page's HTML
  2. Extract the title
  3. Extract the body text, ignoring scripts and HTML tags
  4. Collapse runs of consecutive newlines into a single newline
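
The four steps above can be sketched roughly as follows. This is a minimal illustration, not a definitive implementation: the function names `fetch_html` and `parse_page`, the 10-second timeout, the choice of the built-in `html.parser`, and the decision to also drop `<style>` elements are all assumptions made for the example.

```python
import re

import requests
from bs4 import BeautifulSoup


def fetch_html(url, timeout=10.0):
    """Step 1: download the page and return its HTML as text.

    The function name and the timeout value are hypothetical choices.
    """
    response = requests.get(url, timeout=timeout)
    response.raise_for_status()  # fail loudly on 4xx/5xx responses
    return response.text


def parse_page(html):
    """Steps 2-4: return (title, cleaned body text) for an HTML string."""
    soup = BeautifulSoup(html, "html.parser")

    # Step 2: the <title> element may be missing, so fall back to "".
    title = soup.title.get_text(strip=True) if soup.title is not None else ""

    # Step 3: remove <script> (and, as an assumption, <style>) elements
    # so that their contents do not end up in the extracted text.
    for element in soup(["script", "style"]):
        element.decompose()
    root = soup.body if soup.body is not None else soup
    text = root.get_text(separator="\n")

    # Step 4: collapse runs of newlines (and whitespace around them)
    # into a single newline, then trim the ends.
    text = re.sub(r"\s*\n\s*", "\n", text).strip()
    return title, text
```

A call such as `title, text = parse_page(fetch_html(url))` chains the steps together. Keeping the download separate from the parsing makes it possible to try the parsing on a literal HTML string without network access.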
