I have my personal blog, made with Hugo and hosted on GitHub pages. Initially I did not turn on any kind of web tracking / web analytics, because I do not like tracking at all. But I want to make my blog better and to achieve it, I need a feedback loop about traffic. For example, what are the most popular publications, or how many people view my blog from mobile devices, etc.

So, my question is, what is the most appropriate (ot the less evil) way to track a web traffic?

An answer “there is no good way to do it without breaking user’s privacy” is acceptable too, I did not decide yet turning on the analytics. Instead I’m interested in an opinion of the community.

Thanks in advance!

  • mox@lemmy.sdf.org
    link
    fedilink
    arrow-up
    11
    ·
    edit-2
    6 months ago

    The right way to do this is to self-host your analytics.

    I don’t know which tools are popular for this nowadays, but something like Matomo On-Premise might be worth a look. I expect you can find more with a web search. Keywords: open-source self-hosted web analytics.

    • Sem@lemmy.mlOP
      link
      fedilink
      English
      arrow-up
      2
      ·
      6 months ago

      Interesting, thanks! I didn’t think about it. But for a personal blog (without any kind of monetization) it is not an option, unfortunately, due to hosting/infra prices 😞

      • mox@lemmy.sdf.org
        link
        fedilink
        arrow-up
        7
        ·
        edit-2
        6 months ago

        VPS can be had very cheap: https://lowendstock.com/

        Also, it might be worth looking for analytics software that can get its data from web server log files. I have done that with Apache and Nginx in the past. These days, I wouldn’t be surprised if such software can ingest the log files created by Amazon’s S3 free tier. You wouldn’t have to manage a VPS with that approach.

        Of course, if you’re letting a major data collector like Github (Microsoft), Amazon, or Cloudflare serve your site, it’s not particularly good for privacy to begin with.

  • j4k3@lemmy.world
    link
    fedilink
    English
    arrow-up
    3
    ·
    6 months ago

    Think of it like people walking into a brick and mortar retail store and what they should be able to expect from an honest local business. For most of us, the sensitivities are when your “local store” is collecting data that is used for biased information, price fixing, and manipulation. I don’t think you’ll find anyone here that boycotts a store because they keep a count of how many customers walk in the front door.

  • The Doctor@beehaw.org
    link
    fedilink
    English
    arrow-up
    2
    ·
    6 months ago

    The folks who’re recommending analysis of your site’s access logs are correct. However, Github Pages doesn’t have any such notion. You might have to go with the recommendations of moving stuff over to a VPS.

    • Sem@lemmy.mlOP
      link
      fedilink
      English
      arrow-up
      1
      ·
      6 months ago

      It is always a tradeoff. It could be cool to see some analytics of visits, but I’m not ready to go self hosted for that. My blog is just a hobby, not a work, no monetization, etc.

    • ErwinLottemann@feddit.de
      link
      fedilink
      arrow-up
      1
      ·
      6 months ago

      awstats parses logfiles to create statistics, which op probably has not due to the page beeing hosted on github

      • delirious_owl
        link
        fedilink
        arrow-up
        1
        ·
        6 months ago

        Yeah my point is you gotta self host to collect analytics without violating your visitor’s privacy.

  • DetectiveSanity@lemmy.world
    link
    fedilink
    arrow-up
    1
    ·
    6 months ago

    Tracking how many users you have and how many of them visit an article is hardly any breach of privacy compared to what ad companies retain.

    Use an open source solution about analytics and minimise data collection if not absolutely necessary.