Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Infodemiology

Date Submitted: Aug 8, 2023
Date Accepted: Oct 24, 2024

The final, peer-reviewed published version of this preprint can be found here:

Uncovering the Top Nonadvertising Weight Loss Websites on Google: A Data-Mining Approach

Almenara CA, Gulec H

Uncovering the Top Nonadvertising Weight Loss Websites on Google: A Data-Mining Approach

JMIR Infodemiology 2024;4:e51701

DOI: 10.2196/51701

PMID: 39661980

PMCID: 11669867

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

Uncovering the Top Non-Advertising Weight Loss Websites on Google: A Data-mining Approach

  • Carlos A. Almenara; 
  • Hayriye Gulec

ABSTRACT

Background:

Online weight loss information is commonly sought by internet users, and it may impact their health decisions and behaviors. Previous studies examined a limited number of Google search queries and relied on manual approaches to retrieve online weight loss websites.

Objective:

Therefore, the goal of this study was to identify and unveil the characteristics of the top weight loss websites on Google.

Methods:

This study gathered 432 Google search queries, collected from Google autocomplete suggestions, "People Also Ask" featured questions and Google Trends data. A data-mining software tool was developed to retrieve the search results automatically setting English and United States as the default criteria for language and location, respectively. Domain classification and evaluation technologies were used to categorize the websites according to their content and determine their risk of cyberattack. Also, the top five most frequent websites in non- advertising (i.e., non-sponsored) search results were inspected for quality.

Results:

The results revealed that the top five non-advertising websites were healthline.com, webmd.com, verywellfit.com, mayoclinic.org, and womenshealthmag.com. All provided accuracy statements and author credentials. The domain categorization taxonomy yielded a total of 101 unique categories. After grouping the websites that appeared less than five times, the most frequent categories involved "Health" (n = 104, 16.69%), "Personal Pages and Blogs" (n = 91, 14.61%), "Nutrition and Diet" (n = 48, 7.7%), and "Exercise" (n = 34, 5.46%). The risk of being a victim of a cyberattack was low.

Conclusions:

This study provided initial evidence that a data-mining software tool can help to identify the most common websites for weight loss. Clinical Trial: N/A


 Citation

Please cite as:

Almenara CA, Gulec H

Uncovering the Top Nonadvertising Weight Loss Websites on Google: A Data-Mining Approach

JMIR Infodemiology 2024;4:e51701

DOI: 10.2196/51701

PMID: 39661980

PMCID: 11669867

Per the author's request the PDF is not available.