Help | AOPhf

About the webserver

To use AOP-helpFinder service, please enter your email address. Your email will only be used for the service and will be erased as soon as you received your results. If you want to run a second search you will have to re-submit your email address.

After a couple of minutes, you will received an email. Please click on the link to access to AOP-helpFinder web service formular.

Upload your files

prototypical stressor file

event file

Output format option

Refinement filter option

Reduced search option

Upload your files

event file

Output format option

Reduced search option

The tool can perform the searches in the full abstracts or without considering the beginning part, which appears to be covered usually by the first 20% of the abstracts. This option allows to avoid too many false positive, as the introduction part often reflect a working hypothesis.
For more details of the refinement option, please read the method section below and the supplementary material

AOP-helpFinder is provided without any warranty. But if you have any problemes please feel free to contact us by mail.

After some time (it can be some hours depending of your input data and the queue), you will receive an email with a link. Follow it to access to your results.
You can download your result files as a .zip archive. Please note that it will be automatically deleted after one month.
Automated report : we add a automated report with different informations as the parameters used, figures and basic informations example zip for prototypical stressor - event
example zip for event - event

For both prototypical-stressor / event and event /event search, please note that .xlsx files are dynamic : click on the PMID to be forwarded to the corresponding PubMed page. This option is only available if the number of links is strictly lower than : 65 530.

About the method

The two input files are in text format (txt). One containing the list of the prototypical stressors (one per line, if synonyms, one line per synonyms)(prototypical stressor example), and the second file containing the biological events of interest ( events example )

The input file is in text format (.txt) and must contain the biological events to be linked (one per line, if synonyms, one line per synonyms). see events example

The NCBI API is used to retrieve abstracts from PubMed database(Kanz J.). To facilitate the run and optimize the time, the full PubMed database has been localy downloaded and is updated before each use of AOP-helpFinder web tool. For each stressor of interest, a file containing all the abstracts, that mention this stressor, in a .xml format is created. Then, an adaptation of the titipata PubMed Parser (Achakulvisut et al.2020) is used to extract the information of interest.
Kans J. Entrez Direct: E-utilities on the Unix Command Line. 2013 Apr 23 [Updated 2021 Apr 29]. In: Entrez Programming Utilities Help [Internet]. Bethesda (MD): National Center for Biotechnology Information (US); 2010-. available from: https://www.ncbi.nlm.nih.gov/books/NBK179288/
Achakulvisut et al., (2020). Pubmed Parser: A Python Parser for PubMed Open-Access XML Subset and MEDLINE XML Dataset XML Dataset.Journal of Open Source Software, 5(46), 1979, https://doi.org/10.21105/joss.01979

Pre-processing
Score method using the position of the event
Score method using graph theory

For more details, please read this link and github

Due to a large number of false positive at the beginning of the abstracts, AOP-helpFinder provide a reduced search option to skip the beginning of the abstracts. We advise the user to apply a limit ignoring the first 20 % of the abstracts, corresponding mostly to the introduction part of the study. Indeed, ignoring the first 20% of the abstracts allows to significantly improve the results, increasing the precision by about 10%, while keeping more than 95% of the information, and reducing the noise by 15%.

The AOP-helpFinder web tool will automatically screen the abstracts mentioning the stressor of interest to retrievee known associations with the selected events. Therefore, according to the input data (especially when searching for broad event type such as 'metabolism' or with common stem 'test' and 'testis'), a huge number of results will be retrieved or may lead to incorrect meanings and spelling errors.
A new module called "refinement option" has been developed to facilitate further analysis of the results, that can be for ex. by manual curation. This refinement option will only applied on the selected abstracts from the pre-processing run. As in the pre-processing step, the refinement option will split the abstracts into sentences, remove the negative sentences and the stop words. But instead of stemming the words, the option will use the lemmatization process that considers the context and converts the word to its meaningful base form. Then, AOP-helpFinder tool will removes the sentences containing "context word" (ex: suggest,... ).Then the scores as described in "3.Pre-processing and scoring" will be computed. This step is proposed as an option only for prototypical stressor -event search as it is more time consuming.

The Confidence score (Cs) help the user to identify the most relevant links. The calculation of this score is based on the use of Fisher exact tests. Fisher's exact test is a statistical test used to determine whether the proportions of categories in two variables differ significantly from each other. In our case, it is used in a one-sided manner to determine whether an event is found to be more frequently associated with a stressor (stressor-event) or with another event (event-event) compared to the rest of the PubMed literature.
To facilitate interpretation, the results of these tests are classified into 5 categories according to the strength of the link: “Low”; "Quite low"; "Moderate"; "High" ; "Very high". A Cs = Low means that a stressor-event or event-event link is not very well studied or defined, while a Cs = Very high means that the link has been very well studied. Nevertheless, a Low Cs does not mean that a link does not exist, but warns the user to manually check the result by reading the articles provided by AOP-helpFinder. For very general events (studied in a wide range of fields), you may get a Cs=Low even if the link is well defined.
/!\ Low Cs does not mean that a link does not exist, but warns you to check the link in the literature extracted by AOP-helpFinder.
For more details, please see the article:
doi: 10.1016/j.envint.2023.108017.

Visualization : now in addition to the regular results files we add some figure automatically generated. When you opted to include figures, you will find in the output directory:

prototypical stressor - event search:

distribution_events.pdf

distribution of links according to events

distribution_stressors.pdf

distribution of links according to stressors

distribution_years.pdf

distribution of articles according to their publication date

distribution_scores.pdf

distribution of links according to the number of papers and confidence score

heatmap(s)

distribution of links according to stressors and events (two versions : one where the color scale depends on the number of papers and the other one where it depends on the confidence score)

event - event search:

distribution_years.pdf

distribution of articles according to their publication date

distribution_scores.pdf

distribution of links according to the number of papers and confidence score

About the webserver

1. Enter your email

2. Check your emails

3.a Prototypical stressor-Event

3.b Event-Event

4. Results

About the method

1.a Prototypical stressor - Event Input data

1.b Event - Event Input data

2. Abstracts extraction from PubMed, and parsing

3. Pre-processing and Scoring

4. Reduced search option

5. Refinement filter option

6. Confidence score

7. Figures

prototypical stressor - event search:

event - event search: