Searches a parsed PDF for a set of keywords and returns a tibble recording the source of the PDF, the file name, the keywords, the pages on which the keywords were found, and whether any of the keywords were found.

Usage

match_keywords(file_path, parsedPdf, keywords, source = c("FAS", "TRS"))

Arguments

file_path

(chr) The path to the PDF file

parsedPdf

(chr) The parsed PDF

keywords

(chr) The keywords to search for

source

(chr) The source of the PDF, either "FAS" or "TRS"

Value

(tibble) A tibble with the source of the PDF, the file name, the keywords, the pages where the keywords were found, and whether any of the keywords were found.
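
Examples

The following is a minimal sketch of the behaviour described above, not the package's actual implementation. It makes several assumptions that this page does not confirm: that `parsedPdf` is a character vector with one element per page, that matching is literal and case-insensitive, and that the result has one row per keyword. The function name `match_keywords_sketch` and the column names are hypothetical.

```r
library(tibble)

# Hypothetical stand-in for match_keywords(); for illustration only.
# Assumes `parsedPdf` holds one string per page.
match_keywords_sketch <- function(file_path, parsedPdf, keywords,
                                  source = c("FAS", "TRS")) {
  # Restrict `source` to the allowed values; defaults to "FAS" if omitted
  source <- match.arg(source)

  # Lower-case the pages once so matching is case-insensitive
  pages_lower <- tolower(parsedPdf)

  # For each keyword, collect the page numbers that contain it literally
  hits <- lapply(keywords, function(kw) {
    which(grepl(tolower(kw), pages_lower, fixed = TRUE))
  })

  tibble(
    source    = source,
    file      = basename(file_path),
    keyword   = keywords,
    pages     = hits,                    # list-column of integer page numbers
    any_found = any(lengths(hits) > 0)   # TRUE if at least one keyword matched
  )
}

# Three made-up "pages" standing in for a parsed PDF
pages <- c("Annual report on turbine safety.",
           "No relevant content.",
           "Turbine maintenance log.")

res <- match_keywords_sketch("reports/example.pdf", pages,
                             keywords = c("turbine", "reactor"))
res$pages[[1]]  # pages containing "turbine": 1 and 3
```

Storing the pages in a list-column keeps one row per keyword even when a keyword appears on several pages or on none; the real function may lay out its result differently.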