FAQ
 

What is Veeva Web2PDF?
Who is Veeva Web2PDF?
Do I need to pay to use Veeva Web2PDF?
How often is Veeva Web2PDF updated? Where can I find out about new features?
What browsers can I simulate with Veeva Web2PDF?
What are the beta options for Veeva Web2PDF?
How long will Veeva Web2PDF keep the PDF generated from my website?
Why do you need my email?
Do I need to include http:// or https:// in my URL?
Can Veeva Web2PDF handle dynamic elements on a website?
What is the difference between Password Protection and a Members Only Area?
Troubleshooting
Additional Help
What is Veeva Web2PDF?
Veeva Web2PDF is a free web solution to convert dynamic digital content to PDFs for faster, more accurate review and approval.
Who is Veeva Web2PDF?
Veeva Web2PDF is created and maintained by Veeva.
Do I need to pay to use Veeva Web2PDF?
Veeva Web2PDF is completely free and available to everyone.
How often is Veeva Web2PDF updated? Where can I find out about new features?
Veeva Web2PDF is updated every 4 to 5 weeks. See our release notes to find information about new features and fixed issues.
What browsers can I simulate with Veeva Web2PDF?
The Veeva Web2PDF crawler is browser agnostic. See Veeva Web2PDF Help for details.
What are the beta options for Veeva Web2PDF?
The TruWeb engine (beta) is available for use in Veeva Web2PDF. The TruWeb engine uses the Blink engine to render your website. We selected this engine for its robustness and handling of advanced web content. We invite you to try the TruWeb engine (beta) and send us feedback.
How long will Veeva Web2PDF keep the PDF generated from my website?
For security reasons, Veeva Web2PDF automatically deletes your PDF file from our servers as soon as you download it. If you do not download your PDF file within 24 hours, Veeva Web2PDF will delete the file.
Why do you need my email?
Your email is used to notify you when your PDF is generated. Veeva may also reach out to you to get your feedback on Veeva Web2PDF and to discuss value added opportunities for related Veeva products. Veeva Web2PDF will not sell or distribute your contact details to 3rd parties.
Do I need to include http:// or https:// in my URL?
No. However, if you experience difficulty in scanning your site, we recommend that you use fully qualified URLs.
Can Veeva Web2PDF handle dynamic elements on a website?
Yes. We understand that dynamic elements can make up a large portion of your website, and failing to capture these elements would leave out chunks of content. Dynamic elements are any aspects of a webpage that may not be displayed immediately as the page loads, such as an image slider. These elements are usually javascript based and may require some user interaction. To accurately capture these elements, we developed custom set of data attributes, called HINTS, that web developers can embed into their web pages, giving Veeva Web2PDF the instructions it needs to capture your dynamic elements. See Veeva Web2PDF Hints for details.
What is the difference between Password Protection and a Members Only Area?

Password protection means your website will prompt you for credentials before loading any content. For example, Windows Authentication or .htaccess password protection.

A Members Only area is a part of your website which requires users to enter login credentials to proceed. With this type of login, your site may load without providing credentials, but functionality behind the login the page is hidden. For example, Veeva Vault and most online banking websites have member only areas.

Troubleshooting
Why can't Veeva Web2PDF access my site?
Your site might be protected geographically or it may still be in staging. For Veeva customers and agency partners, we can make Veeva Web2PDF available for whitelisting. Contact your Veeva representative.
Why can't Veeva Web2PDF generate a PDF for my site?
Hosting settings may prevent Veeva Web2PDF from accessing your site. Confirm that your site is accessible to the public web (see above). If your site is accessible to the public web, check for delays on your landing page. Veeva Web2PDF crawlers wait briefly for a response before concluding that a site is missing. Common delays include:

  • Redirection to another URL: Make sure that you are using the direct URL for your site.
  • Loading resources: Does your landing page contain large images, animations, videos, or style sheets?
What is robots.txt, and how does Veeva Web2PDF handle it?
Robots.txt is often used by websites to communicate with web crawler/web robots. If you have pages on your site that you do not want search engines’ crawlers to access, you can specify that in the robots.txt file. Veeva Web2PDF fully respects the instruction you specify in your robots.txt file. If there are any specific pages that you do not want Veeva Web2PDF crawler to access, you can specify that in your robots.txt file. Veeva Web2PDF’s unique user agent is: VeevaWeb2PDFCrawler.

To allow Veeva Web2PDF full access to your website, you'll need to add Veeva as an allowed User-agent in your robots.txt file. Learn more about robots.txt.

User-agent: VeevaWeb2PDFCrawler
Disallow:

User-agent: *
Disallow: /

Summary Report Errors
The Summary Report shows any errors encountered by Veeva Web2PDF while crawling your website. The following talbes gives more information about these errors.

Error Type

Description

Duplicate Page

A duplicate page has been detected. The original page’s URL and duplicate URL are shown. This may happen when a page can be accessed on your site from multiple URLs.

Authentication

The specified page requires a login when no or incorrect credentials are provided. Verify that you have provided the correct credentials for your website.

External Link

The specified link is outside the parent path (sub-directory) and will not be crawled. For example, https://www.veeva.com/products/content-management/vault-promomats/ is specified as the starting page and a link is found to https://www.veeva.com/contact-us/. This will not be crawled as it is not within the starting path. Try running Veeva Web2PDF in a higher directory (ex: https://veeva.com).

External Reference

The specified link is outside of the specified domain and will not be crawled. For example, your website has a link to the FDA’s website.

File Reference

The specified link references a file. Files will not be captured or included by Veeva Web2PDF. Please add them separately.

Contact Information

An email or phone number was found on your website.

Broken Link

The specified link references an inaccessible page. Please check the link address and your hosting settings.

Page Timeout

The specified page could not be loaded within 30 seconds. Please check the page and your hosting settings.

Page Limit Reached

Veeva Web2PDF has reached its 1000 page limit. These links were found on your site but were not crawled.

Job Timeout

Veeva Web2PDF has reached its 1 hour timeout. These links were found on your site but were not crawled.

Unhinted responsive content

Veeva Web2PDF has detected dynamic content on your website such as hover menus, clickable buttons, or an image carousel. More information on the specific content and name (if applicable) can be found in the Details column. This content can be captured by adding hints.

Additional Help
You can find additional help on these pages:

You can also contact us with questions.