Server size
For DISQOVER to run on a server, we recommend the server specifications below.
SETUP |
SERVER SPECIFICATIONS |
MEDIUM (Up to 250 million triples) |
r5.2xlarge EC2 instance or equivalent:
|
LARGE (Up to 10 billion triples) |
i3.8xlarge EC2 instance or equivalent:
|
Requirements
The installation of the DISQOVER application has following requirements:
-
- A Linux server with CentOS 7, RHEL 7, Amazon Linux 2, Ubuntu 18.04 or 20.04 as OS.
- A running Docker service. If this is not yet installed, then the installation package will try
to install Docker CE (Community Edition). Note that for some OS, e.g. RHEL, this is not
standard available. In that case the user should provide a pre-installed Docker service. - An OS user which can manage the Docker service (typically this user must be part of the
Docker group). This user can added by the installation package or can be pre-existing. - An empty directory which is owned by the chosen OS user. This directory can added by
the installation script or can be pre-existing.
Firewall rules
The following resources must be accessible or whitelisted by the firewall for the DISQOVER
installer to be functional:
- rpm.ontoforce.com the installation package for RedHat based OS (All traffic for
53.231.0.0/16, 52.216.0.0/15, 3.5.0.0/19, 44.192.134.240/28, 44.192.140.64/28). - apt.ontoforce.com the installation package for debian based OS (All traffic for
53.231.0.0/16, 52.216.0.0/15, 3.5.0.0/19, 44.192.134.240/28, 44.192.140.64/28). - ecr.disqover.com the DISQOVER ECR repository for docker images (HTTP / HTTPS
for 52.1.118.156/32).
The DISQOVER installer can also install Docker on the host server, as mentioned above. To
allow this the following URLs must be whitelisted. If these are not whitelisted, the proper Docker
version should be preinstalled.
- https://download.docker.com if the OS does not have docker installed yet.
The following URLs must be whitelisted for the DISQOVER usage logging mechanism to be
functional.
- s3.amazonaws.com (For DISQOVER usage logs which are sent encrypted to Ontoforce and for DISQOVER instance exports of federated data).
- s3-us-east-1.amazonaws.com (For DISQOVER usage logs which are sent encrypted
to Ontoforce). - *.pendo.io inbound and outbound (For the tool Pendo that is used for usage statistics tracking).
Optionally, the following URLs can also be whitelisted:
- https://registry.hub.docker.com Docker Hub for general docker images.
- https://federation.disqover.com if federation should be active for the installation.
- https://rds.disqover.com if Remote Data Subscription should be active for the installation.
- pypi.ontoforce.com to install the python DISQOVER API library.
- pypi.python.org and *.pythonhosted.org to install python libraries.
The DISQOVER UI client uses Pendo (https://www.pendo.io/) for user analytics. For the user
analytics tracking to work, the client (i.e. the users’ computer, not the DISQOVER server) must
have the following rules configured:
- *.pendo.io inbound and outbound.
Optionally the rules for Pendo can be defined more strictly:
Inbound (needed to get script and display content from Pendo)
- cdn.pendo.io
- data.pendo.io
- pendo-io-static.storage.googleapis.com
- pendo-static*.storage.googleapis.com
Outbound (needed to send analytics and guide responses to Pendo)