R Shiny app publishing
Once you’ve built your Shiny app, you can make it available to users through the Analytical Platform. We have guidance for:
- Deploying your app
- Accessing your deployed app
- Managing your app users
- Managing and Monitoring Deployments
as well as common issues faced during these stages of publishing your app.
⚠️ Note on deployment ⚠️
It is not currently possible to deploy new apps on the Analytical Platform, though we are working to make this functionality available again as soon as possible.
In the meantime, you can still access and manage existing apps.
If you have an existing app that requires urgent redeployment, please submit a request via GitHub. We normally redeploy apps each Wednesday, where we have recevied a request by the Friday before.
App deployment
New apps
It is not currently possible to deploy new apps on the Analytical Platform, though we are working to make this functionality available again as soon as possible.
Existing apps
Your post-migration app and you
As part of the migration path from the Analytical Platform’s hosting to the Cloud Platform there are some changes to how environments work. Previously, only applications whose owners had specifically created a separate dev environment had a means of testing deployments before production. Post-migration, all applications will have a live-like development environment, complete with continuous integration. This section provides an overview of that new setup.
Overview
Your new environment is made up of two key elements:
- A GitHub workflow
- A Cloud Platform namespace
In brief, the workflow builds and deploys your code as a docker container, and then deploys it to Cloud Platform’s kubernetes cluster, in your application’s namespace.
The flow looks something like this:
The rationale behind this change is to:
- Facilitate testing - including a working development app version and build artifacts like docker containers - before deploying to production
- Give teams more control over their workflows
- Restore the ability to deploy without the Analytical Platform team’s intervention
Apps hosted on Cloud Platform
After the move to Cloud Platform hosting for Analytical Platform apps, you’ll have two active deployments of your apps at all times. These are your ‘dev’ (development) and ‘prod’ (production) deployments.
Be aware that both production and dev currently point to the same data.
Your code repository within the ministryofjustice organisation was built from the data-platform-app-template repo, and has inherited the continuous integration and continuous delivery (CI/CD) pipelines (GitHub Action workflows) from that repo. These workflows will automatically build your docker images, push them to a remote image store, and then deploy the application based on how you’ve made your code changes:
Deploying your application to your development environment is done through the
-dev
workflow, which will build-push-deploy when any pull request is made in the repo (e.g.feature-branch
intomain
, orfeature-branch
intodevelop-branch
). The workflow can also be manually triggered, though this will only build and push the app, it will not deploy it: repo homepage > Actions (tab) > Run workflow (right hand side of the page).- Opening any pull request will trigger a dev deployment. Any subsequent pushes to that branch (or to any other open branches) will trigger dev deployments too. If you’re working on multiple PRs at once in the repo that you will need to coordinate pushes to your PRs so you can track your deployments. You can always cancel workflow runs and rerun deployments from the Actions page in the repo.
Deploying your application to your production environment is done through the
-prod
workflow, which will build-push-deploy either:- after a pull request is merged into main
- upon publishing a release
You can view the status of your deployments either by checking the workflow runs in the Actions tab (repo-url/actions), or by checking out the deployments page (repo-url/deployments).
The above describes how CI/CD will be set up by default in the ministryofjustice repo.
Once you have ownership of the repo, you’ll have the ownership of the .github/workflow/
files too so you will be able to amend the processes and triggers so that they meet your needs.
Your new deployment pipeline
Concourse was decomissioned in 2022: its replacement is GitHub’s CI system, GitHub Actions.
GitHub Actions has workflow definitions located in your repository under .github/workflows/
.
By default, you will be provided with two workflows.
The dev environment workflow is triggered when you open or contribute to a Pull Request from any branch (for example, into your main
branch).
Its steps are:
- Check out the repository
- Authenticate to AWS
- Build a docker container from your development branch
- If the build is successful, push this container to a container registry
- Run
helm
against the development environment namespace.
The production workflow behaves in the exact same way, containing the same steps, but will be triggered when a Pull Request is merged into main
, and will deploy to the production namespace instead
Pre-migration urgent redeployment
If you have an existing app that requires urgent redeployment, please submit a request via GitHub issues. We normally redeploy apps each Wednesday, where we have recevied a request by the Friday before.
Managing published apps
Manage app users
If authentication is enabled and you choose to use email as the login option, user access management to app can be done through Control panel.
To manage the users of your app:
- Login Control panel as app admin
- Go to the Analytical Platform control panel.
- Select the Webapps tab.
- Select the name of the app you want to manage or select Manage customers.
- Select the environment where app is deployed (
dev/prod
)
To add app users:
- Enter the email addresses of the users in the box titled ‘Add app customers by entering their email addresses’ – multiple email addresses can be separated by spaces, tabs, commas and semicolons, and can contain capital letters.
- Select Add customer.
To remove an app user, select Remove customer next to the email address of the customer you want to remove.
Accessing the deployed app
Apps hosted on Cloud Platform
Your deployed app can be accessed at two URLs:
prod
is at:repository-name.apps.live.cloud-platform.service.justice.gov.uk
dev
is at:repository-name-dev.apps.live.cloud-platform.service.justice.gov.uk
(where repository-name
is the name of the relevant GitHub repository)
By default, the user list on dev
empty and you will need to add any users requiring access via control panel.
Pre-migration apps
Your deployed app can be accessed at repository-name.apps.alpha.mojanalytics.xyz
, where repository-name
is the name of the relevant GitHub repository.
If the repository name contains underscores, these will be converted to dashes in the app URL. For example, an app with a repository called repository_name
would have the URL repository-name.apps.alpha.mojanalytics.xyz
.
Authenticating to your pre-migration apps
When accessing an app, you can choose whether to sign in using an email link (default) or a one-time passcode.
To sign in with a one-time passcode, add /login?method=code
to the end of the app’s URL, for example, https://kpi-s3-proxy.apps.alpha.mojanalytics.xyz/login?method=code
.
This requires the app to have been deployed since the auth-proxy release on 30th Jan 2019.
Authenticating to your app
For the dashboard apps using passwordless flow (email login), when accessing an app, you can choose whether to sign in using a one-time passcode (default) or an email magic link.
To sign in with an email magic link, add /login?method=link
to the end of the app’s URL, for example,
https://apps.live.cloud-platform.service.justice.gov.uk/login?method=code
.
Troubleshooting app sign-in
“That email address is not authorized for this app (or possibly another error occurred)” error, after entering email address
- Check that the user is authorised to access the app:
- Log in to the control panel.
- Navigate to the app detail page.
- Choose the right deployment environment (dev/prod)
- Check if the user’s email address is listed under ‘App customers’.
- If it is not, refer them to the app owner to obtain access.
- Check that the user is using the correct email address – there is sometimes confusion between @justice and @digital.justice email addresses.
“Access denied” error, having entered email address and copied the one-time passcode into the login page
Please follow the same steps above to check whether the user is in the customer list of the app.
“IP x.x.x.x is not whitelisted”
Check that the user is trying to access the app from one of the trusted networks listed on app’s app-detail from Control panel
The app admin can modify the IP_Ranges on the app’s app-detail detail page.
Other troubleshooting tips
- Check that they are trying to access the app using a URL beginning with
https://
nothttp://
. - Look for similar issues log in the
data-platform-support
repository. - Try asking the user to clear their cookies by visiting https://alpha-analytics-moj.eu.auth0.com/logout and try again.
In addition the AP team can:
- Check the Auth0 logs for the app in Kibana
- Check the Auth0 logs in the Auth0 console
Advanced
Editing the Dockerfile
A Dockerfile
is a text document that contains all the commands a user could call on the command line to assemble an image.
In most cases, you will not need to change the Dockerfile
when deploying your app.
If your app uses packages that have additional system dependencies, you will need to add these in the Dockerfile
. If you are unsure how to do this, contact the Analytical Platform team.
A Dockerfile
reference can be found in the Docker documentation.
Getting details of current users of the app
An RShiny app can find out who is using it. This can be useful to log an audit trail of significant events. Specifically, it can determine the email address that the user logged into the app with. This is sensitive data, so you must ensure that you are following all relevant information governance processes.
The shiny-headers-demo repository contains an example of how to do this.
These features require you to be using the Analytical Platform version of shiny-server
.
Finding current users’ email addresses
You can obtain the logged in user’s email address by using the following code in the server
function of your app:
get("HTTP_USER_EMAIL", envir=session$request)
This line
in shiny-headers-demo
shows the code in context.
Finding current users’ user profiles
You can access the full user profile by making a request directly from the RShiny app to the auth-proxy’s /userinfo
endpoint using the following code inside your server
function.
# library(httr)
# library(jsonlite)
profile <- fromJSON(content(GET("http://localhost:3001/userinfo", add_headers(cookie=get("HTTP_COOKIE", envir=session$request))), "text"))
This line shows the code in context.
Example response
{
"email": "name@example.gov.uk",
"email_verified": true,
"user_id": "email|12345121312",
"picture": "https://s.gravatar.com/avatar/94deebe3b87fc5e9b3b4469112573cc0?s=480&r=pg&d=https%3A%2F%2Fcdn.auth0.com%2Favatars%2Fna.png",
"nickname": "name",
"identities": [
{
"user_id": "12345121312",
"provider": "email",
"connection": "email",
"isSocial": false
}
],
"updated_at": "2019-07-15T09:54:34.353Z",
"created_at": "2018-07-12T14:26:45.663Z",
"name": "name@example.gov.uk",
"last_ip": "1.1.1.1",
"last_login": "2019-07-15T09:54:34.353Z",
"logins_count": 20,
"blocked_for": [],
"guardian_authenticators": []
}
Troubleshooting and monitoring
Deploying locally
If you have a MacBook, you can use Docker locally to test and troubleshoot your RShiny app. You can download Docker Desktop for Mac from the Docker website.
To build and run your R Shiny app locally, follow the steps below:
- Clone your app’s repository to a new folder on your MacBook – this guarantees that the app will be built using the same code as on the Analytical Platform. You may need to create a new connection to GitHub with SSH.
- Open a terminal session and navigate to the directory containing the
Dockerfile
using thecd
command. - Build the Docker image by running:
{bash} docker build . -t IMAGE:TAG
whereIMAGE
is a name for the image, for example,my-docker-image
, andTAG
is the version number, for example,0.1
. - Run a Docker container created from the Docker image by running:
{bash} docker run -p 80:9999 IMAGE:TAG
- Go to 127.0.0.1:80 to view the app.
If the app does not work, follow the steps below to troubleshoot it:
- Start a bash session in a Docker container created from the Docker image by running:
{bash} docker run -it -p 80:80 IMAGE:TAG bash
- Install the
nano
text editor by running:{bash} apt-get update apt-get install nano
- Open
shiny-server.conf
in thenano
text editor by running:{bash} nano /etc/shiny-server/shiny-server.conf
- Add the following lines at the beginning of
shiny-server.conf
:{bash} access_log /var/log/shiny-server/access.log tiny; preserve_logs true;
- Write the changes by pressing
Ctrl+O
. - Exit the
nano
text editor by pressingCtrl+X
. - Increase the verbosity of logging and start the Shiny server by running:
{bash} export SHINY_LOG_LEVEL=TRACE /bin/shiny-server.sh
- Open a new terminal session.
- Start a new bash session in the Docker container by running:
{bash} docker exec -it CONTAINER_ID bash
You can find theCONTAINER_ID
by runningdocker ps
. - View the logs by running:
{bash} cat /var/log/shiny-server/access.log
For further details, see the Shiny server documentation.
Managing and Monitoring Deployments
Monitoring Deployments
By default all applications deployed into Cloud platform have a basic level of monitoring which can be accessed on the following link Cloud Platform grafana namespace monitoring. Grafana dashboards are accessed using single-sign-on (SSO) via your GitHub account.
This page shows all namespaces and can be refined by typing the name of your namespace in namespace drop down and selecting as needed.
Note: Kubernetes defines Limits as the maximum amount of a resource to be used by a container. This means that the container can never consume more than the memory amount or CPU amount indicated. Requests, on the other hand, are the minimum guaranteed amount of a resource that is reserved for a container.
A brief overview of namespaces
Within Kubernetes a namespace provides a mechanism for isolating groups of resources within a single cluster, it can be thought of as a virtual cluster within the cluster. Your Application is deployed into its own namespace, this restricts access to your team and enables the setting of resource limits. Within the namespace are the various Kubernetes components:
Pods the smallest deployable units of computing that you can create and manage in Kubernetes, usually one pod per function of your application ie web server, db server.
Service is a method for exposing a network application that is running as one or more Pods in your cluster, basically simplifying the connections within your namespace.
Deployment provides declarative updates for Pods and ReplicaSets.
ReplicaSet’s their purpose is to maintain a stable set of replica Pods running at any given time. As such, it is often used to guarantee the availability of a specified number of identical Pods.
What information is displayed on the Grafana namespace chart:
Namespace(s) usage on total cluster CPU in % - This is the total usage by all namespaces on the cluster
Namespace(s) usage on total cluster RAM in % - This is the total usage by all namespaces on the cluster
Kubernetes Resource Count - A simple count of resources running within your namespace.
CPU usage by Pod - CPU usage per pod within your namespace.
Memory usage by Pod - Memory usage per pod within your namespace.
Nb of pods by state - Count of pods at various states within your namespace.
Nb of containers by pod - Number of containers within each pod.
Replicas available by deployment - the number of replicas available to your deployment.
Replicas unavailable by deployment - the number of replicas unavailable to your deployment.
Persistent Volumes Capacity - If your application uses persistent volumes the total capacity is displayed here.
Persistent Volumes Inodes - If your application uses persistent volumes the detail on the inodes is displayed here.
Further technical details can be found in the Cloud Platform’s Monitoring section of the user guidance and Configuring a dashboard using Grafana UI documentation
Accessing logs
You can access your applications logs in Cloud platform by following the the CP guidance Accessing Application Log Data
Kibana on Cloud Platform
Below are some notes to aid you in working with the Kibana service, on Cloud Platform.
All logs from deployed apps can be viewed in Kibana.
To view the logs for a specific app:
- Select live_kubernetes_cluster from the
CHANGE INDEX PATTERN
dropdown list. - Select Add a filter.
- Select kubernetes.namespace_name.
- Select is as the operator.
- Insert the app’s name space by following the pattern
data-platform-app-<app_name>-<dev/prod>
. - In order to filter out all the logs related health-check, you can put
NOT log: "/healthz"
in theKQL
field. - Select Save.
- (Optional) - you can select to view only the log output by adding it from the Available Fields list in the left hand pane using the (+) button revealed on mouse hover
Log messages are displayed in the message column.
By default, Kibana only shows logs for the last 15 minutes. If no logs are available for that time range, you will receive the warning ‘No results match your search criteria’.
To change the time range, select the clock icon in the menu bar. There are several presets or you can define a custom time range.
Kibana also has experimental autocomplete and simple syntax tools that you can use to build custom searches. To enable these features, select Options_ from within the search bar, then toggle Turn on query features.
Managing Deployments
Cloud platform allows teams to connect to the Kubernetes cluster and manage their applications, using kubectl
, the command line tool for Kubernetes.
To do this you will need to setup access through JupyterLab as follows:
Go to Analytical Platform Control Panel
‘> Analytical Tools
’> open JupyterLab Data Science
‘> Terminal
Installing Kubectl
In the terminal session install kubectl
curl -LO "https://dl.k8s.io/release/$(curl -L -s https://dl.k8s.io/release/stable.txt)/bin/linux/amd64/kubectl"
curl -LO "https://dl.k8s.io/$(curl -L -s https://dl.k8s.io/release/stable.txt)/bin/linux/amd64/kubectl.sha256"
echo "$(cat kubectl.sha256) kubectl" | sha256sum --check
If valid, the output is:
kubectl: OK
chmod +x kubectl
mkdir -p ~/.local/bin
mv ./kubectl ~/.local/bin/kubectl
which kubectl
you should see:
/home/jovyan/.local/bin/kubectl
You should only need to carry out the above steps once.
Setup Cloud Platform Access
Follow the steps in CP’s Generating a Kubeconfig file documentation. As we are accessing via Jupyter Labs step 7 “Move the config file to the location expected by kubectl” has to be carried out slightly differently.
First upload the kubecfg.yaml
from your local PC to the Jupyter terminal session.
click on the upload arrow (see below) and select the kubecfg.yaml
Then continue with the remaining steps in CP’s documentation
Accessing the AWS console
If required you can access the Cloud platform via AWS, please follow the guidance in Accessing the AWS console (read-only)
Adding cron job to your application
A cronjob for restarting your application can be setup easier by adding the following line in to your app’s development or production GitHub workflow:
--set Cron.schedule="0 6 * * *"
crontab.guru can be used to setup the schedule.
If you need to a cron job for the other jobs, more guides are available on the Cloud Platform kubernetes cronjobs
Changing the resources available to the application
When you deploy an update to the application the Kubernetes resources are built from a standard Helm chart with the following default memory/cpu resources:
Limits - CPU 25m. Memory 1Gi
Requests - CPU 3m. Memory 1Gi
To override this you will have to amend the GitHub workflow in your application repository
The file(s) to amend are .github/workflows/build-push-deploy-dev.yml or build-push-deploy-prod.yml
Below is the section of code that calls the helm chart
helm upgrade --install --wait --timeout 10m0s --namespace $KUBE_NAMESPACE $KUBE_NAMESPACE mojanalytics/webapp-cp \ --set AuthProxy.Env.Auth0Domain=$AUTH0_DOMAIN \ --set AuthProxy.Env.Auth0Passwordless=$AUTH0_PASSWORDLESS \ --set AuthProxy.Env.Auth0TokenAlg=$AUTH0_TOKEN_ALG \ --set AuthProxy.Env.AuthenticationRequired=$AUTHENTICATION_REQUIRED \ --set AuthProxy.Env.IPRanges=$process_ip_range \ --set AuthProxy.Image.Repository=$ECR_REPO_AUTH0 \ --set AuthProxy.Image.Tag="v5.3.3" \ --set Namespace=$KUBE_NAMESPACE \ --set Secrets.Auth0.ClientId=$AUTH0_CLIENT_ID \ --set Secrets.Auth0.ClientSecret=$AUTH0_CLIENT_SECRET \ --set Secrets.Auth0.CookieSecret=$COOKIE_SECRET \ --set ServiceAccount.RoleARN=$APP_ROLE_ARN \ --set WebApp.Image.Repository=$ECR_REPO_WEBAPP \ --set WebApp.Image.Tag=$NEW_TAG_V \ --set WebApp.Name=$KUBE_NAMESPACE \ $custom_variables ```
To change the resources insert at the end of the file just before the $custom_variables the following as appropriate, remembering to use the line continuation “\” on the existing last line.
`--set WebApp.resources.limits.cpu=100m \` `--set WebApp.resources.limits.memory=2Gi \` `--set WebApp.resources.requests.cpu=50m \` `--set WebApp.resources.requests.memory=2Gi \` `$custom_variables`
Once the changes are pushed/merged to the repository the values will be applied to your Kubernetes namespace.
Changing the number of instances (pods) on name space
The number of app instances running on dev
/prod
is 1 by default, if users experience long response times from the application (apart from trying to improving the performance through the code itself) you can increase the number of instances to reduce the wating time on dev
/prod
GitHub workflows, for example :-
--set ReplicaCount=3
Remember: “With great power comes great responsibility” Your application’s namespace will be one of a number hosted on the same cluster, setting the values too high could crash the cluster!
Storage guidance
Guidance on storage within Cloud Platform can be found here Cloud platform storage
Continuous Deployment
Within Cloud platform your application is already automatically deployed by Github workflows but further guidance on continuous deployment within CP can be found here Cloud Platform continuous deployment.
Deploying Updates to the Application
As the application is now hosted in Cloud Platform and GitHub workflow has been implemented, you will now be able to apply update to your application, full guidance can be found here Application deployment.
Other guidance
Guidance on managing Auth and Secrets through the Control Panel can be found Manage deployment settings of an app on Control panel.