Google Cloud & Kubernetes Tools

Create a Project on Google Cloud

We assume you already did this or are using the Earth Lab project. If you do not have a project, create one at https://console.cloud.google.com.
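
If you prefer the command line, a project can also be created with gcloud. This is only a sketch; ea-jupyter is simply the example project ID used throughout this guide:

gcloud projects create ea-jupyter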

Configure Your gcloud and kubectl Tools

Install the gcloud command-line tool and the kubectl component as per steps 1-3 of the z2jh GKE guide.
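
If the gcloud SDK is already installed, one way to get kubectl (as in the GKE guide) is to add it as a gcloud component. Note that this does not work if gcloud came from a system package manager; in that case install kubectl separately:

gcloud components install kubectl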

Make sure that you are talking to your newly created project. To list your projects, use:

gcloud projects list

Use the name from the PROJECT_ID column. For example, if your project is called ea-jupyter, run the following command to set yourself up to use that project:

gcloud config set project ea-jupyter
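
To confirm which project is active, you can run the following; it should print ea-jupyter (or whatever PROJECT_ID you set):

gcloud config get-value project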

Note

If you switch between different projects and clusters, you may need the following command to point kubectl at the right cluster; it is not needed on a first run-through. For example, to use the jhub2 cluster in the ea-jupyter project:

gcloud container clusters get-credentials jhub2 --zone us-central1-b --project ea-jupyter
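
You can check which cluster kubectl is currently pointing at with:

kubectl config current-context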

Create a Cluster on Google Cloud

Create a basic cluster:

gcloud container clusters create jhub2 \
    --num-nodes=1 --machine-type=n1-standard-2 \
    --zone=us-central1-b --image-type=cos_containerd \
    --enable-autoscaling --max-nodes=3 --min-nodes=1

Here, we use the default Kubernetes version rather than specifying one. You can see the default version with gcloud container get-server-config. When jhub2 was created, the defaultVersion on the RELEASE channel was 1.17.13-gke.2600.
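
For example, to list the default and valid versions in the zone used above:

gcloud container get-server-config --zone us-central1-b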

We also use the containerd image type rather than the default cos type because the latter is being deprecated; see https://cloud.google.com/kubernetes-engine/docs/concepts/using-containerd.

Note

When you create a cluster using gcloud container clusters create, an entry is automatically added to the kubeconfig in your environment and the default context changes to that cluster, so any subsequent gcloud and kubectl commands apply to it (from https://cloud.google.com/kubernetes-engine/docs/how-to/cluster-access-for-kubectl).
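
A quick way to confirm that kubectl is talking to the new cluster is to list its nodes:

kubectl get nodes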

Give your account the super-user permissions needed to set up JupyterHub:

kubectl create clusterrolebinding cluster-admin-binding --clusterrole=cluster-admin --user="<your google account email>"
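
If you are unsure which email to use, gcloud can print the account it is authenticated as, and you can then confirm that the binding was created:

gcloud config get-value account

kubectl get clusterrolebinding cluster-admin-binding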

Then create an autoscaling user pool (using n1-standard-8 nodes, with the autoscaling maximum set to 5 nodes):

gcloud beta container node-pools create user-pool \
--machine-type n1-standard-8 \
--image-type=cos_containerd \
--num-nodes 0 \
--enable-autoscaling \
--min-nodes 0 \
--max-nodes 5 \
--node-labels hub.jupyter.org/node-purpose=user \
--node-taints hub.jupyter.org_dedicated=user:NoSchedule \
--zone us-central1-b \
--cluster jhub2

The node-labels and node-taints options ensure that user pods run only on this node pool, so problems with user resources can't take down the whole cluster (which runs on the core pool).
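
You can check the label once user nodes exist (the pool starts at 0 nodes, so they only appear after the autoscaler adds them); the -L flag below just adds a column showing that label:

kubectl get nodes -L hub.jupyter.org/node-purpose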

Install Helm

Follow the z2jh instructions for setting up Helm (https://zero-to-jupyterhub.readthedocs.io/en/latest/kubernetes/setup-helm.html), noting that we are using Helm 3, which no longer requires Tiller.
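
Once Helm is installed, verify that you have Helm 3 and, as in the z2jh guide, add the JupyterHub chart repository:

helm version --short

helm repo add jupyterhub https://jupyterhub.github.io/helm-chart/
helm repo update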

Install the JupyterHubs

The full z2jh installation guide covers this step in detail.

One way we deviate from the guide is that we run multiple JupyterHubs on one cluster, each hub in its own namespace. The config file for each hub is in the hub-configs directory. So, when the z2jh guide asks you to edit config.yaml you should instead edit hub-configs/<hubname>.yaml.
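
As a sketch of what installing one hub looks like under this convention (the release and namespace names here are assumptions; substitute the actual hub name used in hub-configs/):

helm upgrade --cleanup-on-fail \
    --install <hubname> jupyterhub/jupyterhub \
    --namespace <hubname> \
    --create-namespace \
    --values hub-configs/<hubname>.yaml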

Note

You will need to obtain the key to decrypt secrets/. See Git Crypt for details.
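
Once you have the key file, unlocking the encrypted files typically looks like this (the key path is a placeholder):

git-crypt unlock /path/to/git-crypt-key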

After this, follow the instructions in the ingress-and-https section to set up the ingress and certificates that will route traffic securely to your hub. Without this, your hub will not be reachable from the internet.