# Creating persistent conda environments on Studio’s EFS

This walkthrough should take around 10 minutes and only requires the console.

For this notebook, please use the Data Science image and Python 3 kernel.

From within SageMaker Studio, click on the Home button and then on Open Launcher.

![](../img/open-launcher.png)

Within the Launcher, locate the Notebooks and compute resources section. In this section, check that the selected SageMaker image is a conda-supported first-party kernel image such as “Data Science”. Then select the Open Image Terminal option to open a terminal window with a new kernel. You will see a message saying “Starting image terminal…” and, after a few moments, the terminal will open in a new tab.

![](../img/open-image-terminal.png)

Within the terminal, run the following commands:

```
mkdir -p ~/.conda/envs
conda create --yes -p ~/.conda/envs/custom
conda activate ~/.conda/envs/custom
conda install -y ipykernel
conda config --add envs_dirs ~/.conda/envs
```
These commands will take about 3 minutes to run and will:
1. Create a directory on the EFS volume to store the conda environments
2. Create the new conda environment in that directory
3. Activate the conda environment
4. Install the ipykernel package (without ipykernel this solution will not work)
5. Add the new environment directory to `envs_dirs` in the conda configuration file (.condarc), creating the file if it does not already exist
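To confirm that the commands above did what the list describes, a quick check along the following lines can be run in the same terminal. This is a minimal sketch, not part of the original walkthrough: the variable names `ENV_DIR` and `ENV_PATH` are illustrative, and the script assumes the environment was created at `~/.conda/envs/custom` exactly as shown.

```shell
# Illustrative variable names (assumptions); the paths match the commands above.
ENV_DIR="$HOME/.conda/envs"      # directory created with mkdir -p
ENV_PATH="$ENV_DIR/custom"       # prefix passed to conda create -p

if command -v conda >/dev/null 2>&1; then
    conda env list                   # the custom environment should be listed
    conda config --show envs_dirs    # ~/.conda/envs should appear in this list
else
    echo "conda not found on PATH; run this from the Studio image terminal"
fi

# Check that the environment directory exists under the home directory.
if [ -d "$ENV_PATH" ]; then
    echo "found environment at $ENV_PATH"
else
    echo "no environment at $ENV_PATH yet"
fi
```

If `~/.conda/envs` is missing from the `envs_dirs` output, re-running the `conda config --add envs_dirs` line is the likely fix.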

As this is a new conda environment, no additional dependencies are installed. To install other dependencies, either add them to the `conda install` line above, or wait for the commands to finish and then install them while the conda environment is active.

For this example, install the numpy library and confirm that it imports by running the following commands in the terminal window:

```
conda install -y numpy
python -c "import numpy; print(numpy.version.version)"
```

Now that the conda environment is created and the dependencies installed, you can create a notebook which uses this conda environment persisted on Amazon EFS. Go back to the launcher window and select the Create notebook option with the “Data Science” SageMaker image.

![](../img/create-notebook.png)

From the new notebook, click the switch kernel button in the top right-hand corner, which should say “Python 3 (Data Science)”:

![](../img/switch-kernel-button.png)

From the Set up notebook environment popup, open the Kernel dropdown, which will include an option for the newly created conda environment. If the new conda environment is not listed at first, wait a few minutes and check again, as it can take a few minutes to propagate.

![](../img/setup-notebook-environment.png)
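If the environment still does not appear in the dropdown after waiting, one thing worth ruling out is a missing ipykernel package, since the solution depends on it. The following is a hedged sketch of such a check from the image terminal; `ENV_PATH` is an illustrative variable name, and the path assumes the environment was created as `~/.conda/envs/custom`.

```shell
# Illustrative variable name (assumption); the path matches the conda create step.
ENV_PATH="$HOME/.conda/envs/custom"

if command -v conda >/dev/null 2>&1 && [ -d "$ENV_PATH" ]; then
    # Lists packages in the environment whose names match "ipykernel".
    # No matching row means ipykernel is not installed there.
    conda list -p "$ENV_PATH" ipykernel
else
    echo "conda or $ENV_PATH not available; run this from the Studio image terminal"
fi
```

If no ipykernel row is printed, activating the environment and running `conda install -y ipykernel` again should restore it.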

Select the new conda environment for the kernel and then click the Select button to change the kernel. Back in the notebook, the kernel name in the top right-hand corner will have changed. You can then run a cell to test that the installed dependencies are available.

![](../img/conda-environment-working.png)

## Clean up the conda environment

As the conda environment is stored on Amazon EFS, there will be an ongoing charge for the storage of this environment.
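To get a rough idea of how much storage the environment occupies before deciding whether to remove it, the standard `du` utility can be pointed at the environment directory. This is a small sketch assuming the `~/.conda/envs/custom` path used earlier; `ENV_PATH` is an illustrative variable name.

```shell
# Illustrative variable name (assumption); the path matches the conda create step.
ENV_PATH="$HOME/.conda/envs/custom"

if [ -d "$ENV_PATH" ]; then
    # -s prints a single total, -h uses human-readable units.
    du -sh "$ENV_PATH"
else
    echo "no environment found at $ENV_PATH"
fi
```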

From within the image terminal, run the following commands to remove the conda environment and its entry in the conda configuration:
```
conda deactivate
conda env remove -n custom
conda config --remove envs_dirs ~/.conda/envs
```

## Resources
Check out the following links for more information:
- [Managing conda environments](https://conda.io/projects/conda/en/latest/user-guide/tasks/manage-environments.html)
- [Install External Libraries and Kernels in Amazon SageMaker Studio](https://docs.aws.amazon.com/sagemaker/latest/dg/studio-notebooks-add-external.html)
