{
 "cells": [
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "# Overview\n",
    "\n",
    "There are four possible providers of compute services, or consulting services on campus:\n",
    "\n",
    "- [Cornell Center of Social Sciences (CCSS)](https://socialsciences.cornell.edu/computing-and-data/cloud-computing-solutions) offers general purpose and secure computing services.\n",
    "- [Center for Advanced Computing (CAC)](https://www.cac.cornell.edu/about/Default.aspx) offers advanced and cloud computing solutions, including in the [national (ACCESS-CI)](https://access-ci.org/) and [state (NYSTAR)](http://esd.ny.gov/nystar/) computing infrastructure \n",
    "- [Biotechnology Resource Center (BioHPC)](https://www.biotech.cornell.edu/facilities-brc) offers advanced on-campus computing solutions (including managing the Econ Department's own cluster)\n",
    "- [Cornell Information Technology (CIT)](https://it.cornell.edu/) is the general Cornell IT department, but can also engineer solutions for cloud services (Cornell has framework subscriptions to [Amazon Web Service (AWS/EC2)](https://it.cornell.edu/cornell-cloud/amazon-web-services-aws-contract) and [Microsoft cloud services (Azure)](https://it.cornell.edu/cornell-cloud/cornell-azure-contract)).\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 11,
   "metadata": {
    "tags": [
     "full-width",
     "remove-input"
    ]
   },
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Project root directory: /mnt/local/raid5_1/vilhuber/Workspace-non-encrypted/git/LDI/ecco-notes\n"
     ]
    },
    {
     "data": {
      "text/html": [
       "<table border=\"1\" class=\"dataframe table table-striped table-bordered table-sm\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th>System</th>\n",
       "      <th>OS</th>\n",
       "      <th>maxNodeRAM</th>\n",
       "      <th>maxNodecores</th>\n",
       "      <th>totalRAM</th>\n",
       "      <th>totalcores</th>\n",
       "      <th>Shared</th>\n",
       "      <th>goodfor</th>\n",
       "      <th>Cost</th>\n",
       "      <th>Further Info Link</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <td>CCSS</td>\n",
       "      <td>Windows</td>\n",
       "      <td>256GB</td>\n",
       "      <td>4</td>\n",
       "      <td>about 1024GB</td>\n",
       "      <td>about 16</td>\n",
       "      <td>Yes</td>\n",
       "      <td>Desktop computing plus some GPU</td>\n",
       "      <td>Free</td>\n",
       "      <td><a href='https://socialsciences.cornell.edu/research-support/software' target='_blank'>Available software</a></td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>BioHPC reserved</td>\n",
       "      <td>Linux</td>\n",
       "      <td>1024GB</td>\n",
       "      <td>32</td>\n",
       "      <td>3177GB</td>\n",
       "      <td>360</td>\n",
       "      <td>No</td>\n",
       "      <td>low-power chunked HPC</td>\n",
       "      <td>Free</td>\n",
       "      <td><a href='https://biohpc.cornell.edu/lab/ecco.htm' target='_blank'>Description on BioHPC website</a></td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>BioHPC SLURM</td>\n",
       "      <td>Linux</td>\n",
       "      <td>up to 1024GB</td>\n",
       "      <td>up to 32</td>\n",
       "      <td>up to 1129 GB</td>\n",
       "      <td>up to  144</td>\n",
       "      <td>No</td>\n",
       "      <td>low-power fine-grained HPC</td>\n",
       "      <td>Free</td>\n",
       "      <td><a href='#job-scheduler-experimental' target='_blank'>See info below</a></td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>BioHPC general</td>\n",
       "      <td>Linux</td>\n",
       "      <td>1024GB</td>\n",
       "      <td>56</td>\n",
       "      <td>not sure</td>\n",
       "      <td>not sure</td>\n",
       "      <td>No</td>\n",
       "      <td>HPC some GPU</td>\n",
       "      <td>Fee</td>\n",
       "      <td><a href='https://biohpc.cornell.edu/lab/hardware.aspx' target='_blank'>Full list of hardware</a></td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>CAC Red Cloud</td>\n",
       "      <td>Linux + Windows</td>\n",
       "      <td>240GB</td>\n",
       "      <td>128</td>\n",
       "      <td>not known</td>\n",
       "      <td>not known</td>\n",
       "      <td>No</td>\n",
       "      <td>HPC + some GPU</td>\n",
       "      <td>Fee</td>\n",
       "      <td><a href='https://www.cac.cornell.edu/services/cloudServices.aspx' target='_blank'>More info</a></td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>DigitalOcean</td>\n",
       "      <td>Linux</td>\n",
       "      <td>256GB</td>\n",
       "      <td>48</td>\n",
       "      <td>unlimited</td>\n",
       "      <td>unlimited</td>\n",
       "      <td>No</td>\n",
       "      <td>HPC + some GPU</td>\n",
       "      <td>Fee</td>\n",
       "      <td><a href='https://www.digitalocean.com/pricing/droplets' target='_blank'>More info</a></td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>AWS</td>\n",
       "      <td>Linux + Windows</td>\n",
       "      <td>1024GB</td>\n",
       "      <td>128</td>\n",
       "      <td>unlimited</td>\n",
       "      <td>unlimited</td>\n",
       "      <td>No</td>\n",
       "      <td>HPC + some GPU</td>\n",
       "      <td>Fee</td>\n",
       "      <td><a href='https://aws.amazon.com/ec2/instance-types/' target='_blank'>More info</a></td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>Azure</td>\n",
       "      <td>Linux + Windows</td>\n",
       "      <td>3800GB</td>\n",
       "      <td>416</td>\n",
       "      <td>unlimited</td>\n",
       "      <td>unlimited</td>\n",
       "      <td>No</td>\n",
       "      <td>HPC + some GPU</td>\n",
       "      <td>Fee</td>\n",
       "      <td><a href='https://learn.microsoft.com/en-us/azure/virtual-machines/sizes' target='_blank'>More info</a></td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>"
      ],
      "text/plain": [
       "<IPython.core.display.HTML object>"
      ]
     },
     "execution_count": 11,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "import pandas as pd\n",
    "import os\n",
    "from IPython.display import HTML\n",
    "\n",
    "\n",
    "def find_project_root(start_dir='.', file_to_find='favicon.ico'):\n",
    "    \"\"\"\n",
    "    Find the project root directory by searching for a specific file.\n",
    "    \n",
    "    Args:\n",
    "        start_dir (str): The directory to start the search from.\n",
    "        file_to_find (str): The name of the file to search for.\n",
    "        \n",
    "    Returns:\n",
    "        str: The absolute path of the project root directory.\n",
    "    \"\"\"\n",
    "    current_dir = os.path.abspath(start_dir)\n",
    "    \n",
    "    while True:\n",
    "        # Check if the file exists in the current directory\n",
    "        if os.path.isfile(os.path.join(current_dir, file_to_find)):\n",
    "            return current_dir\n",
    "        \n",
    "        # Check if we've reached the root directory\n",
    "        parent_dir = os.path.dirname(current_dir)\n",
    "        if parent_dir == current_dir:\n",
    "            raise FileNotFoundError(f\"Could not find '{file_to_find}' in the directory tree.\")\n",
    "        \n",
    "        # Move up one directory\n",
    "        current_dir = parent_dir\n",
    "\n",
    "# Example usage\n",
    "project_root = find_project_root()\n",
    "#print(f\"Project root directory: {project_root}\")\n",
    "\n",
    "\n",
    "\n",
    "# Read the CSV file\n",
    "df = pd.read_csv(os.path.join(project_root,\"_data\",\"eccochoice.csv\"))\n",
    "\n",
    "# Create a new column with the hyperlink\n",
    "df['Further Info Link'] = \"<a href='\" + df['Further info URL'] + \"' target='_blank'>\" + df['Further info'] + \"</a>\"\n",
    "\n",
    "# Drop the original columns if you don't need them in the table\n",
    "df = df.drop(['Further info', 'Further info URL'], axis=1)\n",
    "\n",
    "# Convert the DataFrame to an HTML table\n",
    "table = df.to_html(index=False, \n",
    "                   classes='table table-striped table-bordered table-sm', escape=False, \n",
    "                   render_links=True)\n",
    "\n",
    "\n",
    "# Render the HTML table in Jupyter Notebook\n",
    "HTML(table)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "\n",
    "The table is sorted by increasing complexity of preparation and cost (which are collinear). A starting point on how to easily use the latter cloud resources in a (relatively) easy way are\n",
    "\n",
    "- the nice [tutorial by Andrew Heiss](https://www.andrewheiss.com/blog/2018/07/30/disposable-supercomputer-future/) \n",
    "- the [analogsea](https://github.com/pachadotdev/analogsea) and [future](https://cran.r-project.org/web/packages/future/index.html) R packages."
   ]
  }
 ],
 "metadata": {
  "kernelspec": {
   "display_name": "Python 3",
   "language": "python",
   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
    "name": "ipython",
    "version": 3
   },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
   "version": "3.11.8"
  },
  "widgets": {
   "application/vnd.jupyter.widget-state+json": {
    "state": {},
    "version_major": 2,
    "version_minor": 0
   }
  }
 },
 "nbformat": 4,
 "nbformat_minor": 4
}