{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# Access and cite point observation data" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "To launch this notebook interactively in a Jupyter notebook-like browser interface, please click the \"Launch Binder\" button below. Note that Binder may take several minutes to launch.\n", "\n", "[](https://mybinder.org/v2/gh/hydroframe/subsettools-binder/HEAD?labpath=hf_hydrodata/point/example_get_data.ipynb)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "This notebook provides a walk-through of some example functionality for accessing and citing point observations data and site-level metadata using hf_hydrodata's `get_point_data` and `get_point_metadata` functions. Please see the full [point module](https://hf-hydrodata.readthedocs.io) documentation for information on what data is available, our data collection process, and new features we are working on! Our [Metadata Description](https://hf-hydrodata.readthedocs.io/en/latest/available_metadata.html#point-observations-metadata) page itemizes the fields that get returned from `get_point_metadata`." ] }, { "cell_type": "code", "execution_count": 1, "metadata": {}, "outputs": [], "source": [ "# Import packages\n", "import sys\n", "import os\n", "import pandas as pd\n", "from hf_hydrodata import register_api_pin, get_point_data, get_point_metadata, get_citations" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "# You need to register on https://hydrogen.princeton.edu/pin \n", "# and run the following with your registered information\n", "# before you can use the hydrodata utilities\n", "register_api_pin(\"your_email\", \"your_pin\")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Define input parameters" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Note that `get_point_data` and `get_point_metadata` require mandatory parameters of `dataset`, `variable`, `temporal_resolution`, and `aggregation` (and `depth_level` if asking for soil moisture data). Please see [the documentation](https://hf-hydrodata.readthedocs.io/en/latest/available_data.html) for information about what point observation datasets are available and the parameters used to query them. \n", "\n", "The [hf_hydrodata API Reference](https://hf-hydrodata.readthedocs.io/en/latest/hf_hydrodata.point.html) includes information on what optional filtering parameters are available. These include filters for things like a geographic region or date range. Those parameters work cumulatively, so if `state` and `site_ids` are both supplied, for example, then only sites within `site_ids` that are *also* in `state` will be returned." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Example 1: Specify a date range and geographic bounding box" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "In this example, a specific start and end date are provided, along with a geographic domain. Start and end dates, if provided, must be in 'YYYY-MM-DD' format. If a start date is not provided, data is returned from as early as it is available. Likewise, if an end date is not provided, data is returned through as current as is available.\n", "\n", "Geographic domain specifications, if provided, can be in the form of latitude and/or longitude bounds, a 2-digit state postal code (`state`='NJ'), a specific list of site IDs (see example 2 below), or a shapefile (see example notebook \"[How To Filter Sites by USGS HUC Boundaries](https://hf-hydrodata.readthedocs.io/en/latest/point_data/examples/example_shapefile.html)\"). If no geography restriction is included, sites from the entire continental United States will be returned. In many cases, this exceeds a user's single-request limit of 1GB. Please add additional geography and/or date filters as needed to keep requests within this limit." ] }, { "cell_type": "code", "execution_count": 2, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
| \n", " | date | \n", "01011000 | \n", "01013500 | \n", "01015800 | \n", "01017000 | \n", "01017550 | \n", "01018000 | \n", "01019000 | \n", "01027200 | \n", "01029200 | \n", "... | \n", "01046500 | \n", "01129200 | \n", "01010000 | \n", "01010070 | \n", "01010500 | \n", "01014000 | \n", "01018500 | \n", "01021000 | \n", "04264331 | \n", "04294300 | \n", "
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | \n", "2002-01-01 | \n", "9.7069 | \n", "13.8104 | \n", "12.9048 | \n", "21.3099 | \n", "0.013301 | \n", "NaN | \n", "3.0847 | \n", "1.98666 | \n", "2.43663 | \n", "... | \n", "46.129 | \n", "23.9984 | \n", "11.9143 | \n", "1.48292 | \n", "24.0550 | \n", "61.411 | \n", "9.1126 | \n", "21.9042 | \n", "6084.5 | \n", "0.2547 | \n", "
| 1 | \n", "2002-01-02 | \n", "9.5371 | \n", "13.4142 | \n", "12.0558 | \n", "20.0364 | \n", "0.012169 | \n", "NaN | \n", "3.0564 | \n", "1.91874 | \n", "2.39135 | \n", "... | \n", "46.695 | \n", "23.8286 | \n", "11.6879 | \n", "1.41500 | \n", "23.4890 | \n", "59.713 | \n", "9.0277 | \n", "21.9042 | \n", "6056.2 | \n", "0.2547 | \n", "
| 2 | \n", "2002-01-03 | \n", "9.3390 | \n", "13.0746 | \n", "11.5181 | \n", "19.0742 | \n", "0.011886 | \n", "NaN | \n", "3.0281 | \n", "1.88195 | \n", "2.36305 | \n", "... | \n", "46.978 | \n", "23.8286 | \n", "11.5181 | \n", "1.35840 | \n", "23.0645 | \n", "58.581 | \n", "8.9145 | \n", "21.9042 | \n", "6084.5 | \n", "0.2547 | \n", "
| 3 | \n", "2002-01-04 | \n", "9.1692 | \n", "12.6501 | \n", "11.0936 | \n", "26.4322 | \n", "0.011320 | \n", "NaN | \n", "3.0564 | \n", "1.83667 | \n", "2.34890 | \n", "... | \n", "51.506 | \n", "23.6305 | \n", "11.2917 | \n", "1.31312 | \n", "22.6400 | \n", "57.449 | \n", "8.8579 | \n", "21.9042 | \n", "6056.2 | \n", "0.2547 | \n", "
| 4 | \n", "2002-01-05 | \n", "8.9994 | \n", "12.2822 | \n", "10.6691 | \n", "25.1870 | \n", "0.010754 | \n", "NaN | \n", "3.0281 | \n", "1.79139 | \n", "2.32060 | \n", "... | \n", "37.639 | \n", "23.6022 | \n", "11.0936 | \n", "1.27633 | \n", "22.2155 | \n", "56.317 | \n", "8.7447 | \n", "21.9042 | \n", "5546.8 | \n", "0.2830 | \n", "
5 rows × 32 columns
\n", "| \n", " | site_id | \n", "site_name | \n", "site_type | \n", "agency | \n", "state | \n", "latitude | \n", "longitude | \n", "first_date_data_available | \n", "last_date_data_available | \n", "record_count | \n", "... | \n", "doi | \n", "huc8 | \n", "conus1_x | \n", "conus1_y | \n", "conus2_x | \n", "conus2_y | \n", "gagesii_drainage_area | \n", "gagesii_class | \n", "gagesii_site_elevation | \n", "usgs_drainage_area | \n", "
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | \n", "01011000 | \n", "Allagash River near Allagash, Maine | \n", "stream gauge | \n", "USGS | \n", "ME | \n", "47.069722 | \n", "-69.079444 | \n", "1910-07-01 | \n", "2023-11-30 | \n", "34028 | \n", "... | \n", "None | \n", "01010002 | \n", "nan | \n", "nan | \n", "4210 | \n", "2783 | \n", "3186.8440 | \n", "Non-ref | \n", "187.0 | \n", "1478.00 | \n", "
| 1 | \n", "01013500 | \n", "Fish River near Fort Kent, Maine | \n", "stream gauge | \n", "USGS | \n", "ME | \n", "47.237500 | \n", "-68.582778 | \n", "1903-07-29 | \n", "2023-12-01 | \n", "36507 | \n", "... | \n", "None | \n", "01010003 | \n", "nan | \n", "nan | \n", "4237 | \n", "2810 | \n", "2252.6960 | \n", "Ref | \n", "157.0 | \n", "873.00 | \n", "
| 2 | \n", "01015800 | \n", "Aroostook River near Masardis, Maine | \n", "stream gauge | \n", "USGS | \n", "ME | \n", "46.523056 | \n", "-68.371667 | \n", "1957-09-14 | \n", "2023-12-01 | \n", "24185 | \n", "... | \n", "None | \n", "01010004 | \n", "nan | \n", "nan | \n", "4276 | \n", "2747 | \n", "2313.7550 | \n", "Non-ref | \n", "166.0 | \n", "892.00 | \n", "
| 3 | \n", "01017000 | \n", "Aroostook River at Washburn, Maine | \n", "stream gauge | \n", "USGS | \n", "ME | \n", "46.777222 | \n", "-68.157222 | \n", "1930-08-01 | \n", "2023-12-01 | \n", "34091 | \n", "... | \n", "None | \n", "01010004 | \n", "nan | \n", "nan | \n", "4281 | \n", "2773 | \n", "4278.9070 | \n", "Non-ref | \n", "131.0 | \n", "1654.00 | \n", "
| 4 | \n", "01017550 | \n", "Williams Brook at Phair, Maine | \n", "stream gauge | \n", "USGS | \n", "ME | \n", "46.628056 | \n", "-67.953056 | \n", "1999-11-01 | \n", "2023-12-01 | \n", "8797 | \n", "... | \n", "None | \n", "01010005 | \n", "nan | \n", "nan | \n", "4300 | \n", "2762 | \n", "10.0323 | \n", "Ref | \n", "176.0 | \n", "3.82 | \n", "
5 rows × 23 columns
\n", "| \n", " | site_id | \n", "site_name | \n", "site_type | \n", "agency | \n", "state | \n", "latitude | \n", "longitude | \n", "first_date_data_available | \n", "last_date_data_available | \n", "record_count | \n", "... | \n", "doi | \n", "huc8 | \n", "conus1_x | \n", "conus1_y | \n", "conus2_x | \n", "conus2_y | \n", "gagesii_drainage_area | \n", "gagesii_class | \n", "gagesii_site_elevation | \n", "usgs_drainage_area | \n", "
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | \n", "01013500 | \n", "Fish River near Fort Kent, Maine | \n", "stream gauge | \n", "USGS | \n", "ME | \n", "47.2375 | \n", "-68.582778 | \n", "1903-07-29 | \n", "2023-12-01 | \n", "36507 | \n", "... | \n", "None | \n", "01010003 | \n", "nan | \n", "nan | \n", "4237 | \n", "2810 | \n", "2252.696 | \n", "Ref | \n", "157.0 | \n", "873.0 | \n", "
1 rows × 23 columns
\n", "| \n", " | site_id | \n", "site_name | \n", "site_type | \n", "agency | \n", "state | \n", "latitude | \n", "longitude | \n", "first_date_data_available | \n", "last_date_data_available | \n", "record_count | \n", "... | \n", "doi | \n", "huc8 | \n", "conus1_x | \n", "conus1_y | \n", "conus2_x | \n", "conus2_y | \n", "gagesii_drainage_area | \n", "gagesii_class | \n", "gagesii_site_elevation | \n", "usgs_drainage_area | \n", "
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | \n", "01011000 | \n", "Allagash River near Allagash, Maine | \n", "stream gauge | \n", "USGS | \n", "ME | \n", "47.069722 | \n", "-69.079444 | \n", "1910-07-01 | \n", "2023-11-30 | \n", "34028 | \n", "... | \n", "None | \n", "01010002 | \n", "nan | \n", "nan | \n", "4210 | \n", "2783 | \n", "3186.844 | \n", "Non-ref | \n", "187.0 | \n", "1478.0 | \n", "
| 1 | \n", "01013500 | \n", "Fish River near Fort Kent, Maine | \n", "stream gauge | \n", "USGS | \n", "ME | \n", "47.237500 | \n", "-68.582778 | \n", "1903-07-29 | \n", "2023-12-01 | \n", "36507 | \n", "... | \n", "None | \n", "01010003 | \n", "nan | \n", "nan | \n", "4237 | \n", "2810 | \n", "2252.696 | \n", "Ref | \n", "157.0 | \n", "873.0 | \n", "
| 2 | \n", "01029500 | \n", "East Branch Penobscot River at Grindstone, Maine | \n", "stream gauge | \n", "USGS | \n", "ME | \n", "45.730278 | \n", "-68.589444 | \n", "1902-10-01 | \n", "2023-12-01 | \n", "37315 | \n", "... | \n", "None | \n", "01020002 | \n", "nan | \n", "nan | \n", "4293 | \n", "2656 | \n", "2816.295 | \n", "Non-ref | \n", "93.0 | \n", "837.0 | \n", "
3 rows × 23 columns
\n", "| \n", " | date | \n", "06614800 | \n", "06620000 | \n", "06701500 | \n", "06701900 | \n", "06707500 | \n", "06708800 | \n", "06709000 | \n", "06709530 | \n", "06710150 | \n", "... | \n", "382628104493700 | \n", "382629104493000 | \n", "383619104520401 | \n", "383637104531301 | \n", "383944104474201 | \n", "384037104472001 | \n", "384047104510301 | \n", "384048104504901 | \n", "384220104503701 | \n", "391504106225200 | \n", "
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | \n", "2005-01-01 | \n", "0.013584 | \n", "3.5375 | \n", "1.9244 | \n", "2.19325 | \n", "5.2921 | \n", "0.163574 | \n", "0.52072 | \n", "0.55751 | \n", "0.056600 | \n", "... | \n", "0.0 | \n", "0.0 | \n", "0.002547 | \n", "0.0 | \n", "0.0 | \n", "0.021508 | \n", "0.0 | \n", "0.008490 | \n", "0.0 | \n", "0.004245 | \n", "
| 1 | \n", "2005-01-02 | \n", "0.013301 | \n", "3.3960 | \n", "1.9244 | \n", "2.14514 | \n", "5.2072 | \n", "0.144896 | \n", "0.48110 | \n", "0.53770 | \n", "0.052355 | \n", "... | \n", "0.0 | \n", "0.0 | \n", "0.002547 | \n", "0.0 | \n", "0.0 | \n", "0.024621 | \n", "0.0 | \n", "0.008207 | \n", "0.0 | \n", "0.004245 | \n", "
| 2 | \n", "2005-01-03 | \n", "0.013301 | \n", "3.3111 | \n", "1.9244 | \n", "2.15080 | \n", "5.1506 | \n", "0.128765 | \n", "0.49525 | \n", "0.50374 | \n", "0.058015 | \n", "... | \n", "0.0 | \n", "0.0 | \n", "0.002547 | \n", "0.0 | \n", "0.0 | \n", "0.023772 | \n", "0.0 | \n", "0.007924 | \n", "0.0 | \n", "0.004245 | \n", "
| 3 | \n", "2005-01-04 | \n", "0.013301 | \n", "3.3960 | \n", "1.9244 | \n", "2.15080 | \n", "5.0091 | \n", "0.119992 | \n", "0.48110 | \n", "0.48110 | \n", "0.051506 | \n", "... | \n", "0.0 | \n", "0.0 | \n", "0.002547 | \n", "0.0 | \n", "0.0 | \n", "0.025470 | \n", "0.0 | \n", "0.007924 | \n", "0.0 | \n", "0.004245 | \n", "
| 4 | \n", "2005-01-05 | \n", "0.013301 | \n", "3.3960 | \n", "1.9244 | \n", "2.23853 | \n", "4.1035 | \n", "0.139236 | \n", "0.41601 | \n", "0.50374 | \n", "0.046412 | \n", "... | \n", "0.0 | \n", "0.0 | \n", "0.002547 | \n", "0.0 | \n", "0.0 | \n", "0.022923 | \n", "0.0 | \n", "0.007924 | \n", "0.0 | \n", "0.004245 | \n", "
5 rows × 269 columns
\n", "| \n", " | site_id | \n", "site_name | \n", "site_type | \n", "agency | \n", "state | \n", "latitude | \n", "longitude | \n", "first_date_data_available | \n", "last_date_data_available | \n", "record_count | \n", "... | \n", "doi | \n", "huc8 | \n", "conus1_x | \n", "conus1_y | \n", "conus2_x | \n", "conus2_y | \n", "gagesii_drainage_area | \n", "gagesii_class | \n", "gagesii_site_elevation | \n", "usgs_drainage_area | \n", "
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | \n", "06614800 | \n", "MICHIGAN RIVER NEAR CAMERON PASS, CO | \n", "stream gauge | \n", "USGS | \n", "CO | \n", "40.496094 | \n", "-105.865012 | \n", "1973-10-01 | \n", "2023-12-01 | \n", "18322 | \n", "... | \n", "None | \n", "10180001 | \n", "1054 | \n", "818 | \n", "1481 | \n", "1764 | \n", "4.0284 | \n", "Ref | \n", "3188.0 | \n", "1.54 | \n", "
| 1 | \n", "06620000 | \n", "NORTH PLATTE RIVER NEAR NORTHGATE, CO | \n", "stream gauge | \n", "USGS | \n", "CO | \n", "40.936639 | \n", "-106.339194 | \n", "1904-06-01 | \n", "2023-12-01 | \n", "39782 | \n", "... | \n", "None | \n", "10180001 | \n", "1020 | \n", "870 | \n", "1448 | \n", "1817 | \n", "3702.6370 | \n", "Non-ref | \n", "2388.0 | \n", "1431.00 | \n", "
| 2 | \n", "06701500 | \n", "SOUTH PLATTE RIVER BELOW CHEESMAN LAKE, CO | \n", "stream gauge | \n", "USGS | \n", "CO | \n", "39.209157 | \n", "-105.267773 | \n", "1924-10-01 | \n", "2007-09-29 | \n", "29217 | \n", "... | \n", "None | \n", "10190002 | \n", "1091 | \n", "671 | \n", "nan | \n", "nan | \n", "4557.0680 | \n", "Non-ref | \n", "2081.0 | \n", "1752.00 | \n", "
| 3 | \n", "06701900 | \n", "SOUTH PLATTE RIVER BLW BRUSH CRK NEAR TRUMBULL... | \n", "stream gauge | \n", "USGS | \n", "CO | \n", "39.259990 | \n", "-105.221938 | \n", "2002-07-19 | \n", "2023-12-01 | \n", "7792 | \n", "... | \n", "None | \n", "10190002 | \n", "nan | \n", "nan | \n", "1523 | \n", "1627 | \n", "5252.5570 | \n", "Non-ref | \n", "1990.0 | \n", "2028.00 | \n", "
| 4 | \n", "06707500 | \n", "SOUTH PLATTE RIVER AT SOUTH PLATTE, CO | \n", "stream gauge | \n", "USGS | \n", "CO | \n", "39.409156 | \n", "-105.169990 | \n", "1896-01-01 | \n", "2007-09-29 | \n", "32959 | \n", "... | \n", "None | \n", "10190002 | \n", "nan | \n", "nan | \n", "nan | \n", "nan | \n", "6689.0300 | \n", "Non-ref | \n", "1901.0 | \n", "2579.00 | \n", "
5 rows × 23 columns
\n", "