GeofabrikDownloader.get_raw_directory_index

static GeofabrikDownloader.get_raw_directory_index(url, verbose=False)[source]

Get a raw directory index.

This includes logs of older files and their and download URLs.

Parameters
  • url (str) – a URL to the web resource

  • verbose (bool or int) – whether to print relevant information in console, defaults to False

Returns

a table of raw directory index

Return type

pandas.DataFrame or None

Examples:

>>> from pydriosm.downloader import GeofabrikDownloader

>>> geofabrik_downloader = GeofabrikDownloader()

>>> ex_url = 'https://download.geofabrik.de/europe/great-britain.html'

>>> raw_dir_idx = geofabrik_downloader.get_raw_directory_index(ex_url)

>>> print(raw_dir_idx.head())
                               File  ...                            FileURL
0             great-britain-updates  ...  https://download.geofabrik.de/...
1  great-britain-latest.osm.pbf.md5  ...  https://download.geofabrik.de/...
2  great-britain-200914.osm.pbf.md5  ...  https://download.geofabrik.de/...
3                 great-britain.kml  ...  https://download.geofabrik.de/...
4      great-britain-latest.osm.pbf  ...  https://download.geofabrik.de/...

[5 rows x 4 columns]

>>> ex_url = 'http://download.geofabrik.de/'

>>> raw_dir_idx = geofabrik_downloader.get_raw_directory_index(
...     ex_url, verbose=True)
The web page does not have a raw directory index.