Due to the lapse in federal government funding, NASA is not updating this website. We sincerely regret this inconvenience.
NASA Logo
Ocean Color Science Software

ocssw V2022

Public Member Functions

def __init__ (self, timeout=5, max_tries=5, verbose=False, clobber=False)
 
def open_url (self, url, ntries=None, get=False)
 
def needs_download (self, url, filepath, check_times=False, response=None)
 
def download_file (self, url, filepath)
 
def list_pageurls (self, url, regex='')
 
def download_allfiles (self, url, dirpath, regex='', check_times=False, response=None, clobber=False, dry_run=False)
 
def spider (self, url, level=0, visited=None)
 

Public Attributes

 timeout
 
 max_tries
 
 verbose
 
 clobber
 
 session
 

Detailed Description

Definition at line 137 of file SessionUtils.py.

Constructor & Destructor Documentation

◆ __init__()

def __init__ (   self,
  timeout = 5,
  max_tries = 5,
  verbose = False,
  clobber = False 
)

Definition at line 139 of file SessionUtils.py.

Member Function Documentation

◆ download_allfiles()

def download_allfiles (   self,
  url,
  dirpath,
  regex = '',
  check_times = False,
  response = None,
  clobber = False,
  dry_run = False 
)
Downloads all available files from a remote url into a local dirpath.
Default is to download only if local file doesn't match remote size;
set clobber=True to always download.

Definition at line 271 of file SessionUtils.py.

◆ download_file()

def download_file (   self,
  url,
  filepath 
)

Definition at line 235 of file SessionUtils.py.

◆ list_pageurls()

def list_pageurls (   self,
  url,
  regex = '' 
)
Returns a sorted, unique set of links from a given url.
Optionally specify regex to filter for acceptable files;
default is to list only links starting with url.

Definition at line 246 of file SessionUtils.py.

◆ needs_download()

def needs_download (   self,
  url,
  filepath,
  check_times = False,
  response = None 
)
Returns False if filepath is present and size matches remote url;
True otherwise.  Optionally check timestamp as well.

Definition at line 205 of file SessionUtils.py.

◆ open_url()

def open_url (   self,
  url,
  ntries = None,
  get = False 
)
Return requests.Session object for specified url.
Retries up to self.max_tries times if server is busy.
By default, retrieves header only.

Definition at line 146 of file SessionUtils.py.

◆ spider()

def spider (   self,
  url,
  level = 0,
  visited = None 
)
Demo crawler

Definition at line 302 of file SessionUtils.py.

Member Data Documentation

◆ clobber

clobber

Definition at line 143 of file SessionUtils.py.

◆ max_tries

max_tries

Definition at line 141 of file SessionUtils.py.

◆ session

session

Definition at line 144 of file SessionUtils.py.

◆ timeout

timeout

Definition at line 140 of file SessionUtils.py.

◆ verbose

verbose

Definition at line 142 of file SessionUtils.py.


The documentation for this class was generated from the following file:
  • /gfs-oceanweb/web/ocssw/ocssw_src/src/scripts/seadasutils/SessionUtils.py