dbase
NOTE: You may want to use pandas instead of this.
The dbase.py
class, can be used to read/write/summarize/plot time-series data.
To summarize the functionality:
- data and variable names stored in a dictionary - accessible using variable names
- load/save from/to csv/pickle format, including date information (shelve format to be added)
- plotting and descriptive statistics, with dates if provided
- adding/deleting variables, including trends/(seasonal)dummies
- selecting observations based on dates or other variable values (e.g., > 1/1/2003)
- copying instance data
Attached also the dbase_pydoc.txt
information for the class.
Example Usage
To see the class in action download the file and run it (python dbase.py). This will create an example data file (./dbase_test_files/data.csv) that will be processed by the class.
To import the module:
import sys
sys.path.append('attachments/dbase')
import dbase
After running the class you can load the example data using
data = dbase.dbase("attachments/dbase/data.csv", date = 0)
In the above command ‘0’ is the index of the column containing dates.
You can plot series ‘b’ and ‘c’ in the file using
data.dataplot('b','c')
You get descriptive statistics for series ‘a’,’b’, and ‘c’ by using
data.info('a','b','c')
==============================================================================
============================ Database information ============================
==============================================================================
file: /mnt/data/pauli/prj/scipy/SciPy-CookBook/ipython/attachments/dbase/data.csv
# obs: 100
# variables: 3
Start date: 08 Jan 2001
End date: 02 Dec 2002
var min max mean std.dev miss levels
==============================================================================
a -2.56 3.35 -0.08 1.08 0
b -2.00 2.16 -0.02 0.98 0
c -1.91 2.54 0.18 0.93 0