All notable changes to this project will be documented in this file.
The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.
<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: #003399"> ENHANCEMENT </span>
Update CI Pipeline and dev dependencies, target Python 3.8 issue #154, by Andreas Schuderer.<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: black"> REMOVED </span>
Drop support for Python 3.6, issue #154, by Andreas Schuderer.
<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: #339933"> FEATURE </span>
Add training metadata reporting and querying, issue #142, by Andreas Schuderer.<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: #339933"> FEATURE </span>
Add support for typed CSVs (optiondtypes_path
ofFileDataSource <mllaunchpad.datasources.FileDataSource>
andFileDataSink <mllaunchpad.datasources.FileDataSink>
), issue #127, by Elisa Partodikromo.<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: #339933"> FEATURE </span>
Add Spark support (experimental), seeexamples/spark_datasource.py
and issue #145, by Andreas Schuderer.<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: #003399"> ENHANCEMENT </span>
FileDataSink (csv, euro_csv, raw DataSink types) now attempts to create missing paths if required, issue #148, by Andreas Schuderer.<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: #993300"> FIXED </span>
Keep generated RAML files free of command line messages, issue #126, by Andreas Schuderer.<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: #993300"> FIXED </span>
Change default text file encoding to UTF-8 for config files, text_file Data Sources/Sinks and JSON model metadata. NOTE: If you have been using non-ASCII characters in any of the above, you will need to check that the encoding of the relevant existing file(s) is UTF-8. issue #128, by Andreas Schuderer.<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: #993300"> FIXED </span>
Explicitly support repeated query parameters (and array types like number[]) through RAML definition. Remove IP from training metadata due to problem on macOS Big Sur. Fix doc build by pinning Sphinx dependency docutils to version 0.17.1. issue #147, by Andreas Schuderer.
<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: #993300"> FIXED </span>
mllaunchpad --verbose
now correctly logs DEBUG information, issue #119, by Andreas Schuderer.<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: #993300"> FIXED </span>
Fixed an issue where builtin DataSources could not be found when configured, issue #118, by Andreas Schuderer.<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: #993300"> FIXED </span>
Readthedocs now shows the up-to-dateAPI docs <mllaunchpad>
, issue #110, by Andreas Schuderer.<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: #003399"> ENHANCEMENT </span>
Addedchunksize
parameter for piecemeal data reading to builtin DataSources, issue #120, by Andreas Schuderer.<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: #339933"> FEATURE </span>
Added functionality to better support unit testing in model development (added optional parameters tomllaunchpad.train_model
,mllaunchpad.retest
andmllaunchpad.predict
, addedmllaunchpad.get_validated_config_str
), issue #116, by Andreas Schuderer.<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: #339933"> FEATURE </span>
Added generic SqlDataSource for RedShift, Postgres, MySQL, SQLite, Oracle, Microsoft SQL (ODBC), and their dialects, issue #121, by Andreas Schuderer.<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: #003399"> ENHANCEMENT </span>
New command line interface (usage changes only slightly, see issue), issue #77, by Andreas Schuderer.<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: #003399"> ENHANCEMENT </span>
DataSource caching overhaul: data cached separately for differentparams <mllaunchpad.datasources.FileDataSource.get_dataframe>
, configurablecache_size
, issue #97, by Andreas Schuderer.<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: black"> REMOVED </span>
Removed 'api:version:' (deprecated since 0.1.0) from configuration ('model:version:' is now the only location to specify both the model and the API version), issue #66, by Andreas Schuderer.
<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: #993300"> FIXED </span>
Oracle DataSource'sget_dataframe
now interpretsNull
asnan
, issue #86, by Bob Platte.<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: #003399"> ENHANCEMENT </span>
Add a truckload of unit tests, issue #46, by Andreas Schuderer.
<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: #993300"> FIXED </span>
Fix missing classifiers on PyPI, by Andreas Schuderer.
<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: #993300"> FIXED </span>
Fix misleading error message at WSGI entry point if model could not be loaded, issue #61, by Bob Platte.<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: #993300"> FIXED </span>
Use correct reference to werkzeug's FileStorage, issue #63, by Andreas Schuderer.<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: #003399"> ENHANCEMENT </span>
Document DataSources and DataSinks, issue #88, by Andreas Schuderer.<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: #003399"> ENHANCEMENT </span>
Document configuration, issue #67, by Andreas Schuderer.<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: #003399"> ENHANCEMENT </span>
Add utility user functionorder_columns
for enforcing equal data column order between data sources and API parameters, issue #37, by Andreas Schuderer.<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: #003399"> ENHANCEMENT </span>
Config file is now being checked for omitted required keys, PR #65, by Andreas Schuderer.<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: #339933"> FEATURE </span>
Add Impala DataSource (in examples), issue #4, by Elisa Partodikromo.<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: orange"> DEPRECATED </span>
'api:version:' to be removed from configuration ('model:version:' will be the only location to specify both the model and the API version), issue #66, by Andreas Schuderer.
<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: #993300"> FIXED </span>
Fix examples which could not be run on Windows, issue #34, by Andreas Schuderer.<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: #993300"> FIXED </span>
Correcting variable names in TEMPLATE_cfg.yml, issue #43, by Bart Driessen.<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: #993300"> FIXED </span>
Changed config fallback file name to the more ugly ./LAUNCHPAD_CFG.yml, direct commit, by Andreas Schuderer.<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: #993300"> FIXED </span>
Migrate frompipenv
topip
withrequirements/*.txt
, issue #36, by Andreas Schuderer.<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: #003399"> ENHANCEMENT </span>
Extend documentation: getting started, use case, structure, deployment requirements, usage, issue #18, by Andreas Schuderer.<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: #003399"> ENHANCEMENT </span>
Improve contribution documentation, issue #35, by Gosia Rorat.<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: #339933"> FEATURE </span>
Added funcionality to include sub-config support, issue #28, by Elisa Partodikromo.<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: #339933"> FEATURE </span>
Added file upload support (multipart/form-data, experimental), PR #47, by Andreas Schuderer.
<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: #993300"> FIXED </span>
Link from GitHub README to documentation, issue #18, by Andreas Schuderer.
<span style="font-family: Sans-Serif; font-size: 0.6em; color: white; font-weight: bold; padding: 0.05em; border-radius: 0.2em; display: inline-block; background-color: #339933"> FEATURE </span>
First release on PyPI, by Andreas Schuderer.