Metadata-Version: 2.1
Name: rs_datasets
Version: 0.5.0
Summary: Tool for autodownloading recommendation systems datasets
Home-page: https://github.com/Darel13712/rs_datasets
Author: Yan-Martin Tamm
Author-email: darel142857@gmail.com
License: UNKNOWN
Description: # Welcome to rs_datasets
        
        ![](https://img.shields.io/pypi/v/rs_datasets?color=%2300ccff)
        ![](https://img.shields.io/badge/datasets-15-00ccff)
        
        This tool lets you download, unpack, and read
        recommender systems datasets into a `pandas.DataFrame` as easily as `data = Dataset()`.
        
        ## Installation
        
        ```
        pip install rs_datasets
        ```
        
        ## Documentation
        See the [documentation](https://darel13712.github.io/rs_datasets/) for the list of
        available datasets and usage examples.
        
        ## Example of use
        
        ```python
        from rs_datasets import MovieLens
        ml = MovieLens()
        ml.info()
        ```
        ```text
        ratings
           user_id  item_id  rating  timestamp
        0        1        1     4.0  964982703
        1        1        3     4.0  964981247
        2        1        6     4.0  964982224
        items
           item_id  ...                                       genres
        0        1  ...  Adventure|Animation|Children|Comedy|Fantasy
        1        2  ...                   Adventure|Children|Fantasy
        2        3  ...                               Comedy|Romance
        [3 rows x 3 columns]
        tags
           user_id  item_id              tag   timestamp
        0        2    60756            funny  1445714994
        1        2    60756  Highly quotable  1445714996
        2        2    60756     will ferrell  1445714992
        links
           item_id  imdb_id  tmdb_id
        0        1   114709    862.0
        1        2   113497   8844.0
        2        3   113228  15602.0
        ```
        The loaded DataFrames are available as attributes of the dataset object, under the names shown above (for example, `ml.ratings`).
        
        ## Note
        
        This package relies on `datatable` to read files.
        There are known issues with reading some of the datasets. They should be fixed by the release of `datatable==1.1.0`,
        but `datatable` releases are infrequent. If you have problems reading a dataset, try downgrading `datatable`
        to 0.11 or 0.9, or install a dev build `1.1.0a2102` or newer from [s3](https://h2o-release.s3.amazonaws.com/datatable/index.html).
        To install a dev build, find the wheel for your Python version, copy its link, and run `pip install <link>`. Sorry for the inconvenience.
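        
        Before changing versions, it may help to check which `datatable` release is currently installed.
        The snippet below is a minimal sketch, not part of `rs_datasets`: `installed_version` is a hypothetical
        helper name, and it assumes Python 3.8 or newer, where `importlib.metadata` is in the standard library.
        ```python
        from importlib import metadata
        # Hypothetical helper (not part of rs_datasets): look up an installed package version.
        def installed_version(package):
            """Return the installed version string of `package`, or None if it is not installed."""
            try:
                return metadata.version(package)
            except metadata.PackageNotFoundError:
                return None
        # Prints the installed datatable version, or None when datatable is absent.
        print("datatable:", installed_version("datatable"))
        ```
        If the printed version is not one of those mentioned above, that is a reasonable hint to try a downgrade or a dev build.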
        
Platform: UNKNOWN
Classifier: Programming Language :: Python :: 3
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Requires-Python: >=3.6
Description-Content-Type: text/markdown
