biologyOrderPredictor

Authors: Lévi-dan Azoulay, Shana Zirah, Jonas Benhammou, Gaspard André, Nathane Berrebi and Ali Bellamine

This project was carried out in the Data Science Master's program (M2DS) of the Paris Polytechnique institute, for the Datacamp class.

Install

Dependencies

    python3 -m venv .venv
    source .venv/bin/activate

    pip install --upgrade pip
    pip install -r requirements.txt

Downloading data

Because the data cannot be released publicly, you need to provide a download token to retrieve it.

    python3 download_data.py [download_token]
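
If you prefer not to type the token on the command line, the same call can be wrapped in a few lines of Python. This is only a sketch: it assumes, as in the command above, that `download_data.py` takes the token as its single positional argument. The helper names `build_download_command` and `download` are hypothetical and not part of the project.

```python
import subprocess
import sys


def build_download_command(token, script="download_data.py"):
    """Build the argument list for the download script (hypothetical helper)."""
    if not token:
        raise ValueError("a non-empty download token is required")
    # Assumption: the script takes the token as its only positional argument.
    return [sys.executable, script, token]


def download(token):
    """Run the download script and raise if it exits with an error."""
    subprocess.run(build_download_command(token), check=True)
```

Calling `download(token)` with a token read from a source of your choice (for example an environment variable) then runs the script with the current Python interpreter.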

Getting started

Open the getting started notebook for a first look at the provided data.

The starting kit notebook provides a more thorough data analysis and a proposed algorithm.

Two classifiers are provided:

  • A dummy classifier (dummy)
  • The starting kit classifier (starting_kit)

You can run them with the ramp-test command:

    ramp-test --submission dummy # Running dummy classifier
    ramp-test --submission starting_kit # Running starting kit classifier
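
To give an idea of what a submission can look like, here is a minimal sketch of a baseline classifier. It is not the project's dummy submission: it assumes that `problem.py` uses a workflow expecting a `Classifier` class with `fit` and `predict_proba` methods, and that the target is a one-dimensional array of class labels. The actual interface and target format are defined by `problem.py` and may differ.

```python
import numpy as np


class Classifier:
    """Baseline that ignores the features and predicts the training label frequencies."""

    def fit(self, X, y):
        # Assumption: y is a 1-D array of class labels.
        # A multi-label target would need a different baseline.
        y = np.asarray(y)
        self.classes_, counts = np.unique(y, return_counts=True)
        self.priors_ = counts / counts.sum()
        return self

    def predict_proba(self, X):
        # Repeat the class frequencies once per input row.
        return np.tile(self.priors_, (len(X), 1))
```

RAMP kits conventionally keep each submission in its own folder under `submissions/`, and the folder name is what is passed to `ramp-test --submission`.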