Skip to content
View CyrilJl's full-sized avatar

Block or report CyrilJl

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
CyrilJl/README.md

Banner

Welcome to my GitHub! 👋

My name is Cyril, and I am an applied mathematics engineer, interested in machine learning, data assimilation, time series, and cartographic data more generally! You will find on my GitHub some Python package repositories available on PyPI and conda-forge:

Machine Learning
  • optimask: For managing missing data in arrays. Allows maximizing the number of valid data before learning a model.
  • datafiller: For imputing missing data in a 2D numpy array. Provides also a wrapper for pandas time series dataframes. An easy-to-implement and efficient package, based on optimask and scikit-learn.
  • apyxl: A simple wrapper around xgboost, shap, and hyperopt to produce explainable non-linear regressions in one line of code. apyxl is not intended for production but rather as an aid to understanding or a first approach to a dataset.
Large Data
  • batchstats: The extension of numpy for calculating statistics of large data larger than available memory or for data arriving in batches.
Weather Data
Geospatial Data
  • rasterizer: Easy and efficient rasterization of polygons and lines, from geopandas to xarray.
Visualization
  • mapflow: A package to export spatio-temporal xarray.DataArray as video files, ideal for better understanding the data and the underlying physical phenomena.

I aim to produce well-written, documented, and easy-to-use open-source packages. Do not hesitate to open an issue if you encounter a bug or difficulty. 🙂

Pinned Loading

  1. datafiller datafiller Public

    Data imputation

    Python

  2. MeteoFetch MeteoFetch Public

    Récupérez facilement les dernières prévisions Arome et Arpege de MétéoFrance et IFS de l'ECMWF

    Python 12 1

  3. OptiMask OptiMask Public

    Efficiently finding the largest (non-necessarily contiguous) sub-matrix without NaN in Python

    Jupyter Notebook 1 2

  4. BatchStats BatchStats Public

    Python package for efficient, online statistical computations on streaming or large-scale data

    Python 1

  5. apyxl apyxl Public

    apyxl simplifies non-linear regressions/classifications and model explainability for all users

    Jupyter Notebook

  6. mapflow mapflow Public

    xarray dataarrays to videos

    Python