Skip to content
View AkimfromParis's full-sized avatar
Block or Report

Block or report AkimfromParis

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
AkimfromParis/README.md

🇫🇷 Akim Mousterou | ムステロ・アキム

☕ NLP Engineer from Paris, France | パリ、フランスからの自然言語処理エンジニア

💾 & 🇯🇵 Master's degree, in Natural Language Processing - Multilingual engineering, Japanese at I.N.A.L.CO

(I.N.A.L.C.O 東洋言語文化学院、自然言語処理修士 | 卒業)

💵 Master's Degree in International Business (EMIB) at ESCP Europe

(ESCPヨーロッパ・ビジネススクール、経営管理修士 | 卒業)

🏁 I have managed AI and business initiatives for prestigious brands, publishing companies, technology companies, and financial institutions for over 10 years. Born and raised in Paris, I am fluent in French, English, Japanese (JLPT N2), and Spanish. I am passionate about quantitative finance, network effects, and natural language processing.

Please feel free to connect with me on GitHub, LinkedIn, Discord, or HuggingFace! 😃


Libérté, égalité, architecture open-sourcé... Open-source RAG LlamaIndex and multilingual LLM from Mistral AI in a low-resource environment over financial statements:

  • Vanilla RAG (retrieval-augmented generation) with LlamaIndex and pgvecto.rs from TensorChord
  • Quantized model of Mistal8x7B from Mistral AI and LlamaCPP inference tool
  • Test of BGE-M3 embedding model from BAAI
  • Plus benchmark in German on Porsche AG, in French on Hermès, and in Italian on Brunello Cucinelli

Generative AI for all - Quick implementation with an open-source RAG LlamaIndex and Japanese LLM from ELYZA, Inc. in a low-resource environment over legal documents:

  • RAG (retrieval-augmented generation) is LlamaIndex with a vanilla Hybrid search (combining retrieval from both text search and vector search)
  • Japanese LLM “ELYZA-japanese-Llama-2-7b-instruct” created by Japanese startup, ELYZA, Inc.
  • Open-source database PostgreSQL transformed into a vector database by the great library PG Vector
  • Plus Q&A analysis in Japanese, embedding pricing war, and generative AI strategy of France, USA, and Japan

NER-Luxury is a fine-tuned XLM-Roberta model for the subtask N.E.R (Named Entity Recognition) in English. NER-Luxury is domain-specific for the fashion and luxury industry with bespoke labels. NER-Luxury is trying to be a bridge between the aesthetic side and the quantitative side of the fashion and luxury industry.

  • 38.063 sentences in English (covering the beauty, fashion, and luxury industries)
  • 32 labels from companies, groups, and holdings to luxury brands, models, and magazines
  • Loss: 0.3990, Accuracy: 0.9427, F1: 0.7879

To promote communication between NLP practitioners, I created a vocabulary deck of 420 technical words for Anki Pro (learning software based on space repetition) in Japanese, English, and French for natural language processing.

  • Test on earning calls of Fast Retailing Co., Ltd 2022 with Whisper OpenAI

  • LDA analysis on shareholder's letter

  • Sustainability as a strategy in fashion and in NLP

  • Introduction from Pregroup Grammar, DisCoCat, to Lambeq
  • Specificities of the Japanese language
  • Pregroup Grammar in free word order

  • Quick financial analysis
  • Why is TikT0k a mistake in luxury?
  • Black Scholes (closed form) and the Greeks implemented in Python and in C++
  • Linguistic specificities for NLP in Japanese
  • Leveraging spaCy framework, and Ginza model, and building a custom NER model
  • Insights on Masayoshi Son (Softbank), Hiroshi Mikitani (Rakuten), and Haruhiko Kuroda (Bank of Japan)
  • Few thoughts on NLP in Japanese
  • Naive OLS Linear regression model
  • Facebook's Prophet model
  • Bayesian Linear regression model with STAN
  • And a quick introduction to the Hierarchical model
  • Natural Language Processing using NLTK and Vador
  • Time Series for sentiment analysis
  • "Meme" as a marketing or investment strategy?
  • Insights on the human perceptions of leadership on Twitter
  • Financial analysis of luxury groups in 2020 during a high volatility state with the spread of SARS-Cov2
  • Found an increasing correlation on Y-o-Y between luxury groups and the Facebook Inc. ecosystem
  • Conclusion for stakeholders and shareholders

Popular repositories

  1. RAG-Japanese RAG-Japanese Public

    Open source RAG with Llama Index for Japanese LLM in low resource settting

    Jupyter Notebook 5 1

  2. Mixtral_RAG Mixtral_RAG Public

    Open-source RAG using Mixtal 8x7B for financial statements in German, French, and Italian

    Jupyter Notebook 5

  3. quantumNLP_jp quantumNLP_jp Public

    Quantum Natural Language Processing for Japanese. Why the quantum approach will be better than large language models for the Japanese language.

    Jupyter Notebook 2 2

  4. ner_japanese ner_japanese Public

    The biggest tech investor Masayoshi Son is back on Twitter since 2020 but in Japanese. What he is saying?

    Jupyter Notebook 1

  5. anki_jap_ai_vocabulary anki_jap_ai_vocabulary Public

    Anki flashcards in Japanese for AI, ML, DL, NLP & CV - Collaborative deck V1

    1

  6. luxury_risk luxury_risk Public

    How the spread of SARS-Cov2 impacted luxury groups in 2020? How Instagram became more important than Vogue?

    Jupyter Notebook