Miscellaneous codes and writings for MLOps
-
Updated
Jun 18, 2024 - Jupyter Notebook
Miscellaneous codes and writings for MLOps
A repository of notebooks and data sources for data engineers, data analysts and data scientists, chiefly proof of concept level
Python scripts to process, and analyze log files using PySpark.
Final Project for Harvard's Scala for Big Data Systems course
contains notebooks on topic modeling, spark and pandas implementation
Instructions and code for the workshop "From Big Data to NLP Insights: Unlocking the Power of PySpark and Spark NLP"
NLP functions with John Snow's Spark NLP in the Java language
This project is a Spark ML pipeline using Pyspark for NLP, using annotators: DocumentAssembler, Tokenizer, WordEmbeddingsModel, PerceptronModel & NerCrfModel. It prints a transformed DataFrame showing POS & NER columns, and analyzes any relationship between found entities & their POS attributes. Hands-on experience with Spark, Pyspark & Spark-NLP.
Project that captures information about all Dark Souls 3 (DS3) weapons and performs textual analysis on.
SparkNLP and Healthcare SparkNLP based analysis of scientific literature on equine colic.
Compilation of NLP notebooks from various sources that address several technical challenges.
An implementation of NLP Sandbox PHI Annotator API based on Spark NLP
Final project of "Big Data Analytics and Business Intelligence" course.
Add a description, image, and links to the spark-nlp topic page so that developers can more easily learn about it.
To associate your repository with the spark-nlp topic, visit your repo's landing page and select "manage topics."