Never Forget Another Line of Code

Datasnips is a free code snippet hosting platform for Data Science & AI. It keeps your code snippets organized, searchable and shareable.

Lemmatise DataFrame Text Using NLTK

Python

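import nltk
from nltk.stem import WordNetLemmatizer

# Download the WordNet corpus that the lemmatizer depends on (only needed once).
nltk.download('wordnet')

lemmatizer = WordNetLemmatizer()

def lemmatize_words(text):
    # Split on whitespace and lemmatize each token as a verb (pos='v').
    words = text.split()
    words = [lemmatizer.lemmatize(word, pos='v') for word in words]
    return ' '.join(words)

# Assumes df is an existing pandas DataFrame with a string column named 'text'.
df['text'] = df['text'].apply(lemmatize_words)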

detro

Nltk | Nlp | Lemmatise
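
The snippet above calls text.split() on every value, so it would raise an error on rows where the text column holds a missing value (None or NaN). A minimal sketch of a null-safe variant follows. The names lemmatize_text, toy_lemmatize and TOY_LEMMAS are hypothetical and not part of the original snippet, and the toy dictionary is only a stand-in so the example runs without NLTK or a corpus download.

```python
from typing import Callable


def lemmatize_text(text: object, lemmatize_word: Callable[[str], str]) -> object:
    """Lemmatize each whitespace-separated token; pass non-strings through unchanged."""
    if not isinstance(text, str):
        # Missing values (None, NaN) have no .split(), so return them as-is.
        return text
    return ' '.join(lemmatize_word(word) for word in text.split())


# Hypothetical toy lookup standing in for a real lemmatizer, used only to keep
# this example runnable without downloading any corpus.
TOY_LEMMAS = {'running': 'run', 'ate': 'eat', 'is': 'be'}


def toy_lemmatize(word: str) -> str:
    # Return the known lemma, or the word itself when it is not in the lookup.
    return TOY_LEMMAS.get(word, word)


print(lemmatize_text('she is running', toy_lemmatize))  # she be run
print(lemmatize_text(None, toy_lemmatize))               # None
```

With NLTK, the callable could be something like lambda w: lemmatizer.lemmatize(w, pos='v'), and the helper could be applied to the column with df['text'].apply(lambda t: lemmatize_text(t, that_callable)). Passing the lemmatizer in as an argument is a design choice that keeps the helper easy to test in isolation.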

Remove Stop Words from Text in DataFrame Column

Python

Pandas | Nltk | Nlp

133

LightGBM Custom Loss Function

Python

LightGBM | Loss function

121

Tuning XGBoost Hyperparameters with Grid Search

Python

XGBoost | Tuning

117

How to Convert DataFrame Values Into Percentages

Python

Lambda | Percentages | Pipe

109

How to Scale Data Using Standard Scaler But Keep Column Names

Python

Scaler | Standard

79

LightGBM Hyperparameter Tuning with GridSearch

Python

Python | LightGBM | Hyperparameter tuning | Gridsearch

77

Calculating Root Mean Squared Error (RMSE) with Sklearn and Python

Python

77

Dynamically Create Columns in Pandas Dataframe

Python

Pandas | Columns

61

How to Train a Catboost Classifier with GridSearch Hyperparameter Tuning

Python

Catboost | Hyperparameter tuning

60

Overwriting a Python Logging File Instead of Appending

Python

55