Swift version: 5.6
Apple’s NaturalLanguage framework is able to lemmatize text for us, which is the process of converting words to the forms you would find in a dictionary – making plural nouns singular, finding the root forms of conjugated verbs, and so on, while also taking into account the context in which they are used.
To do this, first create an instance of NLTagger
enabling its .lemma
scheme, then call enumerateTags()
on it to find all the root word forms. This will pass you the tag (the root word) if it exists, plus the range of the original text in the string.
So, you could lemmatize a whole sentence like this:
import NaturalLanguage
let text = "This is text with plurals such as geese, people, and millennia."
let tagger = NLTagger(tagSchemes: [.lemma])
tagger.string = text
tagger.enumerateTags(in: text.startIndex..<text.endIndex, unit: .word, scheme: .lemma) { tag, range in
let stemForm = tag?.rawValue ?? String(text[range])
print(stemForm, terminator: "")
return true
}
Text lemmatized in this way will be lowercase, preserving any punctuation. So, that snippet will output “this be text with plural such as goose, person, and millennium.”
If you intend to lemmatize text frequently, consider making it an extension on String
like this:
extension String {
func lemmatized() -> String {
let tagger = NLTagger(tagSchemes: [.lemma])
tagger.string = self
var result = [String]()
tagger.enumerateTags(in: self.startIndex..<self.endIndex, unit: .word, scheme: .lemma) { tag, tokenRange in
let stemForm = tag?.rawValue ?? String(self[tokenRange])
result.append(stemForm)
return true
}
return result.joined()
}
}
With that in place you can now lemmatize text easily:
let text = "This is text with plurals such as geese, people, and millennia."
print(text.lemmatized())
TAKE YOUR SKILLS TO THE NEXT LEVEL If you like Hacking with Swift, you'll love Hacking with Swift+ – it's my premium service where you can learn advanced Swift and SwiftUI, functional programming, algorithms, and more. Plus it comes with stacks of benefits, including monthly live streams, downloadable projects, a 20% discount on all books, and free gifts!
Sponsor Hacking with Swift and reach the world's largest Swift community!
Available from iOS 12.0
This is part of the Swift Knowledge Base, a free, searchable collection of solutions for common iOS questions.
Link copied to your pasteboard.