Word Mover’s Distance in Python

Category: python
#word embeddings #text classification #earth mover's distance

In document classification and other natural language processing applications, a good measure of the similarity between two texts can be a valuable building block. Ideally, such a measure would capture semantic information. Cosine similarity on bag-of-words vectors is known to do well in practice, but by construction it cannot recognize when two documents say the same thing in completely different words. …
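To make the limitation concrete, here is a minimal sketch using only the standard library. The two example sentences, the tiny `STOPWORDS` set, and the helper names `bag_of_words` and `cosine_similarity` are illustrative assumptions, not code from the post.

```python
import math
from collections import Counter

# Tiny illustrative stopword list (an assumption, not a standard one).
STOPWORDS = {"the", "to", "in"}


def bag_of_words(text):
    """Lowercase, split on whitespace, drop stopwords, count the rest."""
    return Counter(w for w in text.lower().split() if w not in STOPWORDS)


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse word-count vectors."""
    dot = sum(count * b[word] for word, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


doc1 = "Obama speaks to the media in Illinois"
doc2 = "The President greets the press in Chicago"

# The two sentences mean nearly the same thing but share no content words,
# so their bag-of-words vectors are orthogonal.
similarity = cosine_similarity(bag_of_words(doc1), bag_of_words(doc2))
print(similarity)
```

Because the two bags of words have no word in common, the dot product is zero and the measure treats the sentences as completely unrelated.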

Kemeny-Young Optimal Rank Aggregation in Python

Category: python
#python #rank aggregation #kendall #tau #kemeny #kemeny-young #voting theory

Rank aggregation is a problem with many important applications, and naive approaches to it go wrong in subtle ways. Let’s say that your national Quidditch league is dominated by five major wizard sports newspapers. Yes, the ones with moving images and everything. Every week after the games, each of them publishes a ranking of the star players. For now, let’s suppose that the set of players being ranked is always the same, as the problem becomes a bit more complicated otherwise. …
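As a rough illustration of the setting, the sketch below aggregates five rankings of the same four players by brute force: it tries every possible ordering and keeps the one with the smallest total Kendall tau distance to the published rankings, which is the Kemeny-Young criterion. The player labels, the sample rankings, and the function names are hypothetical, and exhaustive search is only feasible for a handful of players; it is not necessarily the method the post goes on to use.

```python
from itertools import combinations, permutations


def kendall_tau_distance(rank_a, rank_b):
    """Number of item pairs that the two rankings order differently."""
    pos_a = {item: i for i, item in enumerate(rank_a)}
    pos_b = {item: i for i, item in enumerate(rank_b)}
    return sum(
        1
        for x, y in combinations(rank_a, 2)
        if (pos_a[x] - pos_a[y]) * (pos_b[x] - pos_b[y]) < 0
    )


def kemeny_young_brute_force(rankings):
    """Ordering that minimizes the summed Kendall tau distance to all rankings.

    Tries every permutation, so the cost grows factorially with the number
    of items.
    """
    candidates = rankings[0]
    best = min(
        permutations(candidates),
        key=lambda perm: sum(kendall_tau_distance(perm, r) for r in rankings),
    )
    return list(best)


# Hypothetical weekly rankings from five newspapers, best player first.
rankings = [
    ["A", "B", "C", "D"],
    ["A", "B", "C", "D"],
    ["B", "A", "C", "D"],
    ["A", "C", "B", "D"],
    ["A", "B", "D", "C"],
]

consensus = kemeny_young_brute_force(rankings)
print(consensus)  # ['A', 'B', 'C', 'D']
```

In this sample every pairwise comparison has a clear majority consistent with A, B, C, D, so that ordering is the unique minimizer, at a total distance of 3.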

BibTeX-powered publications list for Pelican with pelican-bibtex

Category: python
#bibtex #blog #citations #pelican #publications #pybtex #references #static blog #static website #Uncategorized

Hook

Wouldn’t you like to manage your academic publications list easily within your static website, without resorting to external services or to software like bibtex2html, which is very nice but whose output then needs restyling to fit your templates?

Look no further: with the help of pelican-bibtex …