Batch-Running Recommendations¶

The lenskit.batch module contains support for batch-running recommender and predictor algorithms. This is often used as part of a recommender evaluation experiment.

Recommendation¶

lenskit.batch.recommend(algo, model, users, n, candidates, ratings=None, nprocs=None)¶

Batch-recommend for multiple users. The provided algorithm should be a algorithms.Recommender or algorithms.Predictor (which will be converted to a top-N recommender).

Parameters:

algo – the algorithm
model – The algorithm model
users (array-like) – the users to recommend for
n (int) – the number of recommendations to generate (None for unlimited)
candidates – the users’ candidate sets. This can be a function, in which case it will be passed each user ID; it can also be a dictionary, in which case user IDs will be looked up in it.
ratings (pandas.DataFrame) – if not None, a data frame of ratings to attach to recommendations when available.

Returns:

A frame with at least the columns user, rank, and item; possibly also score, and any other columns returned by the recommender.

Rating Prediction¶

lenskit.batch.predict(algo, pairs, model=None, nprocs=None)¶

Generate predictions for user-item pairs. The provided algorithm should be a algorithms.Predictor or a function of two arguments: the user ID and a list of item IDs. It should return a dictionary or a pandas.Series mapping item IDs to predictions.

Parameters:	or (predictor(callable) – py:class:algorithms.Predictor): a rating predictor function or algorithm. pairs (pandas.DataFrame) – a data frame of (`user`, `item`) pairs to predict for. If this frame also contains a `rating` column, it will be included in the result. model (any) – a model for the algorithm.
Returns:	a frame with columns `user`, `item`, and `prediction` containing the prediction results. If `pairs` contains a rating column, this result will also contain a rating column.
Return type:	pandas.DataFrame

Scripting Evaluation¶

class lenskit.batch.MultiEval(path, predict=True, recommend=100, candidates=<class 'lenskit.topn.UnratedCandidates'>, nprocs=None)¶

A runner for carrying out multiple evaluations, such as parameter sweeps.

Parameters:

path (str or pathlib.Path) – the working directory for this evaluation. It will be created if it does not exist.
predict (bool) – whether to generate rating predictions.
recommend (int) – the number of recommendations to generate per user (None to disable top-N).
candidates (function) – the default candidate set generator for recommendations. It should take the training data and return a candidate generator, itself a function mapping user IDs to candidate sets.

add_algorithms(algos, parallel=False, attrs=[], **kwargs)¶

Add one or more algorithms to the run.

Parameters:	algos (algorithm or list) – the algorithm(s) to add. parallel (bool) – if `True`, allow this algorithm to be trained in parallel with others. attrs (list of str) – a list of attributes to extract from the algorithm objects and include in the run descriptions. kwargs – additional attributes to include in the run descriptions.

add_datasets(data, name=None, candidates=None, **kwargs)¶

Add one or more datasets to the run.

Parameters:	data – the input data set(s) to run. Can be one of the followin: A tuple of (train, test) data. An iterable of (train, test) pairs, in which case the iterable is not consumed until it is needed. A function yielding either of the above, to defer data load until it is needed. kwargs – additional attributes pertaining to these data sets.

run()¶: Run the evaluation.