[BUG] sar_movielens.ipynb - top_k = model.recommend_k_items(test, top_k=TOP_K, remove_seen=True) error #2015

lordaouy · 2023-10-09T20:23:56Z

Description

I got error when running sar_movielens.ipynb notebooks from azure machine learning compute instance

2023-10-09 20:08:29,498 INFO     Calculating recommendation scores
---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
Cell In[8], line 2
      1 with Timer() as test_time:
----> 2     top_k = model.recommend_k_items(test, top_k=TOP_K, remove_seen=True)
      4 print("Took {} seconds for prediction.".format(test_time.interval))

File /anaconda/envs/msftrecsys2/lib/python3.9/site-packages/recommenders/models/sar/sar_singlenode.py:533, in SARSingleNode.recommend_k_items(self, test, top_k, sort_top_k, remove_seen)
    520 def recommend_k_items(self, test, top_k=10, sort_top_k=True, remove_seen=False):
    521     """Recommend top K items for all users which are in the test set
    522 
    523     Args:
   (...)
    530         pandas.DataFrame: top k recommendation items for each user
    531     """
--> 533     test_scores = self.score(test, remove_seen=remove_seen)
    535     top_items, top_scores = get_top_k_scored_items(
    536         scores=test_scores, top_k=top_k, sort_top_k=sort_top_k
    537     )
    539     df = pd.DataFrame(
    540         {
    541             self.col_user: np.repeat(
   (...)
    546         }
    547     )

File /anaconda/envs/msftrecsys2/lib/python3.9/site-packages/recommenders/models/sar/sar_singlenode.py:346, in SARSingleNode.score(self, test, remove_seen)
    344 # calculate raw scores with a matrix multiplication
    345 logger.info("Calculating recommendation scores")
--> 346 test_scores = self.user_affinity[user_ids, :].dot(self.item_similarity)
    348 # ensure we're working with a dense ndarray
    349 if isinstance(test_scores, sparse.spmatrix):

File /anaconda/envs/msftrecsys2/lib/python3.9/site-packages/scipy/sparse/_base.py:411, in _spbase.dot(self, other)
    409     return self * other
    410 else:
--> 411     return self @ other

File /anaconda/envs/msftrecsys2/lib/python3.9/site-packages/scipy/sparse/_base.py:622, in _spbase.__matmul__(self, other)
    620 def __matmul__(self, other):
    621     if isscalarlike(other):
--> 622         raise ValueError("Scalar operands are not allowed, "
    623                          "use '*' instead")
    624     return self._mul_dispatch(other)

ValueError: Scalar operands are not allowed, use '*' instead

In which platform does it happen?

It happen on Azure ML compute instance

How do we replicate the issue?

This is setup script I use in AzureML :

# 1. Install gcc if it is not installed already. On Ubuntu, this could done by using the command
# sudo apt install gcc

# 2. Create and activate a new conda environment
conda create -n msftrecsys2 python=3.9.16
conda activate msftrecsys2

# 3. Install the core recommenders package. It can run all the CPU notebooks.
pip install recommenders
pip install recommenders[gpu]
# 4. create a Jupyter kernel
python -m ipykernel install --user --name msftrecsys2 --display-name msftrecsys2

# 5. Clone this repo within VSCode or using command line:
git clone https://github.com/recommenders-team/recommenders.git

this is my compute instance setting:

Expected behavior (i.e. solution)

Cell below should run without throwing error

with Timer() as test_time:
    top_k = model.recommend_k_items(test, top_k=TOP_K, remove_seen=True)

print("Took {} seconds for prediction.".format(test_time.interval))

Other Comments

The text was updated successfully, but these errors were encountered:

Petkomat · 2023-10-11T11:53:44Z

Faced the same problem. I think it's scipy related and that return np.array(result) lines at the end of similarity functions in recommenders/utils/python_utils.py should be changed to return result.toarray().

For example, the new version of jaccard should read as

def jaccard(cooccurrence):
    """<docstring>"""

    diag_rows, diag_cols = _get_row_and_column_matrix(cooccurrence.diagonal())

    with np.errstate(invalid="ignore", divide="ignore"):
        result = cooccurrence / (diag_rows + diag_cols - cooccurrence)

    # return np.array(result) # not this
    return result.toarray()

moflotas · 2023-12-04T08:33:21Z

@lordaouy I am not sure, but it might be problem with scipy itself
I solved it lowering the version from the latest to scipy==1.10.1 and it worked perfectly

SimonYansenZhao · 2024-02-22T12:53:00Z

See scipy/scipy#18796 and #1954

SimonYansenZhao · 2024-04-30T04:58:46Z

Resolved in PR #2083

lordaouy added the bug Something isn't working label Oct 9, 2023

SimonYansenZhao mentioned this issue Feb 24, 2024

Add support for Python 3.10 and 3.11 #1937

Merged

4 tasks

SimonYansenZhao closed this as completed Apr 30, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] sar_movielens.ipynb - top_k = model.recommend_k_items(test, top_k=TOP_K, remove_seen=True) error #2015

[BUG] sar_movielens.ipynb - top_k = model.recommend_k_items(test, top_k=TOP_K, remove_seen=True) error #2015

lordaouy commented Oct 9, 2023

Petkomat commented Oct 11, 2023 •

edited

moflotas commented Dec 4, 2023

SimonYansenZhao commented Feb 22, 2024 •

edited

SimonYansenZhao commented Apr 30, 2024

[BUG] sar_movielens.ipynb - top_k = model.recommend_k_items(test, top_k=TOP_K, remove_seen=True) error #2015

[BUG] sar_movielens.ipynb - top_k = model.recommend_k_items(test, top_k=TOP_K, remove_seen=True) error #2015

Comments

lordaouy commented Oct 9, 2023

Description

In which platform does it happen?

How do we replicate the issue?

Expected behavior (i.e. solution)

Other Comments

Petkomat commented Oct 11, 2023 • edited

moflotas commented Dec 4, 2023

SimonYansenZhao commented Feb 22, 2024 • edited

SimonYansenZhao commented Apr 30, 2024

Petkomat commented Oct 11, 2023 •

edited

SimonYansenZhao commented Feb 22, 2024 •

edited