No task available with custom recipe for text classification

kunalr · September 15, 2021, 4:42am

Hi,
I am using a custom recipe for multi-label text classification.
But I am getting the message "No tasks available" after a few annotations.
It only works again if I restart the instance server.
I am using Prodigy version 1.10.8.

I am fetching the data from an input JSONL file.
Command:

PRODIGY_LOGGING=basic  prodigy article_cat articles5 articles_part_5.jsonl -F recipe.py

This looks similar to: https://support.prodi.gy/t/no-tasks-available-for-non-active-learning-classification/2067

Could you please check my recipe?

import prodigy
from prodigy.components.loaders import JSONL


@prodigy.recipe(
    "article_cat",
    dataset=("The dataset to save to", "positional", None, str),
    file_path=("Path to texts", "positional", None, str),
)
def article_cat(dataset, file_path):
    """Annotate the sentiment of texts using different mood options."""
    stream = JSONL(file_path)  # load in the JSONL file
    stream = add_options(stream)  # add options to each task
    blocks = [
        {"view_id": "html"},
        {"view_id": "text"},
        {"view_id": "choice", "text": None, "html": None},
    ]
    return {
        "dataset": dataset,  # save annotations in this dataset
        "view_id": "blocks",  # set the view_id to "blocks"
        "stream": list(stream),
        "config": {
            "blocks": blocks,  # add the blocks to the config
        },
    }


def add_options(stream):
    # Helper function to add options to every task in a stream
    options = [
        {"id": "1", "text": "A"},
        {"id": "2", "text": "B"},
        {"id": "3", "text": "C"},
        {"id": "4", "text": "D"},
    ]
    # I have a few more labels

    for task in stream:
        task["options"] = options
        yield task

ines · September 16, 2021, 1:30am

Hi! Is there anything in your prodigy.json and/or are you using multi-user sessions?

kunalr · September 16, 2021, 6:37am

I am not using multi-user.
However, I am running 5 different Prodigy instances with the same recipe.py script and different input data files (and different output datasets as well) on the same machine.
Here is my prodigy.json:

{
  "theme": "basic",
  "buttons": ["accept", "undo"],
  "custom_theme": {"largeText": 18, "mediumText": 16, "smallText": 14, "cardMinWidth": 300, "cardMaxWidth": 1400, "cardMinHeight": 200, "buttonSize": 50, "relationHeight": 130, "relationHeightWrap": 40},
  "batch_size": 8,
  "history_size": 8,
  "host": "*",
  "cors": true,
  "db": "sqlite",
  "db_settings": {},
  "api_keys": {},
  "validate": true,
  "auto_exclude_current": true,
  "instant_submit": false,
  "feed_overlap": false,
  "ui_lang": "en",
  "project_info": ["dataset", "session", "lang", "recipe_name", "view_id", "label"],
  "show_stats": true,
  "hide_meta": false,
  "instructions": "instructions.html",
  "swipe": false,
  "swipe_gestures": {"left": "accept", "right": "reject"},
  "split_sents_threshold": false,
  "global_css": ".prodigy-content { text-align: left }",
  "javascript": null,
  "writing_dir": "ltr",
  "show_whitespace": false,
  "choice_style": "multiple",
  "auto_count_stream": true,
  "total_examples_target": 2233,
  "show_flag": true
}

ines · September 20, 2021, 6:10am

This all looks reasonable! Just one quick comment: the auto_count_stream and total_examples_target settings were both only introduced in v1.11, so they won't have any effect in v1.10. So if you want to use them, you should upgrade to v1.11 – if you can, this would be interesting to try in a separate environment to see if it solves the problem you're seeing.
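
One way to spot settings like these is to compare the keys in prodigy.json against the installed version. The following is only a minimal sketch and not part of Prodigy: the helper names `parse_version` and `settings_without_effect` are hypothetical, the key list contains just the two settings mentioned above, and it assumes a plain numeric version string such as "1.10.8".

```python
import json

# Settings mentioned above as introduced in v1.11 (not an exhaustive list).
V1_11_SETTINGS = {"auto_count_stream", "total_examples_target"}


def parse_version(version):
    """Turn a version string like '1.10.8' into a comparable tuple (1, 10, 8)."""
    return tuple(int(part) for part in version.split(".")[:3])


def settings_without_effect(config, version):
    """Return the v1.11-only keys in config that an older version would ignore."""
    if parse_version(version) >= (1, 11):
        return []
    return sorted(key for key in config if key in V1_11_SETTINGS)


if __name__ == "__main__":
    # Hypothetical excerpt of a prodigy.json, inlined to keep the sketch self-contained.
    config = json.loads(
        '{"batch_size": 8, "auto_count_stream": true, "total_examples_target": 2233}'
    )
    print(settings_without_effect(config, "1.10.8"))
```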

I've tried out your recipe with the same settings and some random data file and I can't seem to reproduce the problem. Some things to check on your end:

What's in the input JSONL files? Do they contain duplicates? How many examples are in them? Do you see "No tasks available" at the beginning of the file or do you actually hit the end? (Maybe you want to set "force_stream_order": true so that refreshing the browser doesn't request the next batch? This only makes sense if you only have one user per instance, though.)
Since you're running multiple instances, do you have enough memory?
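
The first two questions (how many examples, and whether there are duplicates) can be answered with a few lines of plain Python. This is a rough sketch under some assumptions: each line of the file is a JSON object with a "text" key, and two examples count as duplicates if their "text" values are identical. Prodigy itself decides what to exclude based on its own hashes, so this is only an approximation, and the helper name `summarize_jsonl` is made up for illustration.

```python
import json
from collections import Counter


def summarize_jsonl(path, key="text"):
    """Count examples and repeated values of `key` in a JSONL file."""
    counts = Counter()
    total = 0
    with open(path, encoding="utf8") as f:
        for line in f:
            line = line.strip()
            if not line:  # skip blank lines
                continue
            task = json.loads(line)
            total += 1
            counts[task.get(key)] += 1
    # Keep only the values that occur more than once.
    duplicates = {value: n for value, n in counts.items() if n > 1}
    return {"total": total, "unique": len(counts), "duplicates": duplicates}
```

For the command above, this could be called as `summarize_jsonl("articles_part_5.jsonl")`. If "unique" is much smaller than "total", the stream may run out earlier than the raw line count suggests.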

sangdeep · September 28, 2021, 2:59pm

No, this is the first time I've posted here.
Many thanks, "force_stream_order": true solved the problem. Could you please add this to the documentation? I noticed that it's not mentioned there: Installation & Setup · Prodigy · An annotation tool for AI, Machine Learning & NLP

kunalr · October 6, 2021, 4:20am

Hi Ines,

I tried adding "force_stream_order": true to my prodigy.json.
However, the same error, "No task available", still comes up after a few annotations.

This is now the 4th time I have had to restart the instance after a complaint from clients.
Could you please help with this?

Regards,
KJR

kunalr · October 6, 2021, 4:24am

Even after restarting the instance, I am now getting the same issue.

ines · October 6, 2021, 10:43am

Can you share some more details on the data you're using? How large is your data file and did you confirm that it still includes examples that are not yet present in your dataset? Does your data contain duplicates that could be excluded?
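
To check whether the input file still contains examples that aren't in the dataset yet, one option is to export the annotations (for example with `prodigy db-out articles5 > annotated.jsonl`, where the output file name is just a placeholder) and compare the two files. The sketch below assumes both files have a "text" key on every line and compares on that value only. Prodigy compares examples by its own hashes, so treat the result as an estimate. The function names are hypothetical.

```python
import json


def read_texts(path, key="text"):
    """Yield the value of `key` for every non-empty line in a JSONL file."""
    with open(path, encoding="utf8") as f:
        for line in f:
            if line.strip():
                yield json.loads(line).get(key)


def count_remaining(input_path, annotated_path, key="text"):
    """Count input examples whose `key` value does not appear in the export."""
    annotated = set(read_texts(annotated_path, key))
    return sum(1 for text in read_texts(input_path, key) if text not in annotated)
```

A call like `count_remaining("articles_part_5.jsonl", "annotated.jsonl")` returning 0 would suggest the stream really is exhausted, while a large number would point to a problem elsewhere.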
