Source: arxiv | Overall 6.4/10 | Corroboration: 1
Signal 9.4
Novelty 4.0
Impact 2.0
Confidence 8.7
Actionability 8.2
Summary: arXiv:2605.20052v1 Announce Type: cross Abstract: Automatic report labeling facilitates the identification of clinical findings from unstructured text and enables large-scale.
- What happened: arXiv:2605.20052v1 Announce Type: cross Abstract: Automatic report labeling facilitates the identification of clinical findings from unstructured text and enables.
- Why it matters: arXiv:2605.20052v1 Announce Type: cross Abstract: Automatic report labeling facilitates the identification of clinical findings from unstructured text and enables.
- What to do: Validate with one small internal benchmark and compare against your current baseline this week.
Deep
Context
arXiv:2605.20052v1 Announce Type: cross Abstract: Automatic report labeling facilitates the identification of clinical findings from unstructured text and enables large-scale annotation for medical imaging research.
What's new
In this paper, we propose PromptRad, a knowledge-enhanced multi-label \textbf{prompt}-tuning approach for \textbf{rad}iology report labeling under low-resource settings.
Key details
- Existing rule-based labelers struggle with the diverse descriptions in clinical reports, while fine-tuning pre-trained language models (PLMs) requires large amounts of labeled data that are often unavailable in clinical settings.
- In this paper, we propose PromptRad, a knowledge-enhanced multi-label \textbf{prompt}-tuning approach for \textbf{rad}iology report labeling under low-resource settings.
- PromptRad reformulates multi-label classification as masked language modeling and incorporates synonyms from the UMLS Metathesaurus into a multi-word verbalizer to enrich category representations.
- By fine-tuning the PLM without additional classification layers, PromptRad requires substantially less labeled data than conventional fine-tuning.
Results & evidence
- arXiv:2605.20052v1 Announce Type: cross Abstract: Automatic report labeling facilitates the identification of clinical findings from unstructured text and enables large-scale annotation for medical imaging research.
- Experiments on liver CT reports show that PromptRad outperforms dictionary-based and fine-tuning baselines with only 32 labeled training examples, and achieves competitive performance with GPT-4 despite using a much smaller model.
- Computer Science > Computation and Language [Submitted on 19 May 2026] Title:PromptRad: Knowledge-Enhanced Multi-Label Prompt-Tuning for Low-Resource Radiology Report Labeling View PDF HTML (experimental)Abstract:Automatic report labeling facilitates the id...
Limitations / unknowns
- Generalization outside curated tasks is still unclear.
Next-step validation checks
- Reproduce one claim with a public baseline and fixed evaluation settings.
- Check robustness on out-of-distribution or long-context cases.
- Track whether independent teams report matching results.
Source: arxiv | Overall 6.2/10 | Corroboration: 1
Signal 9.4
Novelty 4.0
Impact 2.0
Confidence 8.7
Actionability 6.5
Summary: arXiv:2604.16503v2 Announce Type: replace-cross Abstract: Training strong video generation models usually requires massive datasets, large parameter counts, and substantial.
- What happened: arXiv:2604.16503v2 Announce Type: replace-cross Abstract: Training strong video generation models usually requires massive datasets, large parameter counts, and.
- Why it matters: arXiv:2604.16503v2 Announce Type: replace-cross Abstract: Training strong video generation models usually requires massive datasets, large parameter counts, and.
- What to do: Validate with one small internal benchmark and compare against your current baseline this week.
Deep
Context
arXiv:2604.16503v2 Announce Type: replace-cross Abstract: Training strong video generation models usually requires massive datasets, large parameter counts, and substantial compute.
What's new
First, Shared Cross-Attention strengthens text control when video token sequences become long.
Key details
- In this work, we ask whether strong text-to-video quality is possible at a much smaller budget: fewer than 10M clips and less than 100,000 H200 GPU hours.
- Our core claim is that part of the answer lies in how model capacity is organized, not only in how much of it is used.
- In video generation, prompt alignment, temporal consistency, and fine-detail recovery can interfere with one another when they are handled through the same pathway.
- Motif-Video 2B addresses this by separating these roles architecturally, rather than relying on scale alone.
Results & evidence
- arXiv:2604.16503v2 Announce Type: replace-cross Abstract: Training strong video generation models usually requires massive datasets, large parameter counts, and substantial compute.
- In this work, we ask whether strong text-to-video quality is possible at a much smaller budget: fewer than 10M clips and less than 100,000 H200 GPU hours.
- On VBench, Motif-Video~2B reaches 83.76\%, surpassing Wan2.1 14B while using 7$\times$ fewer parameters and substantially less training data.
Limitations / unknowns
- To make this design effective under a limited compute budget, we pair it with an efficient training recipe based on dynamic token routing and early-phase feature alignment to a frozen pretrained video encoder.
Next-step validation checks
- Reproduce one claim with a public baseline and fixed evaluation settings.
- Check robustness on out-of-distribution or long-context cases.
- Track whether independent teams report matching results.
Source: arxiv | Overall 6.2/10 | Corroboration: 1
Signal 9.4
Novelty 4.0
Impact 2.0
Confidence 8.7
Actionability 6.5
Summary: arXiv:2605.15336v2 Announce Type: replace-cross Abstract: In this report, we present HoloMotion-1, a humanoid motion foundation model for zero-shot whole-body motion tracking.
- What happened: Learning from such heterogeneous data introduces new challenges, including reconstruction noise, source-domain mismatch, uneven motion quality, and the need for temporal.
- Why it matters: To address these challenges, HoloMotion-1 integrates large-capacity temporal modeling, a sparsely activated Mixture-of-Experts Transformer with KV-cache inference for.
- What to do: Validate with one small internal benchmark and compare against your current baseline this week.
Deep
Context
Learning from such heterogeneous data introduces new challenges, including reconstruction noise, source-domain mismatch, uneven motion quality, and the need for temporal modeling under large behavioral variation.
What's new
Learning from such heterogeneous data introduces new challenges, including reconstruction noise, source-domain mismatch, uneven motion quality, and the need for temporal modeling under large behavioral variation.
Key details
- A key innovation of HoloMotion-1 is to scale control-policy training with a large-scale hybrid motion corpus, where video-reconstructed motions from in-the-wild videos provide the dominant source of motion diversity, while curated motion-capture and in-hous...
- This data regime enables HoloMotion-1 to move beyond conventional MoCap-only training and exposes the policy to substantially broader behaviors, capture conditions, and motion styles.
- Learning from such heterogeneous data introduces new challenges, including reconstruction noise, source-domain mismatch, uneven motion quality, and the need for temporal modeling under large behavioral variation.
- To address these challenges, HoloMotion-1 integrates large-capacity temporal modeling, a sparsely activated Mixture-of-Experts Transformer with KV-cache inference for real-time control, and a sequence-level training strategy that improves learning efficienc...
Results & evidence
- arXiv:2605.15336v2 Announce Type: replace-cross Abstract: In this report, we present HoloMotion-1, a humanoid motion foundation model for zero-shot whole-body motion tracking.
- A key innovation of HoloMotion-1 is to scale control-policy training with a large-scale hybrid motion corpus, where video-reconstructed motions from in-the-wild videos provide the dominant source of motion diversity, while curated motion-capture and in-hous...
- This data regime enables HoloMotion-1 to move beyond conventional MoCap-only training and exposes the policy to substantially broader behaviors, capture conditions, and motion styles.
Limitations / unknowns
- Generalization outside curated tasks is still unclear.
Next-step validation checks
- Reproduce one claim with a public baseline and fixed evaluation settings.
- Check robustness on out-of-distribution or long-context cases.
- Track whether independent teams report matching results.