大模型Agent与RLHF论文 - 猎人搜索 轻松搜寻全网资源

  • file:多模态大模型直播群.png
  • file:微软最全综述:Multimodal Foundation Models From Specialists to General-Purpose Assistants.pdf
  • file:首篇综述:A Survey on Multimodal Large Language Models.pdf
  • file:WebGPT Browser-assisted question-answering with human feedback.pdf
  • file:Visual ChatGPT Talking, Drawing and Editing with Visual Foundation Models.pdf
  • file:Training language models to follow instructions with human feedback.pdf
  • file:Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback.pdf
  • file:Teaching language models to support answers with verified quotes.pdf
  • file:Scaling Laws for Reward Model Overoptimization.pdf
  • file:Scalable agent alignment via reward modeling a research direction.pdf
  • file:Reward learning from human preferences and demonstrations in Atari.pdf
  • file:Revisiting the Weaknesses of Reinforcement Learning for Neural Machine Translation.pdf
  • file:Red Teaming Language Models to Reduce Harms Methods, Scaling Behaviors, and Lessons Learned.pdf
  • file:Recursively Summarizing Books with Human Feedback.pdf
  • file:You Only Look at Screens Multimodal Chain-of-Action Agents.pdf
  • file:Voyager An open-ended embodied agent with large language models.pdf
  • file:Trustworthy LLMs a Survey and Guideline for Evaluating Large Language Models' Alignment.pdf
  • file:TPTU Task Planning and Tool Usage of Large Language Model-based AI Agents.pdf
  • file:Towards More Human-Like AI Communication.pdf
  • file:Towards a unified agent with foundation models.pdf
  • file:ToolLLM Facilitating large language models to master 16000+ real-world apis.pdf
  • file:Toolformer Language models can teach themselves to use tools.pdf
  • file:Steve-Eye Equipping LLM-based Embodied Agents with Visual Perception in Open Worlds.pdf
  • file:Self-Alignment with Instruction Backtranslation.pdf
  • file:SeamlessM4T-Massively Multilingual & Multimodal Machine Translation.pdf
  • file:WeMM-main.zip
  • file:VisualGLM-6B-main.zip
  • file:LLaMA2-Accessory-main.zip
  • file:Qwen-VL-master.zip
  • file:UnifiedMultimodalInstructionTuning-main.zip
  • file:Multimodal-GPT-main.zip
  • file:LRV-Instruction-main.zip
  • file:You Need Multiple Exiting Dynamic Early Exiting for.pdf
  • file:VLMO Unified Vision-Language Pre-Training with.pdf
  • file:Unified Vision-Language Pre-Training for Image Captioning and VQA.pdf
  • file:UNIFIED VISION AND LANGUAGE PROMPT LEARNING.pdf
  • file:Pro-tuning Unified Prompt Tuning for Vision Tasks.pdf
  • file:BLIP Bootstrapping Language-Image Pre-training for.pdf
分享时间 2023-11-06
入库时间 2024-08-11
状态检测 有效
资源类型 BDY
分享用户 胖**可爱
资源有问题? 点此举报

相似推荐

  • 大模型Agent与RLHF论文
  • 九天菜菜-大模型与Agent开发实战
  • 极客时间-AI大模型微调训练营第0期[完结]
  • 迪哥《2024Ai必会Agent精讲班 (应用解读+项目实战) 》
  • 九天菜菜-大模型与Agent开发实战
  • 迪哥《2024Ai必会Agent精讲班 (应用解读+项目实战) 》
  • JK-企业级Agents开发实战营第1期(极客)
  • AI Agent智能应用从0到1(应用解读+项目实战)
  • 极客时间《彭靖田AI大模型微调训练营》
  • 极客时间《彭靖田AI大模型微调训练营》

用户其它资源

  • 大模型Agent与RLHF论文

最新资源

  • 853日本推理书单(2)
  • 829微信热门网文书单(100部)(1)
  • 830央视荐读书单(40部)(1)
  • 831早起书单(1)
  • 832豆瓣热门图书榜书单(2025年5月)(1)
  • 833人民日报大地书单(25部)(1)
  • 705思维习惯书单
  • 828二零二五热门传记书单(20部)
  • 国学大师费勇课程合集+作品合集
  • 熊逸 课程【合集】