Large-Model Agent and RLHF Papers - 猎人搜索: Easily Search Resources Across the Web

file:多模态大模型直播群.png (Multimodal large-model livestream group)
file:微软最全综述：Multimodal Foundation Models From Specialists to General-Purpose Assistants.pdf (Microsoft's most comprehensive survey)
file:首篇综述：A Survey on Multimodal Large Language Models.pdf (First survey)
file:WebGPT Browser-assisted question-answering with human feedback.pdf
file:Visual ChatGPT Talking, Drawing and Editing with Visual Foundation Models.pdf
file:Training language models to follow instructions with human feedback.pdf
file:Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback.pdf
file:Teaching language models to support answers with verified quotes.pdf
file:Scaling Laws for Reward Model Overoptimization.pdf
file:Scalable agent alignment via reward modeling a research direction.pdf
file:Reward learning from human preferences and demonstrations in Atari.pdf
file:Revisiting the Weaknesses of Reinforcement Learning for Neural Machine Translation.pdf
file:Red Teaming Language Models to Reduce Harms Methods, Scaling Behaviors, and Lessons Learned.pdf
file:Recursively Summarizing Books with Human Feedback.pdf
file:You Only Look at Screens Multimodal Chain-of-Action Agents.pdf
file:Voyager An open-ended embodied agent with large language models.pdf
file:Trustworthy LLMs a Survey and Guideline for Evaluating Large Language Models' Alignment.pdf
file:TPTU Task Planning and Tool Usage of Large Language Model-based AI Agents.pdf
file:Towards More Human-Like AI Communication.pdf
file:Towards a unified agent with foundation models.pdf
file:ToolLLM Facilitating large language models to master 16000+ real-world apis.pdf
file:Toolformer Language models can teach themselves to use tools.pdf
file:Steve-Eye Equipping LLM-based Embodied Agents with Visual Perception in Open Worlds.pdf
file:Self-Alignment with Instruction Backtranslation.pdf
file:SeamlessM4T-Massively Multilingual & Multimodal Machine Translation.pdf
file:WeMM-main.zip
file:VisualGLM-6B-main.zip
file:LLaMA2-Accessory-main.zip
file:Qwen-VL-master.zip
file:UnifiedMultimodalInstructionTuning-main.zip
file:Multimodal-GPT-main.zip
file:LRV-Instruction-main.zip
file:You Need Multiple Exiting Dynamic Early Exiting for.pdf
file:VLMO Unified Vision-Language Pre-Training with.pdf
file:Unified Vision-Language Pre-Training for Image Captioning and VQA.pdf
file:UNIFIED VISION AND LANGUAGE PROMPT LEARNING.pdf
file:Pro-tuning Unified Prompt Tuning for Vision Tasks.pdf
file:BLIP Bootstrapping Language-Image Pre-training for.pdf

Shared on	2023-11-06
Indexed on	2024-08-11
Status check	Valid
Resource type	BDY
Shared by	胖**可爱
