2022年被引次数最多的AI论文列表
本表是Zeta Alpha收集的2022年AI领域被引次数最多的论文列表。关于论文作者和单位分析,请参考:
排序 | 论文名 | 被引次数 | 发表机构 | 国家或地区 | 机构类型 |
---|---|---|---|---|---|
1 | AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models | 1331 | European Molecular Biology Laboratory | - | Academia |
2 | ColabFold: making protein folding accessible to all | 1138 | Max Planck Institute for Multidisciplinary Sciences | Germany | Academia |
3 | A ConvNet for the 2020s | 835 | Meta, UC Berkeley | USA, USA | Industry, Academia |
4 | Hierarchical Text-Conditional Image Generation with CLIP Latents | 718 | OpenAI | USA | Industry |
5 | PaLM: Scaling Language Modeling with Pathways | 426 | USA | Industry | |
6 | Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding | 390 | USA | Industry | |
7 | Instant Neural Graphics Primitives with a Multiresolution Hash Encoding | 342 | NVIDIA | USA | Industry |
8 | SignalP 6.0 predicts all five types of signal peptides using protein language models | 274 | Technical University of Denmark, ETH Zurich | Denmark, Switzerland | Academia, Academia |
9 | Swin Transformer V2: Scaling Up Capacity and Resolution | 266 | University of Science and Technology of China | China | Academia |
10 | Training language models to follow instructions with human feedback | 254 | OpenAI | USA | Industry |
11 | Chain of Thought Prompting Elicits Reasoning in Large Language Models | 224 | USA | Industry | |
12 | Flamingo: a Visual Language Model for Few-Shot Learning | 218 | DeepMind | UK | Industry |
13 | Classifier-Free Diffusion Guidance | 194 | USA | Industry | |
14 | Magnetic control of tokamak plasmas through deep reinforcement learning | 194 | DeepMind | UK | Industry |
15 | data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language | 191 | Meta | USA | Industry |
16 | OPT: Open Pre-trained Transformer Language Models | 187 | Meta | USA | Industry |
17 | BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation | 184 | Salesforce | USA | Industry |
18 | A Generalist Agent | 180 | DeepMind | UK | Industry |
19 | LaMDA: Language Models for Dialog Applications | 180 | USA | Industry | |
20 | CMT: Convolutional Neural Networks Meet Vision Transformers | 172 | University of Sydney | Australia | Academia |
21 | Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model | 158 | Microsoft | USA | Industry |
22 | What Makes Good In-Context Examples for GPT-3? | 157 | Duke University | USA | Academia |
23 | Ensemble unsupervised autoencoders and Gaussian mixture model for cyberattack detection | 145 | Ningbo University of Technology | China | Academia |
24 | Training Compute-Optimal Large Language Models | 144 | DeepMind | UK | Industry |
25 | Learning robust perceptive locomotion for quadrupedal robots in the wild | 141 | ETH Zurich | Switzerland | Academia |
26 | Do As I Can, Not As I Say: Grounding Language in Robotic Affordances | 135 | USA | Industry | |
27 | How Do Vision Transformers Work? | 129 | Yonsei University, NAVER | South Korea, South Korea | Academia, Industry |
28 | Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs | 127 | Tsinghua University | China | Academia |
29 | Large Language Models are Zero-Shot Reasoners | 124 | University of Tokyo | Japan | Academia |
30 | Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time | 122 | University of Washington | USA | Academia |
31 | Patches Are All You Need? | 116 | Carnegie Mellon University | USA | Academia |
32 | Competition-Level Code Generation with AlphaCode | 113 | DeepMind | UK | Industry |
33 | TensoRF: Tensorial Radiance Fields | 110 | ShanghaiTech University | China | Academia |
34 | Video Diffusion Models | 103 | USA | Industry | |
35 | Data Analytics for the Identification of Fake Reviews Using Supervised Learning | 102 | Dr. Babasaheb Ambedkar Marathwada University | India | Academia |
36 | Visual Prompt Tuning | 102 | Cornell University | USA | Academia |
37 | DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection | 100 | Hong Kong University of Science and Technology | Hong Kong | Academia |
38 | VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training | 100 | Nanjing University, Tencent | China, China | Academia, Industry |
39 | Rethinking the Role of Demonstrations: What Makes In-Context Learning Work? | 99 | University of Washington, Meta | USA, USA | Academia, Industry |
40 | BEVFormer: Learning Bird’s-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers | 96 | Nanjing University, Shanghai AI Lab | China, China | Academia, Academia |
41 | Conditional Prompt Learning for Vision-Language Models | 93 | Nanyang Technological University | Singapore | Academia |
42 | Fine-Tuning can Distort Pretrained Features and Underperform Out-of-Distribution | 93 | Stanford University | USA | Academia |
43 | Measuring and Improving the Use of Graph Information in Graph Neural Networks | 93 | Chinese University of Hong Kong | Hong Kong | Academia |
44 | Exploring Plain Vision Transformer Backbones for Object Detection | 91 | Meta | USA | Industry |
45 | GeoDiff: a Geometric Diffusion Model for Molecular Conformation Generation | 90 | Mila, University of Montreal | Canada, Canada | Academia, Academia |
46 | OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework | 88 | Alibaba Group | China | Industry |
47 | Block-NeRF: Scalable Large Scene Neural View Synthesis | 86 | UC Berkeley | USA | Academia |
48 | Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents | 86 | UC Berkeley | USA | Academia |
49 | Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models | 81 | University of Notre Dame | USA | Academia |
50 | Outracing champion Gran Turismo drivers with deep reinforcement learning | 80 | Sony | Japan | Industry |
51 | BC-Z: Zero-Shot Task Generalization with Robotic Imitation Learning | 77 | USA | Industry | |
52 | DN-DETR: Accelerate DETR Training by Introducing Query DeNoising | 74 | Hong Kong University of Science and Technology | Hong Kong | Academia |
53 | Equivariant Diffusion for Molecule Generation in 3D | 73 | University of Amsterdam | Netherlands | Academia |
54 | Swin UNETR: Swin Transformers for Semantic Segmentation of Brain Tumors in MRI Images | 73 | NVIDIA | USA | Industry |
55 | GPT-NeoX-20B: An Open-Source Autoregressive Language Model | 72 | EleutherAI | - | Industry |
56 | Online reinforcement learning multiplayer non-zero sum games of continuous-time Markov jump linear systems | 72 | Anhui University | China | Academia |
57 | Detecting Twenty-thousand Classes using Image-level Supervision | 70 | Meta | USA | Industry |
58 | Image fusion in the loop of high-level vision tasks: A semantic-aware real-time infrared and visible image fusion network | 68 | Wuhan University | China | Academia |
59 | LAION-5B: An open large-scale dataset for training next generation image-text models | 66 | LAION | Germany | Industry |
60 | Denoising Diffusion Restoration Models | 65 | Technion | Israel | Academia |
61 | VQGAN-CLIP: Open Domain Image Generation and Editing with Natural Language Guidance | 64 | EleutherAI | - | Industry |
62 | CLIP-NeRF: Text-and-Image Driven Manipulation of Neural Radiance Fields | 63 | City University of Hong Kong | Hong Kong | Academia |
63 | Solving Quantitative Reasoning Problems with Language Models | 63 | USA | Industry | |
64 | Masked Autoencoders As Spatiotemporal Learners | 61 | Meta | USA | Industry |
65 | Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language | 59 | USA | Industry | |
66 | ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and Beyond | 59 | University of Sydney | Australia | Academia |
67 | Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language Tasks | 58 | Microsoft | USA | Industry |
68 | Language-driven Semantic Segmentation | 57 | Cornell University | USA | Academia |
69 | Vision-Language Pre-Training with Triple Contrastive Learning | 56 | University of Texas at Arlington | USA | Academia |
70 | Deep Reinforcement Learning-Based Path Control and Optimization for Unmanned Ships | 55 | Tongji University | China | Academia |
71 | EquiBind: Geometric Deep Learning for Drug Binding Structure Prediction | 54 | MIT | USA | Academia |
72 | Omnivore: A Single Model for Many Visual Modalities | 54 | Meta | USA | Industry |
73 | Quantifying Memorization Across Neural Language Models | 54 | USA | Industry | |
74 | DeepFusion: Lidar-Camera Deep Fusion for Multi-Modal 3D Object Detection | 53 | Johns Hopkins University | USA | Academia |
75 | Genetic Algorithm-Based Trajectory Optimization for Digital Twin Robots | 53 | Wuhan University of Science and Technology | China | Academia |
76 | Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors | 53 | Meta | USA | Industry |
77 | Discovering faster matrix multiplication algorithms with reinforcement learning | 52 | DeepMind | UK | Industry |
78 | DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation | 52 | Google, Boston University | USA, USA | Industry, Academia |
79 | PETR: Position Embedding Transformation for Multi-View 3D Object Detection | 52 | Megvii | China | Industry |
80 | Protein structure predictions to atomic accuracy with AlphaFold | 51 | DeepMind | UK | Industry |
81 | ABAW: Valence-Arousal Estimation, Expression Recognition, Action Unit Detection & Multi-Task Learning Challenges | 50 | Queen Mary University of London | UK | Academia |
82 | HumanNeRF: Free-viewpoint Rendering of Moving People from Monocular Video | 50 | University of Washington | USA | Academia |
83 | UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models | 49 | University of Hong Kong | Hong Kong | Academia |
84 | A Systematic Evaluation of Large Language Models of Code | 48 | Carnegie Mellon University | USA | Academia |
85 | Robust Speech Recognition via Large-Scale Weak Supervision | 48 | OpenAI | USA | Industry |
86 | Diffusion Models: A Comprehensive Survey of Methods and Applications | 47 | Peking University | China | Academia |
87 | Can language models learn from explanations in context? | 46 | DeepMind | UK | Industry |
88 | NELA-GT-2021: A Large Multi-Labelled News Dataset for The Study of Misinformation in News Articles | 46 | Rensselaer Polytechnic Institute | USA | Academia |
89 | ActionFormer: Localizing Moments of Actions with Transformers | 44 | Nanjing University, 4Paradigm Inc. | China, China | Academia, Industry |
90 | Least-to-Most Prompting Enables Complex Reasoning in Large Language Models | 44 | USA | Industry | |
91 | Diffusion-LM Improves Controllable Text Generation | 43 | Stanford University | USA | Academia |
92 | Overview of The Shared Task on Homophobia and Transphobia Detection in Social Media Comments | 41 | National University of Ireland Galway | Ireland | Academia |
93 | Text and Code Embeddings by Contrastive Pre-Training | 40 | OpenAI | USA | Industry |
94 | Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality | 40 | Hugging Face | USA | Industry |
95 | BLOOM: A 176B-Parameter Open-Access Multilingual Language Model | 39 | BigScience Team | France | Industry |
96 | Red Teaming Language Models with Language Models | 39 | DeepMind, New York University | UK, USA | Industry, Academia |
97 | Transformer Memory as a Differentiable Search Index | 39 | USA | Industry | |
98 | Torsional Diffusion for Molecular Conformer Generation | 38 | MIT | USA | Academia |
99 | Unified Contrastive Learning in Image-Text-Label Space | 37 | Microsoft | USA | Industry |
100 | Benchmarking Generalization via In-Context Instructions on 1, 600+ Language Tasks | 36 | University of Washington | USA | Academia |
欢迎大家关注DataLearner官方微信,接受最新的AI技术推送
