Updated December 17

Model Inference Engineer (PC & Android)

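Salary: negotiable

Location: Haidian District, Beijing
Experience: 5-10 years
Education: Master's degree
Employment type: Full-time
Openings: 1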

Job Description

Tags: Large models

We are looking for a senior-level engineer to focus on high-performance model inference across PC and Android platforms. The role centers on optimizing LLM/multimodal models for low latency and efficient memory use, implementing C++ runtimes, applying advanced acceleration techniques, and collaborating closely with research teams to bring optimized inference solutions into production environments.
Key Responsibilities
• Design and implement optimized model inference pipelines for PC (x86/AMD/Intel) and Android (ARM).
• Apply quantization, operator/kernel fusion, memory optimization, and runtime scheduling techniques.
• Work with at least one major inference stack: llama.cpp, Qualcomm AI SDKs (QNN/QAIRT/QSDK), or MTK NeuroPilot. Experience with OpenVINO, Ryzen AI, or other inference SDKs is a plus.
• Profile and tune CPU/GPU/NPU performance using industry-standard profiling tools.
• Collaborate with model researchers to translate new methods into efficient runtime implementations.
Required Qualifications
• Master’s degree or above, with 3+ years of experience in model inference, runtime engineering, or performance optimization.
• Strong C++ programming skills; familiarity with Android NDK/JNI is a plus.
• Solid understanding of transformer architectures, inference mechanisms, and acceleration methods.
• Hands-on experience with at least one of: llama.cpp, Qualcomm AI SDKs (QNN/QAIRT/QSDK), or MTK NeuroPilot. Experience with OpenVINO, Ryzen AI, or other inference SDKs is a plus.
• Ability to read English technical papers and documentation; English communication skills preferred.
Preferred
• Experience with ONNX Runtime, TVM, XNNPACK, or mobile performance tools.
• Contributions to open-source inference or optimization frameworks.

Work Location

Lenovo Global Headquarters, No. 10 Xibeiwang East Road, Haidian District, Beijing

Posted By

Ms. Fang / Recruiting PMO

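Lenovo (Beijing) Co., Ltd.

Lenovo (HKSE: 992) (ADR: LNVGY) is a global technology company with annual revenue of US$70 billion, ranked 159th on the Fortune Global 500, with 75,000 employees worldwide serving millions of customers in 180 markets. To realize the company vision of "Smarter Technology for All", we continue to strengthen our position as the global leader in personal computers while expanding into new growth areas such as infrastructure, mobile phones, solutions, and services. By firmly executing our intelligent transformation strategy and continuing to develop world-changing innovation and technology, we are building a more inclusive, trustworthy, and sustainable digital future for hundreds of millions of consumers around the world. Visit Lenovo's official website at https://www.lenovo.com, and follow the official "Lenovo Group" accounts on Weibo, WeChat, and other social media for the latest news.

To seize the industrial upgrading opportunity presented by the new wave of intelligent transformation, Lenovo has put forward an intelligent transformation strategy built around three directions: Smart IoT, Smart Infrastructure, and Smart Verticals, with the aim of becoming a leader and enabler of intelligent transformation across industries. In fiscal year 2020/2021, Lenovo further expanded and upgraded its services business, using a services- and solutions-led approach to deepen the transformation, and aims to make services and solutions a new core competency within the next ten years. Lenovo's core business currently consists of three business groups: the Intelligent Devices Group (IDG), focused on Smart IoT; the Infrastructure Solutions Group (ISG), focused on smart infrastructure; and the Solutions and Services Group (SSG), focused on industry intelligence and services. Lenovo Group is committed to sustainable business growth and long-term endurance through continuous innovation, operational excellence, and a global footprint.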
