<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:webfeeds="http://webfeeds.org/rss/1.0">
    <channel>
        <title><![CDATA[AI+X Global Talent Community]]></title>
        <description><![CDATA[AI+X Global Talent Community]]></description>
        <link>https://gtc.blendedlearn.org</link>
        <generator>Bettermode RSS Generator</generator>
        <lastBuildDate>Sat, 13 Jun 2026 08:56:45 GMT</lastBuildDate>
        <atom:link href="https://gtc.blendedlearn.org/rss/feed" rel="self" type="application/rss+xml"/>
        <pubDate>Sat, 13 Jun 2026 08:56:45 GMT</pubDate>
        <copyright><![CDATA[2026 AI+X Global Talent Community]]></copyright>
        <language><![CDATA[en-US]]></language>
        <ttl>60</ttl>
        <webfeeds:icon></webfeeds:icon>
        <webfeeds:related layout="card" target="browser"/>
        <item>
            <title><![CDATA[A Trace-Driven Study of Priority-Aware Tensor Prefetching for MobileNetV2 Inference on TCM-Like Edge AI Accelerators?]]></title>
            <description><![CDATA[Our team addressed the memory-movement bottleneck in edge AI inference, especially for lightweight CNN models such as MobileNetV2 running on accelerator-based systems with limited on-chip buffer ...]]></description>
            <link>https://gtc.blendedlearn.org/project-team-matching-uuc2fiio/post/a-trace-driven-study-of-priority-aware-tensor-prefetching-for-mobilenetv2-FZ1XLyabjsIZR6i</link>
            <guid isPermaLink="true">https://gtc.blendedlearn.org/project-team-matching-uuc2fiio/post/a-trace-driven-study-of-priority-aware-tensor-prefetching-for-mobilenetv2-FZ1XLyabjsIZR6i</guid>
            <category><![CDATA[AI ]]></category>
            <dc:creator><![CDATA[Hu Shengyuan]]></dc:creator>
            <pubDate>Fri, 12 Jun 2026 13:59:14 GMT</pubDate>
            <content:encoded><![CDATA[<p>Our team addressed the memory-movement bottleneck in edge AI inference, especially for lightweight CNN models such as MobileNetV2 running on accelerator-based systems with limited on-chip buffer capacity. Although MobileNetV2 reduces computation using depthwise and pointwise convolutions, frequent tensor movement between off-chip DRAM and on-chip memory can still cause latency and energy overhead. To address this problem, we developed a trace-driven memory scheduling simulator that compares different tensor-management policies, including LRU and a priority-aware scheduling policy. Our final Phase 12 priority policy preserves LRU-style activation reuse while adding conservative future-aware weight prefetching for upcoming compute layers. We evaluated the policy using MobileNetV2 layer traces, 3-layer window-based design space exploration, and selected buffer sizes such as 192 KB and 512 KB. The 192 KB case connects to a SCALE-Sim-like buffer configuration, while the 512 KB case demonstrates reduced DRAM read traffic for selected DepthwiseConv2d windows. We further validated selected DRAM-level behavior using Ramulator and DRAMPower. The impact of this project is that it shows how lightweight scheduling decisions can improve inference latency and, in selected cases, reduce DRAM-device energy. The study provides a practical methodology for analyzing tensor reuse, prefetch opportunities, and memory-energy tradeoffs in edge AI accelerators.</p>]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[Calorie Counting with Smart Glasses]]></title>
            <description><![CDATA[Problem: Estimating food calories manually is time consuming and often inaccurate because users must identify foods and estimate portion sizes themselves.

Solution: We developed an AI smart glasses ...]]></description>
            <link>https://gtc.blendedlearn.org/project-team-matching-uuc2fiio/post/calorie-counting-with-smart-glasses-Qkvv794XOMNDEIL</link>
            <guid isPermaLink="true">https://gtc.blendedlearn.org/project-team-matching-uuc2fiio/post/calorie-counting-with-smart-glasses-Qkvv794XOMNDEIL</guid>
            <category><![CDATA[Program Outcomes]]></category>
            <dc:creator><![CDATA[Advik]]></dc:creator>
            <pubDate>Fri, 12 Jun 2026 13:58:05 GMT</pubDate>
            <content:encoded><![CDATA[<h3 class="text-lg" data-toc-id="678d0db2-0e5e-487e-b968-ac066fe61d49" id="678d0db2-0e5e-487e-b968-ac066fe61d49"></h3><p>Problem: Estimating food calories manually is time consuming and often inaccurate because users must identify foods and estimate portion sizes themselves.</p><p>Solution: We developed an AI smart glasses system that uses YOLOv8 for food recognition and stereo vision for volume estimation, then calculates calories using nutrition data.</p><p>Impact: The system provides fast, hands free calorie estimation, making dietary tracking more convenient and demonstrating the potential of AI powered wearable health technology. Most importantly, this offers better depth estimation than AI calorie apps because it uses 2 cameras.</p>]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[Food-Calorie-Estimator An AI Glasses System for Food Recognition and Calorie Estimation]]></title>
            <description><![CDATA[The Food-Calorie-Estimator is a simulated AI glasses system designed to recognize food and estimate its calorie content automatically. The system uses a pair of stereo images to obtain depth ...]]></description>
            <link>https://gtc.blendedlearn.org/project-team-matching-uuc2fiio/post/food-calorie-estimator-an-ai-glasses-system-for-food-recognition-and-e9raEw35Jxyf4qY</link>
            <guid isPermaLink="true">https://gtc.blendedlearn.org/project-team-matching-uuc2fiio/post/food-calorie-estimator-an-ai-glasses-system-for-food-recognition-and-e9raEw35Jxyf4qY</guid>
            <category><![CDATA[Program Outcomes]]></category>
            <dc:creator><![CDATA[Chih-Jung Hsu]]></dc:creator>
            <pubDate>Fri, 12 Jun 2026 13:51:35 GMT</pubDate>
            <content:encoded><![CDATA[<p>The Food-Calorie-Estimator is a simulated AI glasses system designed to recognize food and estimate its calorie content automatically. The system uses a pair of stereo images to obtain depth information and a YOLO-based model to identify the food category and region. The detected food area is combined with the estimated depth to calculate food volume. The volume is then converted into approximate mass using a food-density table, and the calorie value is calculated from the nutritional information of the detected food. A Gradio-based interface displays the food detection result, depth map, estimated volume, mass, and total calories. The system also includes optional AI narration, text-to-speech output, latency measurement, and power-consumption estimation to simulate a complete smart-glasses user experience.</p>]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[A Trace-Driven Study of Priority-Aware Tensor Prefetching for MobileNetV2 Inference on TCM-Like Edge AI Accelerators]]></title>
            <description><![CDATA[Our team addressed the memory-movement bottleneck in edge AI inference, especially for lightweight CNN models such as MobileNetV2 running on accelerator-based systems with limited on-chip buffer ...]]></description>
            <link>https://gtc.blendedlearn.org/project-team-matching-uuc2fiio/post/a-trace-driven-study-of-priority-aware-tensor-prefetching-for-mobilenetv2-uFii2YiVKVARI7P</link>
            <guid isPermaLink="true">https://gtc.blendedlearn.org/project-team-matching-uuc2fiio/post/a-trace-driven-study-of-priority-aware-tensor-prefetching-for-mobilenetv2-uFii2YiVKVARI7P</guid>
            <category><![CDATA[AI ]]></category>
            <category><![CDATA[Engineering]]></category>
            <category><![CDATA[VLSI]]></category>
            <dc:creator><![CDATA[Jia Qi Choy]]></dc:creator>
            <pubDate>Fri, 12 Jun 2026 13:51:16 GMT</pubDate>
            <content:encoded><![CDATA[<p>Our team addressed the memory-movement bottleneck in edge AI inference, especially for lightweight CNN models such as MobileNetV2 running on accelerator-based systems with limited on-chip buffer capacity. Although MobileNetV2 reduces computation using depthwise and pointwise convolutions, frequent tensor movement between off-chip DRAM and on-chip memory can still cause latency and energy overhead.</p><p>To address this problem, we developed a trace-driven memory scheduling simulator that compares different tensor-management policies, including LRU and a priority-aware scheduling policy. Our final Phase 12 priority policy preserves LRU-style activation reuse while adding conservative future-aware weight prefetching for upcoming compute layers. We evaluated the policy using MobileNetV2 layer traces, 3-layer window-based design space exploration, and selected buffer sizes such as 192 KB and 512 KB. The 192 KB case connects to a SCALE-Sim-like buffer configuration, while the 512 KB case demonstrates reduced DRAM read traffic for selected DepthwiseConv2d windows. We further validated selected DRAM-level behavior using Ramulator and DRAMPower.</p><p>The impact of this project is that it shows how lightweight scheduling decisions can improve inference latency and, in selected cases, reduce DRAM-device energy. The study provides a practical methodology for analyzing tensor reuse, prefetch opportunities, and memory-energy tradeoffs in edge AI accelerators.</p>]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[A Trace-Driven Study of Priority-Aware Tensor Prefetching for MobileNetV2 Inference on TCM-Like Edge AI Accelerators]]></title>
            <description><![CDATA[Our team addressed the memory-movement bottleneck in edge AI inference, especially for lightweight CNN models such as MobileNetV2 running on accelerator-based systems with limited on-chip buffer ...]]></description>
            <link>https://gtc.blendedlearn.org/project-team-matching-uuc2fiio/post/a-trace-driven-study-of-priority-aware-tensor-prefetching-for-mobilenetv2-pmipIVzvpy520rW</link>
            <guid isPermaLink="true">https://gtc.blendedlearn.org/project-team-matching-uuc2fiio/post/a-trace-driven-study-of-priority-aware-tensor-prefetching-for-mobilenetv2-pmipIVzvpy520rW</guid>
            <dc:creator><![CDATA[Yeshwanth]]></dc:creator>
            <pubDate>Fri, 12 Jun 2026 13:50:47 GMT</pubDate>
            <content:encoded><![CDATA[<p>Our team addressed the memory-movement bottleneck in edge AI inference, especially for lightweight CNN models such as MobileNetV2 running on accelerator-based systems with limited on-chip buffer capacity. Although MobileNetV2 reduces computation using depthwise and pointwise convolutions, frequent tensor movement between off-chip DRAM and on-chip memory can still cause latency and energy overhead. To address this problem, we developed a trace-driven memory scheduling simulator that compares different tensor-management policies, including LRU and a priority-aware scheduling policy. Our final Phase 12 priority policy preserves LRU-style activation reuse while adding conservative future-aware weight prefetching for upcoming compute layers. We evaluated the policy using MobileNetV2 layer traces, 3-layer window-based design space exploration, and selected buffer sizes such as 192 KB and 512 KB. The 192 KB case connects to a SCALE-Sim-like buffer configuration, while the 512 KB case demonstrates reduced DRAM read traffic for selected DepthwiseConv2d windows. We further validated selected DRAM-level behavior using Ramulator and DRAMPower. The impact of this project is that it shows how lightweight scheduling decisions can improve inference latency and, in selected cases, reduce DRAM-device energy. The study provides a practical methodology for analyzing tensor reuse, prefetch opportunities, and memory-energy tradeoffs in edge AI accelerators.</p>]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[Food Calorie Estimator]]></title>
            <description><![CDATA[Food-Calorie-Estimator leverages state-of-the-art deep learning models to:Detect and classify food items from images using YOLOv8n. Estimate portion sizes and volumes using depth analysis. Calculate ...]]></description>
            <link>https://gtc.blendedlearn.org/project-team-matching-uuc2fiio/post/food-calorie-estimator-Y0hIYN9CZJk24KU</link>
            <guid isPermaLink="true">https://gtc.blendedlearn.org/project-team-matching-uuc2fiio/post/food-calorie-estimator-Y0hIYN9CZJk24KU</guid>
            <category><![CDATA[OCE | PBL Project]]></category>
            <dc:creator><![CDATA[Evelyn Zhao]]></dc:creator>
            <pubDate>Fri, 12 Jun 2026 13:49:53 GMT</pubDate>
            <content:encoded><![CDATA[<p>Food-Calorie-Estimator leverages state-of-the-art deep learning models to:<strong>Detect and classify</strong> food items from images using YOLOv8n. <strong>Estimate portion sizes</strong> and volumes using depth analysis. <strong>Calculate nutritional content</strong> (calories, macronutrients)<strong>Provide an interactive UI</strong> for easy use. The system is trained on the <strong>Nutrition5k dataset</strong> (113 food classes) and includes a web-based interface for seamless interaction.</p><p>Here's our team distribution:</p><ul><li><p><strong>Yihan Zhao</strong> — YOLOv8n Model Training &amp; Development</p></li><li><p><strong>Wending Zhu</strong> — UI Simulation &amp; User Interface Design</p></li><li><p><strong>Advik Iyer</strong> — Project Idea Proposal</p></li><li><p><strong>Chih-Jung Hsu</strong> — Latency Optimization &amp; Performance</p></li></ul><p>UI Basic Workflows:</p><p>First, we upload left/right stereo images or use bundled demos. Then we set some parameters: <strong>B</strong> (baseline): Distance between cameras in mm (~60 mm for iPhone) and <strong>fx</strong> (focal length): Camera focal length in pixels (~2852 for iPhone 1× portrait). Then we run analysis. This model detects foods, estimate volume, and calculate calories. Finally we can see detection overlay, depth map, AR nutrition panel, and narration. Additionally, we use Deepseek AI Narration, which requires API key in <code>.env</code> . We add a function of text to speech function. It will automatically output voice for results.</p><p><strong>YOLO Traning:</strong></p><p>We train the YOLOv8n model on the Nutrition5k dataset, which covers a total of 13 distinct food classes and is pre-divided into training, validation and test subsets. We execute model training by running the command python <a class="text-interactive hover:text-interactive-hovered" rel="noopener noreferrer nofollow" href="http://train.py">train.py</a>. Our core training hyperparameters are configured as follows: we select YOLOv8n, the nano-sized variant prioritizing fast inference speed; we set the total training epochs to 100, paired with an early stopping rule that terminates training if no performance gain is observed within 20 consecutive epochs. The batch size is fixed at 16, a value optimized for the NVIDIA A100 GPU, and we adopt a uniform input image resolution of 640×640. All training logs, weight files and result visualizations are automatically stored in the runs/ directory. In terms of recognition scope, this model is capable of identifying all 113 food categories from the Nutrition5k dataset, including a wide range of fruits and vegetables such as apples, bananas, bell peppers and avocados.</p>]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[A Trace-Driven Study of Priority-Aware Tensor Prefetching for MobileNetV2 Inference on TCM-Like Edge AI Accelerators]]></title>
            <description><![CDATA[Our team addressed the memory-movement bottleneck in edge AI inference, especially for lightweight CNN models such as MobileNetV2 running on accelerator-based systems with limited on-chip buffer ...]]></description>
            <link>https://gtc.blendedlearn.org/project-team-matching-uuc2fiio/post/a-trace-driven-study-of-priority-aware-tensor-prefetching-for-mobilenetv2-9aXobzqKkmXJcoT</link>
            <guid isPermaLink="true">https://gtc.blendedlearn.org/project-team-matching-uuc2fiio/post/a-trace-driven-study-of-priority-aware-tensor-prefetching-for-mobilenetv2-9aXobzqKkmXJcoT</guid>
            <category><![CDATA[Program Outcomes]]></category>
            <dc:creator><![CDATA[Ching Chi Cheng]]></dc:creator>
            <pubDate>Fri, 12 Jun 2026 13:49:11 GMT</pubDate>
            <content:encoded><![CDATA[<p>Our team addressed the memory-movement bottleneck in edge AI inference, especially for lightweight CNN models such as MobileNetV2 running on accelerator-based systems with limited on-chip buffer capacity. Although MobileNetV2 reduces computation using depthwise and pointwise convolutions, frequent tensor movement between off-chip DRAM and on-chip memory can still cause latency and energy overhead. To address this problem, we developed a trace-driven memory scheduling simulator that compares different tensor-management policies, including LRU and a priority-aware scheduling policy. Our final Phase 12 priority policy preserves LRU-style activation reuse while adding conservative future-aware weight prefetching for upcoming compute layers. We evaluated the policy using MobileNetV2 layer traces, 3-layer window-based design space exploration, and selected buffer sizes such as 192 KB and 512 KB. The 192 KB case connects to a SCALE-Sim-like buffer configuration, while the 512 KB case demonstrates reduced DRAM read traffic for selected DepthwiseConv2d windows. We further validated selected DRAM-level behavior using Ramulator and DRAM Power. The impact of this project is that it shows how lightweight scheduling decisions can improve inference latency and, in selected cases, reduce DRAM-device energy. The study provides a practical methodology for analyzing tensor reuse, prefetch opportunities, and memory-energy tradeoffs in edge AI accelerators.</p>]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[Identification of Edge Cases and Weather Impacts on Sensor Distance Estimation for Autonomous Driving]]></title>
            <description><![CDATA[This study aims to address perception failures (recognition errors) under adverse weather conditions such as heavy rain, dense fog, and snow, which currently represent the most critical technical ...]]></description>
            <link>https://gtc.blendedlearn.org/project-team-matching-uuc2fiio/post/identification-of-edge-cases-and-weather-impacts-on-sensor-distance-79ZsqliJl6O6VRh</link>
            <guid isPermaLink="true">https://gtc.blendedlearn.org/project-team-matching-uuc2fiio/post/identification-of-edge-cases-and-weather-impacts-on-sensor-distance-79ZsqliJl6O6VRh</guid>
            <category><![CDATA[Program Outcomes]]></category>
            <dc:creator><![CDATA[Sara Hamada]]></dc:creator>
            <pubDate>Fri, 12 Jun 2026 12:11:21 GMT</pubDate>
            <content:encoded><![CDATA[<p>This study aims to address perception failures (recognition errors) under adverse weather conditions such as heavy rain, dense fog, and snow, which currently represent the most critical technical barriers to the commercial deployment of Level 4 autonomous driving. Analyzing disengagement and accident logs of autonomous vehicles reveals that weather-induced sensor degradation is the primary trigger for automated systems to relinquish vehicle control (triggering takeover requests), directly leading to collisions. Nevertheless, comprehensive evaluations regarding which sensors suffer from measurement difficulties under specific meteorological conditions, as well as the quantitative magnitude of such performance loss, have remained insufficient. To bridge this gap, this project conducts a two-dimensional cross-analysis spanning meteorological parameters (rainfall intensity, fog density, and solar altitude angle) and perception modalities (LiDAR, stereo cameras, and pre-trained monocular AI camera models). Through this framework, we quantitatively clarify the inherent performance limitations of each sensor solution while leveraging these analytical findings to develop and propose anti-interference multi-sensor fusion algorithms and weather-adaptive dynamic control switching strategies.</p><p>As a concrete methodology, a pedestrian crossing scenario was constructed within the open-source autonomous driving simulator "CARLA." Ranging experiments for each sensor were conducted under randomized adverse weather conditions, and time-series logs were collected and analyzed at a high sampling rate of 20 FPS (Delta t = 0.05 s). First, the baseline performance of each individual sensor was evaluated against the ground-truth (GT) distance under an ideal environment characterized by zero precipitation, zero fog, and a 45 dgree solar altitude angle (true noon). The results demonstrated that the stereo camera suffered from occasional ranging errors due to intense sunlight contrasts, as its recognition performance heavily relies on structural contrast. The AI camera exhibited a decline in accuracy at long ranges exceeding 20 meters, yet it demonstrated the highest tracking performance among the three sensors within a short range of 20 meters or less. Meanwhile, the semantic LiDAR proved to be completely distance-independent in the absence of weather noise, exhibiting exceptionally high stability, particularly at close ranges of 15 meters or less.</p><p>Next, to comprehensively evaluate the multi-modal impacts of meteorological variations and ambient lighting shifts, rainfall and fog parameters were normalized into 0.1 increments to visualize an "Ultimate 9-Matrix ODD Analysis" isolated across three distinct time slots: noon, sunset, and night. To ensure a stringent automotive safety standard, a successful recognition flag was defined as establishing a stable lock-on distance where the relative error against the ground-truth remained within 30% (alpha = 0.3) for 10 consecutive frames (0.50 seconds). The analysis revealed that during sunset, the low solar altitude angle causes intense backlighting (glare) to hit the camera lenses directly. This induces severe perception latency, forcing the vehicle to get approximately 10 meters closer to the target compared to noon before it can initiate a stable distance estimation. Furthermore, in nighttime environments under most adverse weather scenarios, camera-based modalities (stereo and AI) suffered from complete blindness until the target approached within a critical range of approximately 10 meters, highlighting a fatal recognition boundary.</p><p>In the final stage, using the $11 \times 11$ experimental matrix as a base dataset, raw empirical data points were 100% preserved while the remaining unmeasured sparse cells were smoothly interpolated based on local trends via grid-data interpolation (Griddata method). This synthesized a seamless, continuous contour map with a color range scaled from 5 meters (red: critical hazard) to 35 meters (blue: safe cruising). By overlaying a physical safety line of 18 meters—representing the total stopping distance (perception delay plus braking distance) when traveling at 40 km/h on a wet road surface with a friction coefficient of $\mu = 0.4$—the gap between perception limits and physical control constraint was evaluated. The quantitative results conclusively proved that regardless of the time of day (noon, sunset, or night), when the environment reaches extreme conditions of "fog density of 80% or higher" and "precipitation rate of 85–90% or higher," the stable tracking distance completely drops below this 18-meter threshold. Consequently, the vehicle is forced into a "braking limit violation zone" where a collision becomes physically unavoidable even at a moderate speed of 40 km/h. Ultimately, the outcomes of this research mathematically and physically visualize the hard boundaries of autonomous perception, delivering a solid engineering foundation for the safety-critical architecture design of all-weather autonomous driving systems.</p>]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[Identification of Edge Cases and Weather Impacts on Sensor Distance Estimation for Autonomous Driving]]></title>
            <description><![CDATA[CORE PROJECT ORIENTATION

Traditional static multi-sensor fusion algorithms fail to adapt to asymmetric performance drops of cameras, LiDAR and visual sensors under adverse weather. This project ...]]></description>
            <link>https://gtc.blendedlearn.org/project-team-matching-uuc2fiio/post/identification-of-edge-cases-and-weather-impacts-on-sensor-distance-rxzIKPH0sGiF9kV</link>
            <guid isPermaLink="true">https://gtc.blendedlearn.org/project-team-matching-uuc2fiio/post/identification-of-edge-cases-and-weather-impacts-on-sensor-distance-rxzIKPH0sGiF9kV</guid>
            <category><![CDATA[OCE | PBL Project]]></category>
            <dc:creator><![CDATA[Junhao Zhu]]></dc:creator>
            <pubDate>Fri, 12 Jun 2026 11:43:58 GMT</pubDate>
            <content:encoded><![CDATA[<h2 class="text-xl" data-toc-id="80be2f95-3cdc-4497-84ce-785b5f93f129" id="80be2f95-3cdc-4497-84ce-785b5f93f129">Core Project Orientation</h2><p>Traditional static multi-sensor fusion algorithms fail to adapt to asymmetric performance drops of cameras, LiDAR and visual sensors under adverse weather. This project delivers an integrated simulation, algorithm and test framework to build all-weather reliable autonomous perception systems.</p><h2 class="text-xl" data-toc-id="532af055-3f6d-4018-8d16-1de85fba7df2" id="532af055-3f6d-4018-8d16-1de85fba7df2">Main Functional Modules</h2><ol><li><p><strong>Quantitative Sensor Performance Evaluation</strong></p><p> It quantifies performance degradation of stereo cameras, AI vision cameras and LiDAR under varied meteorological and lighting conditions, and identifies inherent detection limits such as LiDAR’s stable effective range and vision sensors’ fog interference thresholds.</p></li><li><p><strong>Weather-Adaptive Dynamic Multi-Sensor Fusion</strong></p><p> Unlike static fusion strategies, the framework dynamically assigns weight to each sensor based on real-time weather and light input. It elevates the weight of weather-resistant LiDAR in fog or heavy rain and suppresses unreliable visual data under strong glare to mitigate perception failures.</p></li><li><p><strong>Automated High-Speed Edge Case Mining</strong></p><p> The pipeline automatically generates rare high-risk driving scenarios for speeds up to 40 km/h and above. It discovers hidden safety hazards like insufficient braking distance triggered by sensor malfunction, enriching test coverage for extreme driving conditions.</p></li><li><p><strong>End-to-End Automated CARLA Experiment &amp; Data Operation System</strong></p></li></ol><ul><li><p>Deploys and maintains a fully automated simulation pipeline based on the CARLA simulator</p></li><li><p>Implements an automatic data collection system integrated with Lark Bot and Git monitoring</p></li><li><p>Supports segmented file upload, real-time exception alerting, and full workflow automation for massive repeated simulations</p></li><li><p>Standardizes data generation, quality control and experimental result delivery</p></li></ul><ol start="5"><li><p><strong>Weather Scenario Engineering &amp; Hyperparameter Tuning</strong></p></li></ol><ul><li><p>Enables customized sequential weather variation scene design in simulation</p></li><li><p>Embeds Optuna for automated hyperparameter search to optimize fusion model performance across weather profiles</p></li></ul><h2 class="text-xl" data-toc-id="e55c09f5-21a2-4ee5-82d9-e37dab13de4c" id="e55c09f5-21a2-4ee5-82d9-e37dab13de4c">Project Value</h2><p>This toolkit resolves the poor generalization of conventional sensor fusion under complex weather. It combines adaptive perception algorithms with automated simulation testing, providing reproducible technical support for the development of safe, all-weather autonomous driving systems.</p>]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[From Game Engines to Robot Fingers: How One Researcher Mapped the Future of Computer Graphics]]></title>
            <description><![CDATA[On June 8, AI+X GTC hosted an industry workshop at the intersection of computer graphics, machine learning, and real-world careers. The session was designed for international students who want to ...]]></description>
            <link>https://gtc.blendedlearn.org/storytelling-blog-0cxvq5wl/post/from-game-engines-to-robot-fingers-how-one-researcher-mapped-the-future-ps9mywLO9UUNB52</link>
            <guid isPermaLink="true">https://gtc.blendedlearn.org/storytelling-blog-0cxvq5wl/post/from-game-engines-to-robot-fingers-how-one-researcher-mapped-the-future-ps9mywLO9UUNB52</guid>
            <dc:creator><![CDATA[BlendED]]></dc:creator>
            <pubDate>Fri, 12 Jun 2026 00:43:03 GMT</pubDate>
            <content:encoded><![CDATA[<p>On June 8, AI+X GTC hosted an industry workshop at the intersection of computer graphics, machine learning, and real-world careers. The session was designed for international students who want to understand not just what to study — but how technical skills actually land in industry, and what the path from classroom to cutting-edge research looks like from the inside.</p><p>Our speaker was <strong>Yifei Li</strong>, a PhD graduate in Computer Science from MIT, where her research sits at the intersection of graphics, differentiable simulation, and physical AI. Before that, she completed her Bachelor's at Carnegie Mellon University — one of the top CS programs in the world. </p><p>Over the course of her academic career, she completed <strong>seven internships</strong> at companies including Activision Blizzard, Google, Meta Reality Lab, Boston Dynamics AI Institute, and NVIDIA. Each one took her somewhere different: game engines, augmented reality, digital humans, robotics. She wasn't chasing trends. She was following the problems.</p><p>This workshop was her telling us what she learned.</p><hr><p></p><h2 class="text-xl" data-toc-id="aded2ff4-53eb-4180-b58a-dbfc3cf74d13" id="aded2ff4-53eb-4180-b58a-dbfc3cf74d13">Chapter 1: The First Internship — Graphics Is Engineering</h2><blockquote><p><em>"Graphics means making things very fast to run in real time, writing code that other engineers can use, optimizing performance to its extreme."</em></p></blockquote><p>When Yifei was a sophomore at Carnegie Mellon, she spent a summer writing code that made grass move.</p><p>Not metaphorically. Real grass — thousands of individually generated grass blades, swaying with wind, shifting in detail as the camera moved — all rendered in real time across the massive Battle Royale map of <em>Call of Duty: Black Ops 4</em>. It was her first internship, at Activision Blizzard. By the end of it, her work had shipped on every console and mobile platform the game touched.</p><p>Before that summer, she thought graphics was mostly about visual algorithms. After it, she understood something different: graphics in industry means performance, systems thinking, and code that survives a production environment. The grass looked beautiful. But she spent most of those months optimizing it.</p><hr><p></p><h2 class="text-xl" data-toc-id="5f54b0ec-68c4-4ffe-91d6-2cff51242ee1" id="5f54b0ec-68c4-4ffe-91d6-2cff51242ee1">Chapter 2: Google Maps — Making Virtual Objects Feel Real</h2><blockquote><p><em>"Small realism errors can break immersion. There are actually many things that could go wrong."</em></p></blockquote><p>Her next stop was Google Maps, on the augmented reality navigation team. The challenge: when virtual information widgets float in front of real buildings on your phone screen, how do you make them look like they actually belong there?</p><p>The answer was the sun. By knowing a user's GPS coordinates and exact time, you can calculate the sun's position and render accurate shadows on virtual objects — shadows that match the real world around them. Yifei built that lighting estimation pipeline. It had to work in sunshine, in overcast weather, across different cities, at any time of day. The engineering problem wasn't the idea. It was the robustness.</p><hr><p></p><h2 class="text-xl" data-toc-id="6e670cb9-d2f3-43c2-89f7-1724342f4470" id="6e670cb9-d2f3-43c2-89f7-1724342f4470">Chapter 3: Meta Reality Lab — The Research Project That Went Viral</h2><blockquote><p><em>"At the beginning, this was framed as a research project. I never imagined it would actually be used."</em></p></blockquote><p>Graduate school shifted Yifei's work from product-focused engineering to open-ended research. At Meta Reality Lab, she tackled an unlikely problem: teaching an AI to animate hand-drawn humanoid characters — the lopsided, expressive, anatomically approximate kind that children draw.</p><p>The system had to detect a skeleton in an image that looked nothing like a real body, reconstruct the character's geometry, and apply real motion sequences to it. Nobody expected it to become a product.</p><p>It did. Meta released it as a public web demo. Millions of people used it. Those users generated the data the team needed to keep improving the model. The lesson Yifei took: the distance between a research idea and something people love is sometimes shorter than you think.</p><hr><p></p><h2 class="text-xl" data-toc-id="cf9f3230-9e85-4f6a-85ed-2b28e5881ba5" id="cf9f3230-9e85-4f6a-85ed-2b28e5881ba5">Chapter 4: The Bigger Picture — Riding Industry Shifts</h2><blockquote><p><em>"The skills underneath stayed the same. What they were applied to kept changing."</em></p></blockquote><p>One of the most valuable parts of Yifei's talk wasn't about any single project. It was the pattern across all of them.</p><p>She entered the industry in 2018, when games were the natural home for graphics talent. By 2020, the focus was shifting to AI. By 2021, every major tech company was building toward the metaverse — digital humans, virtual avatars, physics simulation fast enough to feel real. By 2024, the wave was robotics.</p><p>Yifei followed each current. Her later internships at Boston Dynamics AI Institute and NVIDIA put her inside the physical world: designing robotic gripper shapes in simulation, optimizing finger geometry for manipulation, working on simulation-supervised visual reasoning for humanoid robots.</p><p>The skills — geometry, simulation, differentiable physics, rendering — stayed constant. Their application just kept expanding.</p><hr><p></p><h2 class="text-xl" data-toc-id="da3db11d-17c9-487c-ba88-107243126f0b" id="da3db11d-17c9-487c-ba88-107243126f0b">Chapter 5: Why Graphics Belongs Everywhere Now</h2><blockquote><p><em>"Graphics tells us what structure should look like. That's exactly what modern AI needs."</em></p></blockquote><p>By the end of the session, the picture was clear: computer graphics isn't a niche skill. It's a language for describing the physical world.</p><p>Geometry. Material properties. Light. Motion. Contact. Force. These are the building blocks graphics has always worked with — and they're exactly what modern AI systems need when they try to reason about the real world. Whether that's a robot learning to pick up an object, a simulation training a neural network, or a differentiable physics engine that lets you optimize a design by describing what you want it to do.</p><p>The problems worth working on, Yifei said, are the ones where you have to convert messy real-world input into something structured and controllable. That's what graphics has always done.</p><hr><p></p><h2 class="text-xl" data-toc-id="4eb6b1e6-18b3-4d4c-9fb4-a9e9e524ca63" id="4eb6b1e6-18b3-4d4c-9fb4-a9e9e524ca63">Chapter 6: Q&amp;A — What the Interview Actually Looks Like</h2><blockquote><p><em>"For research roles, it's more about a match on the skills. They want to make sure you have the right skills for the problem."</em></p></blockquote><p>The session closed with questions from attendees. The one that generated the most detail: how do engineering and research interviews actually differ?</p><p>Engineering interviews test coding, systems design, and behavioral fit. Research interviews are a different shape: less focused on distributed systems or databases, more focused on whether your specific background matches the specific problem the team is working on. Expect a presentation round where you walk through your own research in depth — they want to see how you think.</p><p>It's a small distinction that changes how you prepare, and how you choose what to apply for.</p><hr><p></p><h2 class="text-xl" data-toc-id="3ceb82aa-09c9-4380-a12f-ed1ef7f8206f" id="3ceb82aa-09c9-4380-a12f-ed1ef7f8206f">Join the GTC</h2><p>AI+X GTC hosts sessions like this regularly — researchers, engineers, and builders sharing what the work actually looks like from the inside. It's free, open to all international students, and built around one idea: the earlier you understand what's happening at the frontier, the better positioned you are when you get there.</p><p>If you want to go deeper — BlendED's NVIDIA PBL puts students inside the exact space Yifei described: visual computing, simulation, and AI systems that reason about the physical world. <a class="text-interactive hover:text-interactive-hovered" rel="noopener noreferrer nofollow" href="https://program.blendedlearn.org/pbls/computer-3d-graphics-and-deep-learning-nvidia-project"><strong>Learn more about the NVIDIA PBL →</strong></a></p><hr><p></p><p><em>Yifei Li is a researcher at MIT working at the intersection of computer graphics, differentiable simulation, and physical AI. Her work spans cloth simulation, fluid dynamics, robotic manipulation, and digital humans.</em></p>]]></content:encoded>
        </item>
    </channel>
</rss>