Docsity
Docsity

Prepare for your exams
Prepare for your exams

Study with the several resources on Docsity


Earn points to download
Earn points to download

Earn points by helping other students or get them with a premium plan


Guidelines and tips
Guidelines and tips

Internship Experiences at University of Illinois at Urbana-Champaign, Study notes of Genetics

The internship experiences of graduate students from the Department of Mathematics at the University of Illinois at Urbana-Champaign. The internships cover a range of topics including health data analysis, scientific content creation, cybersecurity, and software engineering. details on the projects undertaken by each intern and the skills they developed during their internships.

Typology: Study notes

2021/2022

Uploaded on 05/11/2023

amodini
amodini 🇺🇸

4.7

(18)

12 documents

1 / 27

Toggle sidebar

Related documents


Partial preview of the text

Download Internship Experiences at University of Illinois at Urbana-Champaign and more Study notes Genetics in PDF only on Docsity! Internship Experiences University of Illinois at Urbana-Champaign Department of Mathematics Graduate Program 2017 During the summer, Stacey Butler worked with health data at the Champaign-Urbana Public Health District. Specifically, she looked for patterns in sanitary (restaurant) inspections for Champaign area food establishments. Several years of data were analyzed using Pandas in Python, checking for inspector bias regarding different cuisine types using OLS regression and variation among inspectors' average scores, and examining the effects of policy changes on inspection scores. This summer Sneha Chaubey worked as an intern in the Scientific Content group at Wolfram Research, in Champaign. She continued her work from last year building on the Wolfram function site in Wolfram Alpha, collecting formulas, properties, theorems, etc for both elementary and complex mathematical functions and presenting the information in Mathematica notebooks. Functions include the Riemann zeta function, Dirichlet L-function, gamma function, exponential, logarithmic, sine and cosine integrals, and Jacobi and Elliptic integrals. Another project involved writing programs to construct general differential equations whose solutions in special cases are the Heun functions or Legendre functions. Yongwoon Escobar worked as an intern at Sandia National Laboratories in the Center for Cybersecurity Defenders program. His first project involved studying cryptographic protocols, gaining experience on implementing a post-quantum public-key protocol on hardware, and establishing a pipeline of tools to process documents of interest relevant to cryptography. His second project implemented additional features and solutions for a graph partitioning algorithm and library in C/C++ for large graphs, on the scale of billions of vertices and trillions of edges, for systems running in parallel. Near the conclusion of his internship, he met and interviewed with various Sandia teams and organizations for potential future employment. Benjamin Fulan - AbbVie (Research Park) The internship this summer on the tinnitus project in the Speech and Hearing Science Department (supported by the PI4 grant) continued Mary Angelica Gramcko-Tursi’s work from last year. The overall aim of the project is to find biomarkers for Tinnitus that can be used for diagnosis. The project involves lead matrices, which are used to recover the cyclic order of activity levels in specified regions of interest in the brain, based on fMRI data. Mary Angelica examined differences in lead matrices over each session as well as across different sessions, by projecting them onto subspaces of high variation to determine whether there was any discernible difference between the two subject groups (with and without tinnitus). Her work showed that the data being extracted from lead matrices is in fact meaningful, rather than being a result of noise. These methods may also be useful in other diagnostic settings too. During this summer, Tigran Hakobyan was a PhD software engineering intern at Facebook. He worked on designing a Traveling Salesman heuristic algorithm for nonlinear path weight and finite traversal duration constraints with arbitrary spatial graph partitioning. His work provided distributional statistics on the number, costs, and values of salesman routes under different constraints. David Hannasch – Department of Defense Derek Jung worked at the North Las Vegas facility of National Security Technologies, in an internship arranged through the NSF Mathematical Sciences Graduate Internship program. He worked on deblurring images, by investigating a mathematical model of the blurred image of an opaque edge. That problem serves as a model for understanding how all images are blurred. At a technical level, his work showed a certain integral operator equation is ill-posed while a Tikhonov regularized problem is well-posed, by using techniques from measure theory, functional analysis and harmonic analysis. William Karr worked at the Caterpillar Data Innovation Lab in the Research Park during the summer of 2017. The Data Innovation Lab hosts a diverse set of projects related to data and technology, proposed by dealers and clients of Caterpillar. Bill worked on developing a mobile web application called vSite for use by workers and supervisors in mining sites. The application would be used to mark problem areas and keep track of work that had been done around the site. This summer Vanessa Rivera-Quiñones interned at the lab of Professor Carla Cáceres from the Department of Animal Biology. Her work focused on studying the changes in the phototactic behavior (i.e. movement towards or away from light) of the zooplankter Daphnia Dentifera as a response to infection pressures. As part of her internship, she received lab training and collaborated on the experimental design process, a new experience for her. Vanessa will continue gathering data through the fall semester with the ultimate goal of building a mathematical model that describes this phenomenon. Nishant Rodrigues worked with the Formal Systems Laboratory in the Computer Science Department to help write an executable semantics for the Ethereum Virtual Machine in the K Framework. The group submitted a paper for review. The K framework has it's foundations in Rewriting and Matching Logic. It allows creating mathematically well-defined and executable specifications for programming languages. With these semantics, K can use Reachability Logic for program verification, that is, to automatically generate proofs of correctness for properties of programs written in these languages. Rewriting Logic, Matching Logic and Reachability have solid mathematical foundations. Many members of the Formal Methods sub-department are interested in working more closely with the Mathematics Department. Hao Sun’s project at the Ameren Innovation Center in the Research Park aimed to describe the relationship between the temperature and Ameren customers’ usage, and hence to predict future usage based on historical temperature data. Hao combined smart-meter customer data and NOAA’s temperature data in several different mathematical models to predict daily customer usage. Model errors were analyzed using linear regression, multiple linear regression, and modified multiple linear regression. Based on the multiple linear regression model, he used machine learning techniques to construct the modified multiple linear regression model, which was found to give the best predictions. This summer Albert Tamazyan worked as a software engineering intern with the Communication Products team at Yahoo. His projects in the Android Mail group were about developing machine learning algorithms for categorizing emails in mailboxes and for generating message filters for user accounts. He learned relevant tools and technologies, and gained valuable programming experience. During the summer of 2017, Corbin Tucker worked as an intern on the Predictive Analytics team at Blue Cross Blue Shield of Tennessee in Chattanooga. His responsibilities included researching possible analysis methods from peer-reviewed articles, assisting in identifying predictive variables, and cleaning text data for a machine learning algorithm. He also had a project of his own predicting patients who are most likely to suffer from a first time heart attack or stroke. As part of the internship, Corbin was introduced to new programming languages, analytics tools, and statistical methods which helped him to build his data analysis skills. Venkata Sravani Vadali worked at the Ameren Innovation Center (Research Park) for the summer on two separate projects. In one project, her team had to model a new billing scheme for Ameren that analyzes a demand-based charge as opposed to traditional billing. They analyzed terabytes of customers’ data and came up with mathematical equations that related the energy charge to the demand charge to keep Ameren’s revenue unchanged. In her second project, Sravani and the team determined the environmental factors that would affect the movement of utility poles. Various modeling and machine learning techniques were implemented in the project. She will continue working on these projects in the Fall. Her analyses was done in R, Python and SQL / Hive. During the summer, Lan Wang worked as a researcher in the Data Science group at John Deere, in the Research Park. Her task was to improve the accuracy of position data and smooth out the driving paths using Kalman Filter techniques, and then to analyze the paths using machine learning techniques, visualize the mower stripes, and finally deploy all Python scripts on AWS (Amazon Web Services) Lambda platform. During this work, she had the opportunity to combine her Math knowledge with real industrial problems. When applying the Kalman Filter, she built both linear and non-linear models to make the filter work better. When doing visualization, she took advantage of geometric insights to find efficient and effective methods to clean and parallelize mower paths. Meanwhile, she improved her programming capabilities and learned how to do operations in the cloud, taking advantage of Amazon Web Services, Amazon Turk, and the Google Compute Engine. She really enjoyed working at John Deere! Xiao Wang’s internship at the Bud Analytics Lab in the Research Park centered on customer data sets from the eCommerce department in China (especially through Tmall.com and Wechat stores). Her job included: analyzing customers’ behavior, especially different promotions and how they affect member conversion rate; training random forests to classify golden/platinum members; clustering customers from their behavior using Kmodes, ROCK (RObust Clustering using linKs) and Correlation Explanation algorithms; applying time series analysis to forecast eCommerce transaction volume. She enjoyed learning how to connect technical results to business insights. This past summer at Dow AgroSciences, Argen West worked on weed resistance modeling. First he learned about the different genetic mechanisms that cause weeds to build up resistance to herbicides and the methods that are used to model those mechanisms. Then he wrote Matlab code to predict the onset of herbicide resistance depending on different input variables. His internship culminated in a co-authored paper with his adviser that is set to be submitted for publication. During the summer, Qiang Wu worked on the Networked Stackelberg Model under the supervision of Professor Subhonmesh Bose (ECE Department). The specific project involved simulating the model and estimating the efficiency loss in a general network. Qiang began by learning the Classical Cournot Model, then coded up a simulation of the general Stackelberg Network competition. Besides that, he also revised the proof of some important theorems in a theoretical analysis of the model. During the process, he gained experience on using several optimization software packages. During the summer of 2017, Derrek Yager was partially supported by the National Science Foundation to work for the New York City Department of Transportation on taxi path inference problems. The NYC Taxi & Limousine Commission gathers GPS location trails, or “breadcrumbs,” from taxis at 1-2 minute intervals. A given month can produce over 300 million breadcrumbs. Due to GPS noise as a result of the urban canyon effect, the breadcrumbs rarely fall on the road. Using pgrouting, python, and parallel processing, Derrek separated the breadcrumbs into trips and then modeled potential paths connecting the breadcrumbs. Using Markov chains, he chose the most probable path so that he can now analyze the data using his own modification of Sparse Non-negative Matrix Factorization. This work extends a PI4 project from the previous summer with Vaibhav Karve, and ties in with an Illinois Geometry Lab project he has been co-supervising during the academic year. Instagram is a rapidly growing platform with over 500 million users sharing and consuming content every day. The Instagram Ads team in the Menlo Park, where Mingyu Zhao spent her summer internship, connects businesses to people by building a platform to show ads to the users. The team is focused on using state-of-the-art machine learning techniques to understand users' interests and increase the relevance of business content shown on Instagram. In particular, Mingyu worked on ad auctions, which include selection and pricing mechanisms for choosing the best ad based on the predicted performance. This project relied on game theory to make the auction incentive-compatible. During the summer, Dara Zirlin attended the IMA Math-to-Industry Boot Camp at the University of Minnesota. This was a six week program, where the first three weeks focused on courses in programming, data analysis, and mathematical modeling. The next three weeks were spent on group projects. The first project involved predicting the outcome of men’s singles tennis tournaments. The second involved optimizing oil well placement to maximize profit. During her PI4 internship, Mary Angelica Gramcko-Tursi worked with the Speech and Hearing Department under the supervision of Professors Fatima Hussein and Yuliy Baryshnikov. Her work was part of a larger project aimed at finding a biological basis for Tinnitus. As of now, fMRI scans of people with normal hearing and those with Tinnitus are virtually indistinguishable. If a difference could be found, the discovery would help establish a proper diagnosis and open doors for new treatment. Unfortunately fMRI images have the limitation of detecting brain activity in a single instant, while it is possible that different regions work together over an extended period of time and are activated in patterns or cycles with time delays between each region, rather than activating instantaneously. In order to detect differences in this delayed behavior, Mary Angelica computed “lead matrices”, matrices whose elements quantify time delays between levels of activity in certain regions of interest (ROIs) under study. In order to isolate possible differences, she used non-linear programming and optimization techniques to find optimal separating hyperplanes of various pairs of sets of lead matrices, this time interpreted as points in a Euclidean space, differentiated either by subject category or choice of time interval. Finally, she used statistical tools to detect levels of variation between subject groups and time intervals. By testing for separability and using simple analyses on the lead matrices themselves, she was able to isolate certain ROIs which may point a way forward for further research on the project. Throughout the entire process, Mary Angelica learned about the wide range of mathematical tools that can be used in analyzing biological processes. During the summer of 2016, Byron Heersink held an internship at Sandia National Laboratories in Albuquerque. There, he joined an ongoing project in developing and analyzing certain types of game-theoretic models, and in particular, to contribute rigorous mathematical analysis of the models. In addition to simplifying and improving proofs in some of Sandia’s earlier work, he developed algorithms for performing analytical calculations of certain outcomes of the models, and implemented them in Python. He also assisted in the validation of model simulations developed by other interns. Another component of the work was investigating the possibility of applying recent work in control theory to complex systems research at Sandia. Lastly, Byron took part in a program giving interns the opportunity to learn about Sandia’s energy research, which included many interesting tours of various Sandia facilities. Elliot Kaplan worked at Wolfram research during the summer of 2016, in an internship supported by the PI4 program. He worked with a team to develop a way to talk about theorems and definitions in pure mathematics using the Wolfram language. This included numerous discussions about the best way to encode basic ideas in set theory (such as products of sets) as well as abstract concepts (uniqueness up to an equivalence relation) and concrete concepts (Cauchy sequences). In doing this, Elliot learned how pure mathematics, even foundations of mathematics, can be useful in an industry setting. Throughout the summer, Elliot worked with another intern (Ian Ford) to encode the first six chapters of Munkres’ classic text “Topology”. William Karr worked at the Caterpillar Data Innovation Lab in Research Park during the summer of 2016. The Data Innovation Lab has a diverse set of data and technology related projects proposed by dealers and clients of Caterpillar. Some of the projects involve data mining and machine learning. Bill worked on developing a mobile application called Track & Trace to be used by municipalities to track and manage fleets of weather-related vehicles. The application allows operators to tag problem areas and allows county engineers and supervisors to monitor progress. This application could eventually be used to analyze historical records to determine how to more efficiently allocate resources for fleets of such vehicles. As part of the PI4 program, Vaibhav Karve studied the traffic patterns in New York city taxi data. Under the guidance of Professor Richard Sowers in the Department of Industrial and Systems Engineering, he analyzed the post-processing traffic data of nearly ten thousand taxis running on the 260,000 links in the New York city road grid. This data spanned every hour of the day, every day of the year, for four years. Vaibhav used algorithms of Non-negative Matrix Factorization (NMF) to produce good low-rank approximations to the taxi data, and then isolated patterns in the traffic with the aim of improving our understanding of traffic flow and traffic dynamics. To achieve these goals, he used an algorithm known as Sparse-NMF (a combination of NMF and k-means clustering). Sparse-NMF allowed him to compress data while simultaneously clustering road links based on their traffic patterns. This clustering could be used in the future to understand individual components that contribute in the overall traffic dynamics. For this project, Vaibhav wrote code in Python and ran it on the Illinois Campus Computing Cluster. He will continue the visualization aspects of this project with a larger team as part of the Illinois Geometry Lab in Fall 2016. Nicholas Kosar spent the summer of 2016 as an intern at Personify, on an internship set up through the PI4 program. Nick worked on two projects: exploring the feasibility of augmented reality with a standard smart phone and using a webcam with a depth sensor to create virtual meetings. Through this internship, Nick gained exposure to current research that is being done in augmented and virtual reality; he also gained experience working in computer vision. Instagram is a rapidly growing platform with over 400 million users sharing and consuming content everyday. The Instagram Data Products team in the Bay Area, where Shiya Liu worked for the summer, helps users to find content of interest by building search, content discovery and recommendation products on Instagram. The team is focused on using state of the art Machine Learning techniques to understand users' interests and increase the relevance of content shown on Instagram. In particular, Shiya worked in the Media Ranking team. Interesting challenges for the coming year include mapping users' interests to content consumed on Instagram, discovering and identifying events happening on Instagram, and connecting users to the most relevant accounts. As an intern in the Content Development Division at Wolfram|Alpha Scientific Content, Amita Malik's job was to build data related to number theory and complex analysis, by investigating results equivalent to the Riemann Hypothesis (RH). Since, RH arises in many seemingly different fields of Mathematics, it would be useful to find all this data together, and would save a great deal of time and help mathematicians with their research. The second part of Amita's project was to curate theorems in complex analysis, as a first step towards automatic theorem proving in this area using Mathematica. During the summer of 2016, Andrew McConvey worked as a summer intern in the Securities division at Goldman Sachs in New York. Andrew rotated between two different "Strats" teams, working first with the Foreign Exchange Electronic Trading desk before moving to the Structured Credit Trading team. His responsibilities included analyzing methods of streaming data to clients, specifically how to balance the need for real time data with the cost of sending new information. He also developed a tool to help evaluate and easily display the cost and risk of a trade. As part of the internship, Andrew was introduced to new programming languages and packages, which helped him to improve his coding and data analysis skills. Anna Mitchell worked this summer at RLI Insurance Company, a specialty product insurance company headquartered in Peoria. She worked with the reserving team in the risk services department, whose main responsibilities include determining the amount of money to set aside, or “reserve”, for future claims and liabilities. Anna primarily provided assistance with quarterly reserve reviews, and reconciled data for reserve studies, but also had a project of her own analyzing the effectiveness of five methods used by the reserving team to predict future loss development. She had the opportunity to present this information to all the top executives of the firm. This internship experience enhanced Anna’s professional skills and introduced her to several successful practitioners in actuarial science. During the summer of 2016, Sarka Petrickova worked on data analysis and model fitting in the biological sciences, in an internship arranged through the PI4 program. Under the guidance of David LeBauer (Energy Biosciences Institute) and Yuliy Baryshnikov (Mathematics), she worked together with another intern, Benjamin Wright, on two projects. In the first project they studied light interception in canopies. The second project concerned root systems of canopies like sorghum or maize. Here, the main goal was to compare the precision of various measuring tools that are, or could be, used in the field to estimate important parameters of the root system. Sarka reports that this internship was a great opportunity to learn about crop sciences, improve her programming skills, and get a feel for what it is like to work in an applied setting. Michael Raftery - Centers for Medicare and Medicaid Services Sepideh Rezvani worked on data analytics projects at the Ameren Innovation Center at the Research Park. She collaborated with other interns on customer usage clustering, detecting anomalies, and sentiment analysis projects using Ameren’s new cloud-based data. Based on usage patterns, they identified six main clusters containing more than 90% of customers. To detect anomalies, she trained a machine learning model to predict customers’ energy usage patterns for the first three months of 2016, using Ameren data collected by smart meters over the year of 2015. This project helps detect fraud, and the goal for the Fall is to improve the model using neural networks. In the sentiment analysis project, Sepideh analyzed Twitter data in order to alert Ameren if the general sentiment is “very negative” in a specific location, for instance due to an outage. She reports that the internship provided a great opportunity to learn cloud-based tools for cleaning, manipulating, analyzing and visualizing data, and also presentation skills using Tableau dashboards. All the projects required a strong knowledge of linear algebra, statistics and programming in Python. 2015 Nerses Aramyan: During his internship at Wolfram Research in Summer 2015, Nerses Aramyan investigated and developed identities for classes of special functions such as the Wigner D, Siegel Theta, and Bell Y functions. These identities will now be included on the Wolfram Alpha website and in the knowledge base of the Mathematica software. Nerses converted abstract mathematical expressions into a machine-readable form, and tested the formulas extensively both symbolically and numerically. He reports that “one of the most satisfying theoretical topics I learned in the course of this work was the connection between Lie group theory, representation theory, harmonic analysis, and the theory of special functions.” Hannah Burson worked with Professor Rebecca Smith in the Pathobiology Department of the College of Veterinary Medicine in the summer of 2015, an internship arranged through the PI4 program. She worked on models of the spread of Orf, a poxvirus common in sheep, which persists even in closed populations. As little is known about the transmission dynamics of Orf, Hannah used the Python programming language to implement several different possible models. Collaborators have designed a study to collect data that will be compared with the output of the code, in order to better estimate the dynamics Orf transmission. For the Summer 2015 PI4 internship, Stacey Butler worked on a project in James O'Dwyer’s lab in the Plant Biology Department. The project explored properties of communities of interacting species, with competitive, mutualistic and trophic interactions. She studied a variety of dynamic evolutionary models, mostly involving ordinary differential equations of Lotka-Volterra type, taken from a variety of ecology papers searching for consistent patterns in the structure of the communities. In particular, the project examined the spectrum of the interaction matrix and of the Jacobian of the system, and the eigenvector associated with the leading eigenvalue, and the models were analyzed via simulations written in Python. In the summer of 2015, Jed Chou worked in Professor Tandy Warnow’s lab where he and his teammates empirically evaluated a new method, SVDquartets, for inferring evolutionary phylogenies from DNA sequences generated under the Multispecies Coalescent (a statistical model in which gene trees can be topologically different from the overall species tree). A contentious topic in this area is whether so-called "summary methods", which infer tree topologies on individual genes and then combine these gene trees into an overall species trees, should be used to infer species trees. SVDquartets was designed to bypass this issue of gene tree estimation error, but had not yet been compared to other coalescent-based species tree estimation methods. Jed and his team compared it with several other leading methods for phylogenetic inference on simulated datasets with very short gene alignments. One interesting outcome was that leading summary methods can in fact be highly accurate relative to the other methods on some datasets with short sequences. Through this internship, Jed learned about using simulations to understand algorithms, improved his Python and UNIX scripting skills, and he also helped write a paper that will be published in BMC Genomics. Erin Compaan spent the summer interning at a communications consulting firm in San Diego. She had met the director of the company at a lecture the previous year, and arranged the internship herself. The projects involved using machine learning algorithms to sift through large volumes of data and identify salient characteristics. While programming was a big part of the day-to-day job - mostly in C and Python - the best part of the summer was the opportunity to collaborate with senior research scientists and other interns to make progress on a relevant and challenging problem. Lin Cong worked with Wolfram Alpha this summer as an intern, in the scientific content group. His duty there was to summarize properties of function spaces, so that users can access them on the wolframalpha.com website and manipulate the connections between different spaces. To achieve this goal, Lin consulted academic articles, compared between different authors and references, determined the desired properties, and finally encoded the properties into the Data file. The last step involves combining mathematical expressions with the Wolfram language (Mathematica software). He used knowledge from Functional Analysis and Harmonic Analysis repeatedly, along with learning lots of new mathematics. The team environment in the scientific content group was friendly and helpful, and Lin enjoyed the internship tremendously. Elizabeth Field, along with fellow mathematics student Goran Tomic, held an internship at the Institute for Genomic Biology in Summer 2015, organized through PI4 and supervised by Dr. David LeBauer. They joined the Predictive Ecosystem Analyzer (PEcAn) project, which is a toolbox consisting of two main parts: 1) a scientific workflow which manages environmental data and which is compatible with various ecosystem models, and 2) a Bayesian data assimilation system which synthesizes environmental data with the ecological models. Their project was aimed at developing a method of assimilating observations of biomass in order to provide better estimates for initial parameter values for an ecosystem model. To do this, they wrote an R package which implements a particle filter method. Through this internship, she learned about Baysian data analysis and gained valuable programming skills. For the second year in a row, Meghan Galiardi held a summer internship at Sandia National Laboratories in New Mexico. She mainly worked on her thesis research while at Sandia, but took several steps to connect her research with projects at Sandia. First, Meghan worked on a team trying to identify Sandia models in which complex systems features play an integral role. Meghan's role was add more mathematical/analytical capabilities to the project. Second, Meghan worked closely with two staff members, explaining to them the details of her thesis and brainstorming possible applications to Sandia projects. Additionally, Meghan learned skills in project development, presentation-giving, and interviewing. Sandia has a very welcoming environment and is a great place to work. Meghan advises anyone who is considering an internship at Sandia to meet with representatives and learn about opportunities at the Labs. Meghan also thanks the PI4 grant for travel support. Alessandro Gondolo worked as a full-time intern with Agrible Inc., a Research Park start-up providing actionable information to growers. Alessandro's internship was mediated by the department's PI4 initiative. The projects he worked on bridged the gap between mathematics and computer science—taking his knowledge of numerical analysis and PDEs and implementing physical models in Python. The experience pushed Alessandro to better understand both broad computer science concepts such as object-oriented programming and design patterns and day- to-day coding etiquette such as data scraping and version control system, Git. He gained experience with code reviews and ticketing systems, JIRA and Trac. At the end of the summer, Agrible Inc. offered Alessandro a fall internship; he has accepted the position and extends his thanks to the PI4 initiative for the opportunities PI4 provided. Byron Heersink worked under the supervision of Dr. Samuel Beshers in the Entomology Department on modeling ant colonies, in a Summer 2015 internship arranged through the PI4 program. He designed and wrote a program in Python emulating the behavior of an ant colony, using a stimulus-threshold response model to control how workers perceive and respond to various tasks. Hence Byron could investigate how the level of need for a task or the sensitivity of workers to perform a task would affect the patterns of division of labor and the colony’s ability to manage its task needs. The Ameren Illinois Innovation Center hosted Sushma Kini during Summer 2015, for an internship arranged through the PI4 program in the Department of Mathematics. During her internship period in the customer analytics project, she analyzed the customers of Ameren Illinois and their energy usage patterns by means of statistical inferences and Python programming. By the end of the summer, the teams could segment the customers and identify which energy efficiency programs should be offered to them. The work culture at Innovation Center provides a sound technical learning experience along with a focus on opportunities for developing one’s leadership qualities. “Big data” analysts are in high demand in the job market right now, and Sushma highly recommends working as an intern to transition smoothly into a full time career in the field. Nicholas Kosar spent the summer of 2015 as an intern at the Ameren iCenter in the University of Illinois Research Park, on an internship set up through the PI4 program. His two main goals were to segment customers based on electricity and gas consumption patterns and identify customers that could benefit from energy efficiency offerings. In addition, he also helped out on a project that was analyzing the reliability of Ameren’s electrical grid. During this internship, Nicholas gained experience using Python to look at large data sets. During the Summer of 2015, Andrew McConvey worked at Akuna Capital, a proprietary trading firm based in Chicago. He undertook a month-long crash course in options theory and basic trading strategies, and then got to apply this theory in a simulated environment before trading actual products in the market. Throughout the summer, Andrew also worked with a small team of other interns to develop a model for the volatility of option markets and identify trading opportunities. This project deepened his knowledge of Python and introduced him to several packages for data analysis. Andrew met representatives from Akuna Capital at the Mathematics Corporate Day last October, and was offered the internship in January. During her internship at Dow AgroSciences in Summer 2015, which was arranged by the PI4 program, Tara Negron Santiago worked on the problem of weed resistance resulting from repeated use of a herbicide. Weeds evolve and build up resistance to specific herbicides such as glyphosate (Roundup) through repeated use. Academic herbicide resistance modeling tools have been developed to integrate knowledge about weed biology (species specific), genetics, weed management (herbicide dose response, herbicide rotation), and a variety of other interacting factors such as seed bank density, cropping system, and the initial resistant allele frequency in large weed populations. Tara developed a graphical user interface and post- processing routines in VBA coupled with Matlab, to enable weed scientists and field managers to easily use and interpret results from an existing but rather complicated weed resistance research tool. Additional functionality included the ability to explore various Best Management Practices to try to avoid future issues analogous to what is observed today with glyphosate resistant weeds. This summer Wei Qin worked as a Back End Developer intern at Agrible. This data science company at the UI Research Park provides business-critical information to select the agricultural communities. The goal for the internship was to learn the agricultural models and implement them in Python. These models typically involve second order partial differential equations which cannot be solved explicitly, and so numerical methods are essential. Thanks to her prior coursework and research in the Mathematics department, Wei was able to construct a finite volume method to solve the models, yielding solutions that are stable and consistent. In addition, she employed statistical methods to analyze and calibrate the model results, using the statistical package R to analyze the sensitivity of the model with respect to parameters. Wei says “the ability to learn new things fast will always be important, no matter which jobs we are doing!” Derrek Yager: Derrek Yager worked as graduate coordinator for the Valparaiso “Experience in Research by Undergraduate Mathematicians” program over the summer of 2015. His duties included liaising between the undergraduate students and the faculty supervisors, and preparing a series of presentations on how to create papers, talks, and posters using LaTeX. He also gave a seminar talk sharing his own research. He aided the Combinatorics group as a research mentor, and arranged trips, lunches, and activities to keep the summer experience lively. Bingji Yi: At Deutsche Bank during Summer 2015, Bingji Yi found a method to extrapolate the Parametric Implied Volatility (PIV) surface to keep it arbitrage-free and twice differentiable. This surface is a fundamental building block in quantitative finance, for it parameterizes the Black-Scholes implied volatility as a function of strike and expiration; hence modelers can still use Black-Scholes formulas for pricing and hedging efficiently. The problem with PIV is the lack of any guarantee that it will lead to arbitrage-free option prices. In practice, arbitrage is usually found in PIV with extreme strikes, which leads to the idea of extrapolating the PIV outside a certain strike range. During this internship, Bingji developed a new method of extrapolation, and proved the method will work for any (arbitrage-free) market data. He also found how to select optimal model parameters, using numerical algorithms. 2014 Stephen Berning held a Summer 2014 position at the John Deere Technology and Innovation Center in the University of Illinois Research Park as an Algorithm Scientist. Stephen learned about this opportunity at the 2013 Mathematics Corporate Forum. His main duty was working on a variation of the Vehicle Routing Problem for Deere customers. This included the mathematical development of the problem and the algorithm development and implementation for data extraction, for data analysis and for the problem itself. His duties also included aiding in the development of mathematical solutions to various problems being tackled by Deere. Stacey Butler worked with James O’Dwyer in the Plant Biology Department on modeling ecological systems. This internship was set up through the PI4 program. She worked on models of mutualistic networks, such as plant-pollinator networks. Using the Python programming language Stacey wrote code to implement the models. Finding equilibrium states for the network involved finding fixed points of a system of ODE’s. Different parameter values were tested to determine the effect on the network structure. Through the course of the internship, she learned about theoretical ecology and programming. Jed Chou spent the summer of 2014 as an intern at Personify, a startup in Champaign developing an image-segmentation app for video conferences. His primary duties were to understand the theoretical aspects of the algorithms implemented in the software and to try and optimize some parts of the software. To this extent, he and his partner wrote some code in Matlab, ran experiments on a large number of images, and reported the results to one of the developers at Personfiy. Jed found this internship through the NSF-funded program PI4. Erin Compaan held an internship at the Department of Defense in the summer of 2014, granted as part of the National Physical Sciences Consortium fellowship program. She worked on a network behavioral analytics problem using machine learning techniques. There was extensive collaboration with researchers, analysts, and other interns. Much of her time was spent programming in Python and R, which proved a challenge because of her lack of training. However, the experience was excellent, and she found her supervisors valued perseverance and willingness to learn as much as prior knowledge. In addition to programming and modelling skills, she took away a new appreciation for the challenges of working with real (and messy!) data to meet clients' often unclear goals. Matthew Ellis held a summer 2014 internship at the Institute of Genomic Biology (IGB) at the University of Illinois, working as part of a group from the Energy Biosciences Institute that focuses on modeling biofuel crops, such as sugarcane and agave. He found this internship through the PI4 program, which connected him with the research group after a “Computational Bootcamp,” where he learned Python and several applications, including image processing and triangulation. Working with another Math graduate student, Nouya Wang, he developed a model for an agave plant, and wrote a ray-tracing algorithm to calculate the amount of light an agave plant receives, which ultimately determines the plant’s rate of photosynthesis. Through his work, Matthew learned about 3D imaging, Bayesian data analysis, and ray-tracing. Elizabeth Field had an internship in the biology lab of Professor Karen Sears, during the summer of 2014, through the Program for Interdisciplinary and Industrial Internships at Illinois (PI4). She worked alongside fellow student Daniel Carmody and was also advised by Prof. Zoi Rapti, both of whom are also in the mathematics department. Her project focused on using a reaction-diffusion system of partial differential equations to model the limb development of tetrapods, such as mice and bats. The model was then used to conduct a sensitivity analysis to determine which parameters are most sensitive to small changes. The results of this project will be used by biologists studying the morphological development of particular tetrapods to help them better understand the various biological factors influencing the morphological development of such species. Benjamin Fulan held an internship in Prof. Andrew Ferguson’s lab in the UIUC Materials Science Department during the summer of 2014. He was matched with the internship as part of the Mathematics Department’s PI4 program. The primary goal of the project was to develop an algorithm to aid in the creation of a vaccine for HIV, a process complicated by the virus’s ability to defend itself via mutation. Given data describing the virus’s replicative fitness as a function of its amino acid sequence, the algorithm is designed to find the set of amino acids which will most impair the virus when targeted by the immune system. As a result of the internship, Ben gained an increased knowledge of both mathematical optimization techniques (such as genetic algorithms) and computer programming. Meghan Galiardi held an internship at Sandia National Laboratories during the summer of 2014, in Albuquerque. She connected with Sandia after meeting their representatives on their regular recruiting visits to the Mathematics Department. The representatives passed her CV on to the department at Sandia that eventually offered her an internship. Meghan worked with the complex systems group doing population health modeling. Most of her time was spent programming models in Matlab. Meghan learned the complexities that go into a model when it needs to be delivered to a customer. She advises anyone who is considering an internship at Sandia to meet with representatives and learn about opportunities at the Lab. Sandia has a very welcoming environment and is a great place to work. Meg also thanks the PI4 grant for travel support. Nuoya Wang held a summer 2014 internship at Institute of Genomic Engineering at the University of Illinois, Urbana-Champaign. Nuoya’s main task in the summer project was to develop a 3D canopy model and analyze the photosynthesis effect for plants. By analyzing 3D triangular meshed data, Nuoya developed an algorithm to extract plant features, such as canopy height, leaf length, leaf width and leaf curvature from given format files. With this information, statistical strategies can be used to generate new plant models. From this internship Nuoya gained a better understanding of applied geometry and graph theory. She also learnt a lot about data cleaning, data analysis and statistical methods. Nuoya learned about this internship from Professor Yuliy Baryshnikov, associated with the PI4 Program. During the summer of 2014, Anna Weigandt interned with Personify, at the UI Research Park. Anna studied an image segmentation algorithm called GrabCut. This algorithm analyzes an image and separates the pixels into foreground and background components. GrabCut works by setting up an energy minimization problem. Pixels of very different colors are easier to separate into different components than pixels with similar color data. Anna worked on the ‘finger’ problem. When segmenting images of people, often fingers are incorrectly marked as background pixels. She implemented various modifications to the base algorithm in Matlab, to attempt to address this issue. Anna connected with Personify through the PI4 program. Sishen Zhou learned about this opportunity at John Deere (UI Research Park) from the Mathematics Department grad-careers mailing list. The position was described as a full-time contract job at the beginning. Sishen contacted the manager, showed his interest and explained that he is not going to graduate soon. After a discussion, the manager turned the position into a summer 2013 internship. The main topic of the work is to create improved maps of yield distribution by analyzing data sets collected by several sensors on a combine harvester. As a result of the internship, Sishen developed his skills in signal processing, image processing and optimization, and got plenty of experience in handling large, real data sets. 2013 Chris Bonnell, a 2013 graduate of our PhD program, is now an Associate Predictive Modeler at Allstate in Evanston, Illinois. Chris held a 2012 summer internship with Travelers. The internship focused on (1) the building and testing of predictive models built on data, and (2) the design of tools to build new and better models. Chris originally found this internship through networking with graduate school contacts. Susannah Johnson worked at Sandia National Lab during the summer of 2013. She first connected with representatives from Sandia when they visited the Mathematics Department in December 2012. One of the visitors was looking for an intern with knowledge of theorem provers, one of the primary tools in Susannah’s research. After some further conversation and a formal background review, Susannah was offered the internship. At Sandia, Susannah worked with others to develop a theorem prover related to computer security. She learned about computer security, a new area for her, and developed her skills in creating mathematical models. Susannah says that her work was mostly theoretical in nature, while some of the other interns were working in very applied areas of mathematics at Sandia. During the summer of 2013, Ziying (Cedar) Pan interned at the State Farm Research & Development Center in the University of Illinois Research Park. Cedar found this opportunity through an on-campus career fair. During the internship, he worked with four other interns to develop a prototype of an onboard device for driving risk evaluation. His role in development focused on data clustering component in the system. He designed project's SQL database, implemented its SQL interface, and also implemented a machine learning algorithm for data clustering. As a result of his experience with State Farm, Cedar gained knowledge of databases, of software development in C++ and of machine learning. Austin Rochford held a summer 2013 internship at Monetate, an e-commerce analytics contractor. One of his primary duties was to find ways to improve the accuracy of the statistical models used for evaluating the success or failure of experiments run on e-commerce websites. There were several obvious points were the assumptions of the model did not correspond to reality. Austin developed strategies for countering assumptions of the model which did not correspond to reality and thereby increased the accuracy of the reports by approximately 40%. He also collaborated with user interface and user experience designers to develop an interface for reporting the outcomes of experiments to marketers. From his internship, Austin gained an understanding of the methodology of professional software development and large-scale data analysis. Austin learned about the Monetate internship by doing an online search for data-oriented software developer internships in the Philadelphia area. He also recommends Hacker News , a software developer news/discussion site that has monthly "Who's Hiring" posts. Austin’s coding experience, including publicly available projects (on GitHub, for instance), was valuable in landing this position. Rui Song held a 2013 summer internship at the insurance company Plymouth Rock. His primary task was to determine the variables which are significant in predicting some key response variables, such as loss ratio, pure premium, counts of claims. He also did theoretical work such as comparing different predictive models in several aspects. As a result of this internship, Rui gained an understanding of the gap between real data with lots of irregular input, and ideal data, and learned about normalizing real data. Rui learned about the Plymouth Rock internship through a personal connection, and was offered the position after interviewing. Yat Sen Wong spent the summer of 2013 as an intern at Neustar, a real-time cloud- based information services and analytics company. Yat Sen learned about this internship opportunity through the Engineering Career Fair and the Research Park Career Fair, both at the University of Illinois. His main duty was to implement scalable machine learning algorithms. This means not only building a mathematical model, but also ensuring that the model can give a satisfactory running time performance. when dealing with large data sets. As part of his work, he learned several programming languages including Hadoop, Hive, Python, Java and some elementary shell script.
Docsity logo



Copyright © 2024 Ladybird Srl - Via Leonardo da Vinci 16, 10126, Torino, Italy - VAT 10816460017 - All rights reserved