6min read

Polidea Labs #3: Augmenting Reality with an iPhone

In the next episode of Polidea Labs series, we take on the subject of Augmented Reality. The article on Web AR is coming soon and if you want to read all about our VR experiment, you can find it here. This time, we are dealing with augmenting reality on iOS.

The dark ages of Apple shamelessly keeping quiet about AI topics are gone forever. Apple’s recent WWDC truly burst with news introducing frameworks that harness machine learning. These come in many flavors. At Polidea Labs, we took a quick look at the two of these tools: ARKit and Vision.

First step, always the hardest: the setup

You might want to keep in mind that these are all betas. (We hope you love Apple’s betas just like we do not.)

Before diving into augmented reality with aforementioned tools, there are several steps to follow.

  • Install Xcode 9 Beta on your Mac and iOS 11 Beta on your device. By the way, do you know what that means…? Wireless builds, yay! Finally! Except not really. Both Vision and ARKit do a lot of heavy lifting and as a result, they eat up the battery like crazy. Therefore…
  • Keep the charger close at hand.
  • Be patient — Xcode 9 does its best but there tend to be lots of crashes.
  • Forget the documentation — Apple engineers had better things to do than writing documentation.

After wading through all this, you’re ready to go.

ARKit: all the cupcakes you can’t eat

ARKit provides a platform for developing AR (augmented reality) experience in iOS apps. This means adding 2D or 3D elements to the live view from a camera in an iPhone or iPad, in such a way that it feels like these elements inhabit the real world. ARKit integrates iOS device camera and motion features to create augmented reality experience. It also offers integration with SceneKit and SpriteKit, as well as more low-level control with Metal 2.

There are three main layers that ARKit can be broken up to:

  • Tracking, which provides real-time information about device’s relative position in the physical environment. ARKit uses visual inertial odometry, which sounds somewhat like rocket science, right? But in simple words, it uses AVFoundation and CoreMotion under the hood to estimate the 3D position of the device relative to its starting position.

  • Scene understanding, which offers features like plane detection, hit testing or light estimation. With these functionalities, ARKit empowers us to integrate any virtual content into the physical world.

  • Rendering. It’s a nod to the SpriteKit and SceneKit developers especially — ARKit implements most of the rendering for you. The good news is that both Unity and Unreal are said to support all the ARKit features (which opens up some interesting opportunities for us after our recent PolideaLabs investigation.

All you need to do is tell the session of your ARSceneView (showing up as a regular camera view) to run with a specified configuration. ARKit handles all the processing. You can create an ARWorldTrackingSessionConfiguration which enables tracking the device’s movement with six degrees of freedom: the three rotation axes (roll, pitch, and yaw), and three translation axes (movement in x, y, and z).

The ARSession object outputs snapshots (ARFrame objects) that contain all the data concerning the state of the session. What you’ll do about that content is entirely up to you — imagination is the limit (or actually, the likely enigmatic crash of ARKit).

IMG_0398_scaled.png Special thanks to Mr Kipling for a delicious virtual cupcake

Vision: what you see is what your iPhone recognizes

Vision framework introduced by Apple provides some powerful image analysis and computer vision techniques that let identify objects in images and video registered by the device’s camera. Vision is built up on CoreML framework (also a crisp new machine learning tool from Apple), which enables using self-trained models (or one of the ready-made, e.g. these offered by Apple) that come in Xcode friendly .mlmodel format.

How do we play with Vision? Basically, we create a request handler (VNImageRequestHandler) which performs all the requests from a passed array. These request objects inherit from VNImageBasedRequest class and involve tasks such as:

  • face recognition
  • horizon detection
  • aligning the content of two images
  • scene classification
  • barcodes or text detection
  • image tracking

While creating such a request, we pass a completion handler in which we can then handle some VNObservation based objects that are produced while executing the request. What these observations contain depends on the type of request performed, choosing from the above.

The funny thing about Vision is that while it copes seamlessly with the sizing of the image (you don’t need to do any scaling), it’s completely helpless when it comes to determining the orientation of the image. It requires you to specify it explicitly, otherwise, all the image based requests will come unstuck.

So to sum up, Vision hands in some high-level, on-device solutions accessible via a simple API. To be honest, some of them — like barcode detection request — leave a lot to be desired. But others, like classification requests, rely entirely on the CoreML model used, so there’s more to play with creating — or choosing — the right one.

IMG_0418_scaled.jpg InceptionV3 model gives 88% confidence recognizing a lipstick.

Can you feel it coming?

If these are the first steps in making mobile augmented reality easy for developers, I think we are heading in the right direction. Certainly the forthcoming updates of ARKit and Vision frameworks will bring better performance, less battery consumption and more accuracy. We’re counting down the days until the official release in September.

Creating a good augmented reality experience requires a fairly large amount of domain knowledge. The designers will have a field day with this for sure. As from the developer’s point of view, thankfully the tools like ARKit or Vision take care of much of that fuss and sew it all up neatly, even though they’re still quite unpolished.

Have you ever dreamed about virtually designing your living room in real time, or depicting bedtime stories right on the floor in your kids’ bedroom? People around the globe already express incredible passion about the new toys from Apple. This fall AppStore will surely dazzle us with tons of fun and useful concepts.

Stay tuned for the Web AR experiment coming soon!

Further reads:


MonikaSoftware Engineer


Sign in and expect sharp insights, recommendations, ebooks and fascinating project stories delivered to your inbox

The controller of the personal data that you are about to provide in the above form will be Polidea sp. z o.o. with its registered office in Warsaw at ul. Przeskok 2, 00-032 Warsaw, KRS number: 0000330954, tel.: 0048 795 536 436, email: (“Polidea”). We will process your personal data based on our legitimate interest and/or your consent. Providing your personal data is not obligatory, but necessary for Polidea to respond to you in relation to your question and/or request. If you gave us consent to call you on the telephone, you may revoke the consent at any time by contacting Polidea via telephone or email. You can find detailed information about the processing of your personal data in relation to the above contact form, including your rights relating to the processing, HERE.

Data controller:

The controller of your personal data is Polidea sp. z o.o. with its registered office in Warsaw at ul. Przeskok 2, 00-032 Warsaw, KRS number: 0000330954, tel.: [0048795536436], email: [] (“Polidea”)

Purpose and legal bases for processing:


Used abbreviations:

GDPR – Regulation (EU) 2016/679 of the European Parliament and of the Council of 27 April 2016
on the protection of natural persons with regard to the processing of personal data and on the free movement
of such data, and repealing Directive 95/46/EC (General Data Protection Regulation)

ARES – Polish Act on Rendering Electronic Services dated 18 July 2002

TL – Polish Telecommunications Law dated 16 July 2004

1)        sending to the given email address a newsletter including information on Polidea’s new projects, products, services, organised events and/or general insights from the mobile app business world |art. 6.1 a) GDPR, art. 10.2 ARES and art. 172.1 TL (upon your consent)

Personal data:name, email address

2)       statistical, analytical and reporting purposes |art. 6. 1 f) GDPR (based on legitimate interests pursued by Polidea, consisting in analysing the way our services are used and adjusting them to our clients’ needs, as well as developing new services)

Personal data:name, email address

Withdrawal of consent:

You may withdraw your consent to process your personal data at any time.

Withdrawal of the consent is possible solely in the scope of processing performed based on the consent. Polidea is authorised to process your personal data after you withdraw your consent if it has another legal basis for the processing, for the purposes covered by that legal basis.

Categories of recipients:

Your personal data may be shared with:

1)       authorised employees and/or contractors of Polidea

2)       persons or entities providing particular services to Polidea (accounting, legal, IT, marketing and advertising services) – in the scope required for those persons or entities to provide those services to Polidea


Retention period:

1)       For the purpose of sending newsletter to the given email address – for as long as the relevant consent is not withdrawn

2)       For statistical, analytical and reporting purposes – for as long as the relevant consent is not withdrawn

Your rights:


Used abbreviation:

GDPR – Regulation (EU) 2016/679 of the European Parliament and of the Council of 27 April 2016
on the protection of natural persons with regard to the processing of personal data and on the free movement
of such data, and repealing Directive 95/46/EC (General Data Protection Regulation)

According to GDPR, you have the following rights relating to the processing of your personal data, exercised by contacting Polidea via [e-mail, phone].

1)       to access to your personal data (art. 15 GDPR) by requesting sharing and/or sending a copy of all your personal data processed by Polidea

2)       to request rectification of inaccurate personal data
(art. 16 GDPR) by indicating the data requiring rectification

3)       to request erasure of your persona data (art. 17 GDPR); Polidea has the rights to refuse erasing the personal data in specific circumstances provided by law

4)       to request restriction of processing of your personal data (art. 18 GDPR) by indicating the data which should be restricted

5)       to move your personal data (art. 20 GDPR) by requesting preparation and transfer by Polidea of the personal data that you provided to Polidea to you or another controller in a structured, commonly used machine-readable format

6)       to object to processing your personal data conducted based on art. 6.1 e) or f) GDPR, on grounds relating to your particular situation (art. 21 GDPR)

7)       to lodge a complaint with a supervisory authority,
in particular in the EU member state of your habitual residence, place of work or place of the alleged infringement if you consider that the processing
of personal data relating to you infringes the GDPR
(art. 77.1 GDPR)

No obligation to provide data:

Providing your personal data is not obligatory, but necessary for Polidea to provide you the newsletter service

Refusal to provide the above data will result in inability to receive the newsletter service.


In the process of providing the newsletter service, we make decisions in an automated way, including profiling, based on the data you provide.


“Profiling” means automated processing of personal data consisting of the use of your personal data to evaluate certain personal aspects relating to you, in particular to analyze or predict aspects concerning your personal preferences and interests.


The automated decisions are taken based on the analysis of clicked and viewed content. They affect the targeting of specific newsletter content to selected users registered to receive the newsletter service, based on the anticipated interests of the recipient.